


OAuth Token Management for SaaS Integrations - The Patterns That Don't Break at 3am

OAuth tokens do not fail on schedule. They expire during jobs, race during refresh, and get revoked without warning. This guide covers the multi-tenant patterns that keep SaaS integrations running.


OAuth token management looks easy when you only have one connection.

You redirect the user to the provider, exchange the authorization code, store the access token and refresh token, and call the API. Maybe you even ship the first integration in a day or two.

Then production happens.

A token expires halfway through a delivery batch. Two workers refresh the same connection at the same time. A customer revokes access in the provider UI and your system keeps retrying with a dead credential until someone gets paged. One provider rotates refresh tokens. Another only returns one on the original grant. A third changes behavior depending on connected-app policy or consent mode.

This is why OAuth token handling becomes operational infrastructure long before it feels like it should.

For customer-facing integrations, a token store is not enough. You need a token lifecycle strategy.

The OAuth lifecycle in an integration system

At a high level, the flow is familiar:

  1. The customer authorizes your app.
  2. Your backend exchanges the authorization code for tokens.
  3. You use the access token to call the provider API.
  4. The access token expires.
  5. You use the refresh token to get a new access token.
  6. Eventually the connection is revoked, narrowed, or needs re-authorization.
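Steps 2 and 5 are both POSTs to the provider's token endpoint. Here is a minimal sketch of step 2, the code exchange, with an injectable fetch so it can be exercised without a live provider; the token URL and credentials are placeholders, not real values:

```typescript
// Exchange an authorization code for tokens (step 2 of the lifecycle).
// The token URL and client credentials are placeholders; `doFetch` is
// injectable so the function can be tested without a live provider.

type TokenResponse = {
  access_token: string;
  refresh_token?: string; // many providers only return this on the first grant
  expires_in?: number;    // seconds until the access token expires
  scope?: string;
};

async function exchangeAuthorizationCode(
  tokenUrl: string,
  params: { code: string; clientId: string; clientSecret: string; redirectUri: string },
  doFetch: typeof fetch = fetch, // global fetch in Node 18+
): Promise<TokenResponse> {
  const res = await doFetch(tokenUrl, {
    method: 'POST',
    headers: { 'content-type': 'application/x-www-form-urlencoded' },
    body: new URLSearchParams({
      grant_type: 'authorization_code',
      code: params.code,
      client_id: params.clientId,
      client_secret: params.clientSecret,
      redirect_uri: params.redirectUri,
    }),
  });

  if (!res.ok) {
    throw new Error(`token exchange failed: ${res.status}`);
  }
  return (await res.json()) as TokenResponse;
}
```

The refresh call in step 5 is the same POST with `grant_type=refresh_token`, which is why both paths usually end up sharing one provider adapter.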

That description is correct, but it hides what matters in production: the token is attached to a long-lived customer connection, not a one-off login session.

That means your system has to answer questions like:

  • Where do token records live?
  • How do you encrypt them?
  • How do you know when to refresh?
  • What happens if two workers refresh at once?
  • How do you distinguish a transient 500 from a permanent invalid_grant-style auth failure?
  • How do you stop a broken connection from creating an on-call storm?

Where production OAuth systems break

Most OAuth incidents are not caused by misunderstanding the protocol. They come from treating the protocol as if it were the whole problem.

Expiry during delivery

An access token can expire between queueing the work and actually performing the API call. If your jobs sit in a queue for a few minutes, "token was valid when enqueued" is meaningless by the time the worker runs.

The fix is simple in principle: validate freshness immediately before the provider call, not only at connection time.

Refresh race conditions

This is one of the most common failures in multi-worker systems.

Imagine five jobs for the same customer all start within a few seconds. Each worker sees an almost-expired token. Each worker independently decides to refresh it. Now you have five refresh attempts and five competing writes.

That creates multiple problems:

  • wasted provider calls
  • noisy logs and rate-limit pressure
  • last-write-wins token corruption
  • a stale refresh token persisted when one response rotates the token and a slower worker overwrites it with the older version

OAuth token refresh needs single-flight behavior per connection. One worker refreshes. The others wait or reuse the new result.
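Within a single process, single-flight can be as small as a promise map keyed by connection ID. This is a sketch; it deduplicates refreshes inside one process only and does not replace the cross-process lock you still need when multiple workers share a database:

```typescript
// Single-flight refresh within one process: concurrent callers for the same
// connection share one in-flight refresh instead of racing. Across multiple
// worker processes you still need a database or distributed lock.

const inFlight = new Map<string, Promise<string>>();

async function refreshSingleFlight(
  connectionId: string,
  doRefresh: () => Promise<string>, // provider-specific refresh, returns a new access token
): Promise<string> {
  const existing = inFlight.get(connectionId);
  if (existing) return existing; // reuse the refresh already in progress

  const p = doRefresh().finally(() => inFlight.delete(connectionId));
  inFlight.set(connectionId, p);
  return p;
}
```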

Revoked or narrowed access

Customers disconnect integrations. Admins change scopes. Security teams revoke consent. Sometimes the provider invalidates the refresh token. Sometimes the token is still structurally present but no longer authorized for the API calls you need.

This is not a retry problem. It is a connection state problem.

Once the failure is clearly permanent, your system should stop retrying normal delivery attempts and move the connection into a re-auth-required state with a clear message for the customer.

Provider quirks

OAuth is a standard. Provider implementations are not.

Some providers rotate refresh tokens on refresh. Some only return a refresh token on the original authorization flow. Some expect "offline" access semantics. Some tie token longevity to app policy or consent mode. Some return 401 for expired access tokens and a different structured error for revoked refresh tokens.

If you build one generic refresh function and assume every provider behaves the same, that function will eventually wake you up at 3am.

Multi-tenant OAuth realities

The moment your customers connect their own SaaS tools, OAuth becomes a storage and isolation problem.

Each connection needs its own token state:

  • provider
  • workspace or tenant
  • provider account identifier
  • access token
  • refresh token
  • expiry time
  • scopes
  • last successful refresh timestamp
  • last auth failure
  • re-auth required state

One customer's refresh token should never be fetched using another customer's identifiers. One tenant's auth failures should not flood global retry queues. One customer's revoked consent should not block unrelated deliveries for other tenants.

The challenge is not only "how to call /token." It is how to manage thousands of long-lived OAuth connections safely.
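One way to enforce that isolation is to make the tenant context part of every lookup key, so a read without a workspace ID cannot even be expressed. A sketch, with an in-memory map standing in for the real table:

```typescript
// Tenant-scoped lookup: every read requires both the workspace ID and the
// connection ID, so one tenant's code path cannot fetch another tenant's
// token. The in-memory store is a stand-in for a table keyed the same way.

type StoredConnection = {
  workspaceId: string;
  connectionId: string;
  refreshTokenCiphertext: string;
};

const store = new Map<string, StoredConnection>();

function keyFor(workspaceId: string, connectionId: string): string {
  return `${workspaceId}:${connectionId}`;
}

function saveConnection(c: StoredConnection): void {
  store.set(keyFor(c.workspaceId, c.connectionId), c);
}

function getConnection(workspaceId: string, connectionId: string): StoredConnection | undefined {
  // A connection ID alone is never enough; the tenant context is part of the key.
  return store.get(keyFor(workspaceId, connectionId));
}
```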

If you are already dealing with the broader isolation side of integrations, the same architecture problem shows up in multi-tenant integration infrastructure.

Practical patterns that hold up in production

The best token systems are conservative. They make fewer assumptions, serialize the risky transitions, and keep enough state to explain what happened later.

1. Model the connection explicitly

Treat the token record like a durable connection object, not a loose blob of JSON in a table.

Useful fields usually include:

type OAuthConnection = {
  id: string;
  workspaceId: string;
  provider: 'hubspot' | 'salesforce' | 'google';
  providerAccountId: string;
  accessTokenCiphertext: string;
  refreshTokenCiphertext: string | null;
  scope: string[];
  expiresAt: string | null;
  refreshAfter: string | null;
  refreshedAt: string | null;
  lastRefreshError: string | null;
  reauthRequiredAt: string | null;
  version: number;
};

The exact shape varies, but two ideas matter:

  • keep enough metadata to make refresh decisions without guessing
  • keep explicit connection state like reauthRequiredAt instead of encoding everything in log messages

2. Refresh proactively, not only on 401

Waiting for the provider to reject the call is the easiest strategy to implement and one of the noisiest to operate.

If you know a token expires at 10:00:00, do not schedule work that first learns this at 10:00:03 during a customer-visible delivery attempt.

A safer pattern is to define a refresh buffer:

  • refresh when the token is within 5 to 10 minutes of expiry
  • refresh earlier for long-running jobs or large batches
  • keep a provider-specific override when a provider issues especially short-lived tokens

This turns refresh from an interrupt into regular maintenance.
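A sketch of that buffer logic, with illustrative values; the 5-minute default and the per-provider override are assumptions for the example, not recommendations from any provider:

```typescript
// Proactive refresh scheduling: refresh a fixed buffer before expiry, with a
// per-provider override for unusually short-lived tokens. Buffer values are
// illustrative placeholders.

const DEFAULT_BUFFER_MS = 5 * 60 * 1000; // refresh 5 minutes before expiry

const PROVIDER_BUFFER_MS: Record<string, number> = {
  // hypothetical override for a provider with especially short-lived tokens
  shortlived: 2 * 60 * 1000,
};

function computeRefreshAfter(expiresAt: Date, provider: string): Date {
  const buffer = PROVIDER_BUFFER_MS[provider] ?? DEFAULT_BUFFER_MS;
  return new Date(expiresAt.getTime() - buffer);
}

function needsRefresh(expiresAt: Date | null, provider: string, now = new Date()): boolean {
  if (!expiresAt) return false; // no expiry metadata: refresh only on auth failure
  return now >= computeRefreshAfter(expiresAt, provider);
}
```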

3. Serialize refresh with a lock

The simplest way to avoid refresh stampedes is to use a row lock or distributed lock keyed by connection ID.

Here is a realistic Node.js pattern using a database transaction and FOR UPDATE:

async function getValidAccessToken(connectionId: string) {
  // Assumed helpers: `db` (a transactional client such as pg-promise),
  // `encrypt`/`decrypt` for token ciphertext, `addMinutes` (e.g. from
  // date-fns), and a provider-aware `refreshWithProvider`. The client is
  // assumed to map snake_case columns onto OAuthConnection's camelCase fields.
  return db.transaction(async (tx) => {
    const connection = await tx.one<OAuthConnection>(
      `
        select *
        from oauth_connections
        where id = $1
        for update
      `,
      [connectionId],
    );

    if (connection.reauthRequiredAt) {
      throw new Error('Connection requires re-authorization');
    }

    const refreshDeadline = addMinutes(new Date(), 5);

    if (connection.expiresAt && new Date(connection.expiresAt) > refreshDeadline) {
      return decrypt(connection.accessTokenCiphertext);
    }

    if (!connection.refreshTokenCiphertext) {
      // Nearly expired and no refresh token stored: only re-auth can fix this.
      throw new Error('Connection requires re-authorization');
    }

    const refreshed = await refreshWithProvider({
      provider: connection.provider,
      refreshToken: decrypt(connection.refreshTokenCiphertext),
    });

    await tx.query(
      `
        update oauth_connections
        set
          access_token_ciphertext = $2,
          refresh_token_ciphertext = coalesce($3, refresh_token_ciphertext),
          expires_at = $4,
          refreshed_at = now(),
          last_refresh_error = null,
          version = version + 1
        where id = $1
      `,
      [
        connectionId,
        encrypt(refreshed.accessToken),
        refreshed.refreshToken ? encrypt(refreshed.refreshToken) : null,
        refreshed.expiresAt,
      ],
    );

    return refreshed.accessToken;
  });
}

Two details are doing real work here:

  • the connection row is locked while refresh happens
  • a missing refresh_token in the refresh response does not blindly erase the stored one

That second rule matters because providers differ. Some rotate refresh tokens. Some do not. Some only return them in particular circumstances.

4. Design for overlap windows

Refresh is not an instantaneous global state transition.

One worker may still have the old access token in memory while another has already stored the new one. If the old token has not expired yet, that is usually fine. Your system should tolerate a small overlap window instead of assuming every token read switches everywhere at once.

What you want to avoid is a more dangerous overlap: stale writes that replace a newer refresh token with an older one. Versioning or row locks protect you from that.
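The version field from the connection model above is enough to implement this. Here is a sketch of the optimistic check against an in-memory row; in SQL the same rule is `update ... set version = version + 1 where id = $1 and version = $2`, followed by a check on the affected row count:

```typescript
// Optimistic versioning: a refresh result is only written if the stored row
// still has the version the refresher originally read. A worker that raced
// and lost cannot overwrite a newer refresh token with an older one.

type TokenRow = { version: number; refreshTokenCiphertext: string };

function applyRefresh(row: TokenRow, readVersion: number, newCiphertext: string): boolean {
  if (row.version !== readVersion) {
    return false; // someone refreshed after we read; discard our stale result
  }
  row.refreshTokenCiphertext = newCiphertext;
  row.version += 1;
  return true;
}
```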

5. Add an auth-failure circuit breaker

Not every failure deserves another refresh attempt.

When a provider is down or timing out, retries make sense. When the error clearly means "this connection is no longer authorized," the correct action is usually:

  1. mark the connection as needing re-auth
  2. stop normal background retries for that connection
  3. surface the issue to the customer with a specific remediation path

Without this, one revoked connection can create endless refresh traffic and noisy delivery failures.
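A sketch of the classification step, assuming a normalized failure shape. Real providers vary in both status codes and error bodies, so treat this mapping as a starting point rather than a complete table:

```typescript
// Classify provider auth failures: clearly permanent ones (invalid_grant on
// the token endpoint, 401 on refresh) move the connection to re-auth-required;
// everything else stays retryable. The mapping is representative, not
// exhaustive; ambiguous errors deserve provider-specific handling.

type AuthFailure = { status?: number; oauthError?: string };

type FailureAction = 'retry' | 'require_reauth';

function classifyAuthFailure(f: AuthFailure): FailureAction {
  // invalid_grant is the standard OAuth error body for a dead refresh token
  if (f.oauthError === 'invalid_grant') return 'require_reauth';
  // 401 on a refresh attempt usually means the credential itself is bad
  if (f.status === 401) return 'require_reauth';
  // 429, 5xx, and timeouts are the provider's problem; back off and retry
  return 'retry';
}
```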

6. Encrypt tokens and scope every read

This should be the baseline:

  • encrypt tokens at rest
  • decrypt only in the code path that needs them
  • scope every lookup by tenant or workspace plus connection ID
  • keep audit visibility around administrative changes and connection state transitions

The security goal is not only "tokens are encrypted in the database." It is also "the wrong tenant context cannot fetch the wrong token."
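A minimal sketch of at-rest encryption using Node's built-in AES-256-GCM. In production the key would come from a KMS or secret manager; here it is generated inline so the example is self-contained:

```typescript
import { createCipheriv, createDecipheriv, randomBytes } from 'node:crypto';

// Encrypt token material at rest with AES-256-GCM. The inline key is a
// stand-in for one fetched from a KMS or secret manager.

const key = randomBytes(32);

function encryptToken(plaintext: string): string {
  const iv = randomBytes(12); // standard GCM nonce size
  const cipher = createCipheriv('aes-256-gcm', key, iv);
  const ciphertext = Buffer.concat([cipher.update(plaintext, 'utf8'), cipher.final()]);
  // store iv + auth tag + ciphertext together as one opaque value
  return Buffer.concat([iv, cipher.getAuthTag(), ciphertext]).toString('base64');
}

function decryptToken(stored: string): string {
  const buf = Buffer.from(stored, 'base64');
  const iv = buf.subarray(0, 12);
  const tag = buf.subarray(12, 28); // GCM auth tag is 16 bytes
  const ciphertext = buf.subarray(28);
  const decipher = createDecipheriv('aes-256-gcm', key, iv);
  decipher.setAuthTag(tag);
  return Buffer.concat([decipher.update(ciphertext), decipher.final()]).toString('utf8');
}
```

Because GCM authenticates as well as encrypts, a tampered ciphertext fails at `final()` instead of decrypting to garbage.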

Provider-specific gotchas worth planning for

Google: offline access and refresh-token loss conditions

Google's OAuth docs explicitly distinguish online access, tied to an active user session, from offline access for background work. If you need a refresh token for background work, you have to request offline access during the authorization flow.

Google also documents several cases where refresh tokens stop working, including user revocation and inactivity-related expiration conditions. If you treat refresh tokens as permanent until proven otherwise, you will eventually discover they are not.

That means your connection model should assume refresh token loss is normal, detectable, and recoverable through re-authorization.

HubSpot: short-lived access tokens and token response metadata

HubSpot's OAuth token responses include expiry metadata, and HubSpot's docs tell you to use the refresh token to obtain new access tokens as they expire. That makes it a good example of why "refresh only after a 401" is not a great operating model.

If the provider already tells you when the token expires, use that information to refresh before a queued delivery job turns expiry into a customer-facing failure.
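One concrete detail: token responses report relative expiry (`expires_in`, in seconds), so convert it to an absolute timestamp at storage time, with a small margin for clock skew and request latency. A sketch:

```typescript
// Convert a provider's relative `expires_in` (seconds) into an absolute
// expiry timestamp at storage time. The skew margin is an illustrative
// safety buffer for clock drift and request latency.

function absoluteExpiry(
  expiresInSeconds: number,
  receivedAt: Date = new Date(),
  skewMs = 30_000,
): Date {
  return new Date(receivedAt.getTime() + expiresInSeconds * 1000 - skewMs);
}
```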

Providers differ on refresh-token behavior

This is the broad rule underneath the provider-specific details: do not hard-code one refresh-token assumption across every integration.

Different providers vary on:

  • whether refresh tokens rotate
  • whether they are only issued on initial consent
  • what error shape signals revocation
  • what policies govern how long they remain valid

That is why OAuth token management becomes provider-aware infrastructure instead of a single generic helper.

If you are adding provider-specific integrations today, keep the implementation details in provider adapters or configuration, not scattered through delivery jobs.
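A sketch of what such a provider profile can look like. The flag values here are placeholders to show the shape, not statements about how any real provider behaves; fill them in from each provider's own documentation:

```typescript
// Provider differences live in one profile table instead of being scattered
// through delivery jobs. All values below are illustrative placeholders.

type ProviderProfile = {
  rotatesRefreshToken: boolean;   // does a refresh response include a new refresh token?
  refreshTokenOnInitialGrantOnly: boolean;
  revocationErrors: Set<string>;  // error strings that mean "re-auth required"
};

const providers: Record<string, ProviderProfile> = {
  provider_a: {
    rotatesRefreshToken: false,
    refreshTokenOnInitialGrantOnly: true,
    revocationErrors: new Set(['invalid_grant']),
  },
  provider_b: {
    rotatesRefreshToken: true,
    refreshTokenOnInitialGrantOnly: false,
    revocationErrors: new Set(['invalid_grant', 'consent_revoked']),
  },
};

function isRevocation(provider: string, oauthError: string): boolean {
  // Unknown providers default to retryable rather than killing the connection.
  return providers[provider]?.revocationErrors.has(oauthError) ?? false;
}
```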

When infrastructure helps

For a handful of internal integrations, you can keep token management inside your app and do it well.

For customer-facing SaaS integrations, the pain usually compounds when:

  • every customer has their own OAuth connection
  • background deliveries depend on long-lived tokens
  • you support multiple providers with different behaviors
  • support needs visibility into why a connection stopped working

That is the point where infrastructure earns its keep. Not because OAuth is impossible to implement, but because the combination of encrypted storage, refresh scheduling, tenant isolation, re-auth state, and delivery coordination becomes a platform concern.

Meshes can be part of that layer. Its published product model includes per-workspace connections and built-in OAuth token handling, which is useful when you want delivery and connection management to live in the same system. But the design advice in this post still applies even if you build everything yourself: model connection state explicitly, refresh proactively, serialize refreshes, and stop retrying dead credentials forever.

If you want to see how these connection problems fit into the bigger integration architecture, the most relevant companion piece is multi-tenant integration architecture. If you are working with specific providers, the current docs pages for HubSpot and Salesforce are the better place to look for provider-facing setup details.

The takeaway

OAuth token management fails in production for very predictable reasons:

  • tokens expire during background work
  • refreshes race
  • revocations look like generic failures until you model them properly
  • provider differences leak into your infrastructure whether you wanted them to or not

The teams that stay asleep at 3am are not the teams with the most OAuth code. They are the teams with the clearest token lifecycle model.

Treat tokens like long-lived integration state, not like session cookies with better marketing, and your system gets much easier to reason about.

Need OAuth connection management without building the whole control plane yourself? Join Meshes and keep token handling, delivery, and workspace-scoped connections in one system.