Tom Howland

Posted on Jun 10 • Edited on Jun 11 • Originally published at wambamboozle.github.io

OAuth for Remote MCP Servers

#oauth #mcp #security #ai

OAuth for Remote MCP Servers

How each AI assistant signs in to a remote MCP (Model Context Protocol) server, and why the flow differs by client and by where it runs.

Overview

The protocol throughout is standard OAuth 2.1 — an open, widely implemented authorization standard. The human sign-in runs through oauth2-proxy, one of the most widely deployed open-source auth proxies; the only deployment-specific piece is a thin, spec-conforming authorization server (the /oauth endpoints) that hands MCP clients their tokens. Every client ends up the same way — a person signs in against Google (restricted to your organization's domain), and the client holds a short-lived bearer token it presents on each /mcp call. Two things differ between assistants: where the client runs (the user's machine — local — vs. the vendor's cloud), which decides where the OAuth callback lands; and what kind of OAuth client it is — a public client proving itself with PKCE (Proof Key for Code Exchange, which lets a client with no secret prove the token request comes from the same client that started the flow), or a confidential client proving itself with a secret.

The participants

oauth2-proxy — the public-facing reverse proxy. It authenticates the human against Google (the sign-in restricted to your organization's domain) and forwards the verified identity to the app behind it. Only oauth2-proxy faces the internet. It is a mature, heavily-deployed open-source project — the standard way to put Google/OIDC (OpenID Connect) single sign-on in front of a service, widely used in Kubernetes deployments — so the most security-sensitive leg of the flow (the OAuth exchange with the identity provider) runs on battle-tested code.
The MCP server — the app on a loopback port behind the proxy. It plays two roles: the OAuth authorization server (/oauth/authorize, /oauth/token, /oauth/register, .well-known discovery) and the /mcp tool endpoint. It mints codes and tokens, and validates a token on every /mcp call.
Google — the identity provider oauth2-proxy delegates the actual sign-in to.
The MCP client — whatever holds the token and calls the tools: a local client (Claude Code, Cursor) on a machine, or a cloud client (Claude Desktop, Gemini Enterprise) on the vendor's servers.
The user's browser — where the person completes the Google sign-in.

The two axes

Local vs. cloud — where the client runs

Local clients (Claude Code, Cursor) run on the user's machine and receive the OAuth callback on loopback (127.0.0.1) — the authorization code never leaves the machine.

Cloud clients (Claude Desktop, Gemini Enterprise) run in the vendor's cloud, so the callback is a registered vendor URL — which is why the server has to be reachable from the internet.

Public vs. confidential — the OAuth client type

Public clients self-register at runtime (dynamic registration) and prove themselves with PKCE, no secret — Claude Code, Cursor, and Claude Desktop.

A confidential client is pre-provisioned once with a client_id + secret and proves itself with that secret — only Gemini Enterprise.

The configurations at a glance

Assistant	Client runs	OAuth callback	OAuth client	Proves itself with	Notes / requirements
Claude Code (CLI)	local machine	loopback	dynamic, public	PKCE	works once registered
Cursor (IDE)	local machine	loopback	dynamic, public	PKCE	works via the `mcp-remote` shim *
Claude Desktop	Anthropic cloud	vendor URL (`claude.ai`)	dynamic, public	PKCE	works once registered
Gemini Enterprise	Google cloud	vendor URL (Google)	pre-provisioned, confidential	client secret	requires a Gemini Enterprise license + admin connector registration
Cursor Cloud Agents	Cursor cloud	vendor URL	dynamic / static	PKCE / secret	requires a Cursor team admin to add the server

* Cursor's local IDE connects through the mcp-remote shim — see its section.

Claude Code & Cursor — local

This is the local baseline: the client runs on the user's machine, so it completes the full OAuth flow with a loopback callback (127.0.0.1). It self-registers (dynamic registration), the loopback redirect is allowlisted, and the authorization code never leaves the machine. It is a public client: no secret — PKCE ties the token request back to the same client that started the flow.

Local PKCE flow — everything but the Google sign-in stays on the user's machine.

The Cursor exception

Cursor follows the same local flow, but a known Cursor bug stops the IDE from opening the browser after it registers — so the sign-in step never starts. The workaround is the mcp-remote shim (npx -y mcp-remote@latest https://mcp.example.com/mcp), which runs the OAuth flow itself and hands Cursor a working connection. Nothing on the server changes.

Claude Desktop — cloud

Claude Desktop's connector runs in Anthropic's cloud. It is still a public client: it discovers the server's endpoints and registers itself dynamically (PKCE, no secret), exactly like the local clients — the only difference is that the callback is a cloud URL (claude.ai) instead of loopback, so the authorization code — minted by your server, not by Google — transits Anthropic's servers. The person still signs in with their organization-domain Google account in the browser.

Cloud PKCE flow — like the local one, but the callback and token-bearing calls originate from Anthropic's cloud, so the server must be reachable from the internet.

Gemini Enterprise — cloud, confidential (requires license + admin registration)

Gemini Enterprise is the one confidential client. Instead of registering itself at runtime, an admin mints a client_id + secret once (out of band) and enters them into the Gemini Enterprise connector config. The connector runs in Google's cloud. The human still signs in (legs 1–6); then, server-to-server with no browser, Google's cloud exchanges the code for a token using its secret (leg 7) and calls /mcp (leg 9). This path requires a Gemini Enterprise license and admin registration of the connector on the Google side.

Confidential cloud flow — the connector proves itself with a pre-shared secret at token exchange (leg 7) rather than PKCE. Note the two "Googles": Google cloud is the connector (the OAuth client, redirecting through vertexaisearch.cloud.google.com); Google is the identity provider that signs the person in.

Cursor Cloud Agents — cloud (requires team admin)

Cursor's Cloud Agents would connect from Cursor's cloud like Gemini Enterprise and Claude Desktop (Streamable HTTP, with OAuth). But adding the server is gated by Cursor's own permissions — only a Cursor team admin can add an MCP server to the team ("Only team admins can manage the default team marketplace"). Until an admin adds the server, no OAuth flow runs, so there is no completed flow to diagram. This is a vendor-side gate, not a property of the OAuth design.

Connecting a client

What to type into each assistant once the server is deployed. Every path
ends the same way: a browser opens and the person signs in with their
organization Google account.

Claude Desktop — Settings → Connectors → Add custom connector; paste https://mcp.example.com/mcp and click Connect.
Claude Code — claude mcp add --transport http my-server https://mcp.example.com/mcp, then run /mcp and choose Authenticate.
Cursor — configure the server through the mcp-remote shim in ~/.cursor/mcp.json, then toggle it off and back on under Settings → Tools & Integrations → MCP:

{
  "mcpServers": {
    "my-server": {
      "command": "npx",
      "args": ["-y", "mcp-remote@latest", "https://mcp.example.com/mcp"]
    }
  }
}

Gemini Enterprise — an admin registers the connector in Gemini Enterprise with the pre-provisioned client id and secret and enables it for the team; each person authorizes once from the Gemini side panel.

References

The OAuth 2.1 backbone here is well-trodden: the authorization flow, PKCE, dynamic client registration, and Protected Resource Metadata discovery all follow the published standard and the common explainers.

Authorization — Model Context Protocol specification — the source of truth: OAuth 2.1, PKCE (S256) for every client, and the 401 → metadata discovery sequence.
Remote MCP in the Real World: OAuth 2.1, Dynamic Client Registration, and Protected Resource Metadata
OAuth 2.1 for Remote MCP Servers — Streamable HTTP explained (2026)
Scalekit — Secure your MCP servers with OAuth 2.1 · Aembit — MCP, OAuth 2.1, PKCE, and the Future of AI Authorization
Claude Connector Authentication: How OAuth Works and When You Need It and Anthropic — MCP connector — the Claude Desktop leg.

The takeaway

The protocol is not the interesting part — the standard is borrowed and identical for every client. What varies, and what this comparison maps, are the two axes that decide everything else: where the client runs (local vs. cloud), which fixes where the OAuth callback lands; and how it proves itself (PKCE vs. a pre-shared secret), which fixes whether it can self-register or must be provisioned by an admin. The deployment shape that makes this work is worth naming: front the server with oauth2-proxy for the Google sign-in, place a thin spec-conforming authorization server behind it, and serve every client from a single internet-reachable host — the OAuth callback must be public for cloud clients anyway, and one host keeps the topology simple. Authentication, not network placement, is the boundary. Within that shape, only a confidential client (Gemini Enterprise) needs a pre-shared secret, and the practical friction is rarely the protocol — it is vendor-side gates such as a client browser-open bug or a team-admin permission on adding the server.

Top comments (3)

Alex Shev • Jun 11

Remote MCP makes auth unavoidable. The tool call is not just a convenience layer; it is an authority boundary.

For developer workflows, I want the terminal to show exactly what identity is being used, what scope was granted, and what artifact proves the call was legitimate. Otherwise agent tooling becomes hard to audit fast.

Tom Howland • Jun 11

Agreed on the framing — "authority boundary" is exactly right, and it's why the article leans so hard on the server picking its trust anchors deliberately.

The pieces you're asking for mostly exist today, but on the wrong side of the wire. The bearer JWT is the artifact — identity, scope, expiry, all decodable — and a server can log every tool call under the resolved user (ours does). We also exposed a whoami tool so a session's effective identity is one question away.

What's missing is the client surfacing any of it. No MCP client I've used shows "this call went out as X with scope Y" in the terminal. Nothing in the spec prevents it — the client holds the token and could render its claims next to every call. Until clients do, the auditable record lives server-side, and the terminal is the blind spot. I'd like to see that become table stakes in agent tooling too.

Alex Shev • Jun 11

That is the exact gap I keep coming back to: the server can be disciplined and still leave the operator with no practical visibility at the moment of execution.

A minimal client-side pattern could be very simple: show the resolved actor, token expiry, requested scope, and server identity beside the tool call before/while it runs. Not a full security dashboard, just enough context that a developer can spot “wrong account / wrong workspace / wrong privilege” before the agent acts.

Server logs are necessary for audit, but the terminal needs a small amount of live provenance too. Otherwise the audit trail is only useful after the damage is already done.