foxck016077

Posted on May 16 • Edited on May 25

Gmail OAuth client_id is not a secret â design notes for self-host Actors

#api #architecture #google #security

This question keeps coming up:

"My Gmail OAuth client_id got leaked. Is the system compromised?"

Short answer: client_id was never a secret. What you actually need to protect is the token, the client_secret (if your flow uses one), and the overall authorization exchange boundary.

This matters more in a self-host Actor scenario, because you are not shipping a single deployment â every user runs it their own way.

Public identifier, not sensitive credential

The OAuth client_id is closer to an application identifier than a password. It needs to be visible to the OAuth server and the front-end flow. In many scenarios it appears in places that are reasonable to see.

Treating client_id like a password and "hiding" it is usually a misunderstanding:

You cannot hide it (front-end, request URLs, logs all expose it)
Hiding it does not raise overall security
Worse, it can distract you from the real attack surface

The question worth asking is: if an attacker knows your client_id, what can they still do? If the answer is "nothing â they cannot complete an authorization exchange or obtain a valid token", your design is pointing in the right direction.

Where the security budget actually goes

For a self-host Actor, security is about flow integrity, not a single magical value. I focus on four layers:

redirect URI allowlist â only registered and verifiable callbacks
state / anti-CSRF â every authorization round-trip is bound to the originating session
token storage and rotation â least privilege, shortest exposure, revocable
tenant isolation â every token is namespaced to a tenant and never crosses boundaries

If these four layers are solid, client_id visibility is not a primary risk.

Common mistake: budget in the wrong place

I have seen projects spend real effort on "obfuscating client_id" while ignoring problems that actually cause incidents:

weak callback endpoint validation
tokens accidentally landing in logs with broad ACLs
error handlers leaking internal state to the caller
inconsistent multi-tenant key naming that lets one tenant trip into another's data

Those are the things that bite.

Security is not about mystique. It is about staying recoverable in the worst case.

What this means for multi-tenant Actor design

Once you accept "client_id is visible by design", the architecture gets cleaner. Attention naturally shifts to:

scope minimization â request only the permissions you truly need (in my Actor, gmail.readonly and nothing else)
explicit token lifecycle â when it renews, when it expires, when it can be revoked
auditable execution path â which tenant triggered which run when

These decisions directly affect product trust and how much firefighting future-you will be doing.

Self-host does not exempt you from threat modeling

A tempting shortcut: "It's self-host, so the risk lives with the user."

That is half right. Yes, deployment responsibility moves to the operator. But as the author you still owe secure defaults:

safe defaults rather than risky defaults
explicit documentation rather than verbal hints
observable errors rather than silent failures

Otherwise you are not reducing risk, you are just shifting it.

Documentation pattern

For OAuth-touching repos, I now write three things first:

Which values are public, which must stay private
The lifecycle and revocation path for every token
What a user can actually do when an authorization error happens

Two effects:

new users do not get blocked on a panic about the wrong thing
experienced reviewers can audit your security logic quickly

The clearer you are, the easier it is for the community to trust your project.

Open-source carries extra weight

In open source, a misleading security narrative is more dangerous than in a private product, because it gets copied.

If you frame client_id as "top secret", others will copy that posture and ship the same broken model with real problems intact.

I prefer to spell it out in the README:

client_id is expected to be visible
the real sensitive surface is token handling and flow protection
multi-tenant isolation is enforced through data structure and routing logic

That way, even a fork carries a less-wrong threat model forward.

Closing: stop asking "can I hide it"

The question I keep on a sticky note for OAuth design is:

"If this value gets seen, is the system still safe?"

If a single exposed value collapses the system, the problem is not secret management â it is brittle architecture.

For Gmail OAuth in a self-host Actor, client_id is not the protagonist. The real protagonist is a verifiable, revocable, isolated authorization system.

Source: foxck016077/apify-gmail-inbox-intel — MIT, why OAuth client IDs are public by design, plus self-host actor hardening notes for real deployments.

Discussion question: What misconception about OAuth client IDs do you see most often in self-hosted setups, and how do you correct it in docs?

📊 Cold-start update (4 days in): 7 articles, 1 star, 0 sales, 4 days — what an MIT open-source Apify Actor cold start actually looks like — design notes only matter if the cold-start mechanics deliver readers. The follow-up post has the actual traffic numbers.

Update (May 19): The Actor that uses this pattern is live: apify.com/foxck/gmail-inbox-intel — free MIT, paste 3 OAuth fields (client_id is in the public input schema as documented above), get stalled threads ranked. Companion PDF now pay-what-you-want from $1.

Day 7 update (later May 19): I shipped a product pivot — the Gumroad listing above is now a Self-Host Bundle for engineers (full Actor source + docker-compose.yml + 5-min OAuth setup), PWYW from $5 suggested $19. The original PDF still ships inside as a bonus. Same URL.

Day 7 write-up with the funnel audit that triggered the pivot: funnel audit found 7 of 9 articles had no buy link, then I pivoted the product.

Sample report preview: Friday Triage gist — anonymized 10-thread example of the $99 Done-For-You triage output. Grounded in r/sales 1tdngew (49 comments on re-engaging cold prospects) and r/smallbusiness 1td0827 (60-comment thread, top reply at 61 score: "holding 50 open loops in your head").

More from the shop:

Claude Code Mastery: The Reverse-Engineering Guide — $49, 19 pages, every env var / hook event / settings key extracted from the v2.1.90 binary
5 n8n Workflows that Save 10+ Hours/Week — $29, the bundle
AI Lead Auto-Responder — $39, Gmail → instant AI-classified reply

Read the latest checkpoint: Day 16 — +51 reader spike in 85 min, 0 sales

Day 18 — pbot v1 dev preview shipped

After 18 days of this ZERO-TEN cold start: $9 PDF killed at Day 17, pivoted to pbot — a one-click personal knowledge bot you install on your own machine. Talk to it from LINE / Telegram / Zalo on your phone.

v1 dev preview is real: 93 MB macOS .dmg packaged, 15k-chunk SQLite FTS5 queries in 0-3 ms, Anthropic real calls with source citations, daemon auto-start on boot. Day 18 deep dive: the 7-line bigram fix for Chinese search.

Join the pbot waitlist ($29 · first-100 get -30% → $20) →