Virginia Nyambura Mwega

Posted on Jun 28 • Originally published at virginiamwegahashnodedev.hashnode.dev

Your WHERE clause is not a security boundary (multi-tenant RAG with pgvector + RLS)

#ai #database #security #postgres

TL;DR: app-layer filtering is a single point of failure. Push tenant isolation into Postgres with RLS — and watch out for the security definer trap in your vector-search function.

Your WHERE clause is not a security boundary

My app is an AI wellness coach for parents. Every user's data is about the most private thing they have: how they're actually coping. Their check-ins, their bad nights, the things they'd never say out loud. The whole product runs on retrieval — when someone talks to the coach, the system pulls their relevant history out of a vector store and grounds the response in it.

Which means the single scariest bug I can imagine isn't a crash. It's user A asking a question and the retrieval quietly returning a snippet of user B's private history. No error. No stack trace. Just one person's worst night surfacing in another person's conversation.

In a multi-tenant app, that bug is one forgotten line of code away at all times. Here's how I make sure it can't happen — and the part of it that no tutorial warns you about.

The obvious fix is a single point of failure

The instinctive way to keep tenants apart is to filter in your query:

``
sql
select * from embeddings
where user_id = $current_user
order by embedding <=> $query
limit 5;

This works. It also relies on me, a tired human, remembering to write where user_id = ... on every single query that ever touches that table, forever, across every feature, including the ones I haven't built yet.

That's not a security boundary. That's a promise. And the failure mode of a promise is that the day you forget it — or a new query path skips it, or a refactor drops it there is nothing underneath to catch you. The app returns the wrong tenant's data and looks completely healthy doing it. That's exactly the shape of bug I caught in my own audit once. I didn't want to rely on never making it again.

Isolation belongs in the database, not the application

The fix is to move the boundary down a layer, into Postgres itself, using Row Level Security. RLS lets the database enforce which rows a user is even allowed to see, regardless of what the query asks for.

``
sql
alter table embeddings enable row level security;

create policy "Users read their own embeddings"
on embeddings for select
using (auth.uid() = user_id);

Now the rule isn't "please remember to filter." The rule is: this user physically cannot select another user's rows, because the database won't return them. A query that forgets the filter still comes back isolated, because the isolation isn't in the query anymore — it's in the table.

This is defense in depth, the same principle security people have leaned on for decades. The app-layer filter is still there as the first line. RLS is the backstop that makes a mistake in that first line survivable instead of catastrophic. One layer can fail without the whole guarantee failing.

The pgvector trap nobody mentions

Here's where it gets interesting, and where I'd put real money that most "build RAG on Supabase" tutorials are quietly broken.

Vector similarity search is usually wrapped in a SQL function — a match_documents-style RPC so you can call it cleanly from your app and keep the ANN index happy:

``
sql
create function match_user_docs(query_embedding vector(1536), match_count int)
returns setof embeddings
language sql
as $$
select *
from embeddings
order by embedding <=> query_embedding
limit match_count;
$$;

The footgun is the function's security mode. If you mark a function security definer — and a lot of copy-pasted vector-search examples do, to smooth over permissions — it runs with the definer's privileges and bypasses the caller's RLS entirely. You carefully set up Row Level Security on the table, then call it through a function that turns that protection off, and you'd never know: the function returns results, the app works in the demo, and every tenant's vectors are quietly reachable through that one call.

The fix is boring and important: keep the search function security invoker so the caller's RLS still applies, or — if it genuinely has to be security definer — filter by auth.uid() inside the function and pin the search_path. The point is to never let the convenience wrapper become the hole in the wall you just built.

One more wrinkle: filtering and approximate search fight a little

There's a subtle performance interaction worth knowing. pgvector's index (HNSW or IVFFlat) does approximate nearest-neighbor search — it returns roughly the closest vectors, fast. Add RLS on top, and the isolation filter trims that candidate set down to the current tenant's rows.

If you ask the index for the global top 5 and then isolation removes the ones that aren't yours, you can end up with fewer than 5 results — or, in a busy table, none. The pattern is to over-fetch: ask the index for more candidates than you need, so that after isolation you still have enough to ground a good answer. It's a small thing that only shows up under real multi-tenant load, which is exactly why it's worth saying out loud.

The takeaway

The model gets all the attention, but the part of an AI app that has to be certain is rarely the model. Here, it's the data boundary. And a boundary you enforce in application code is only as strong as your memory on your worst day.

So I push it down to where it can't be forgotten. The app filters because it should. The database isolates because it must. One forgotten where clause should be a non-event, not a breach — and the only way to guarantee that is to stop trusting the query and start trusting the table.

Top comments (3)

Pon • Jun 29

Good on you for naming the security definer trap -- most people skip right past it. What I'd add from getting bitten by exactly this: the reason it stays invisible is that every test you write runs as one user, and a single-tenant test passes whether the isolation holds or not. Policy on, policy bypassed by a definer function -- same green either way. The only check that sees the hole is a second identity reading the first one's rows with an expected-empty result. And the bit that decays: someone adds a definer function six months later to smooth over permissions, the isolation is silently gone, and nothing turns red unless that cross-tenant test is standing there asserting the deny. Table-not-the-query is right; I'd just say the proof has to live at the same level -- a test that isn't you.

Virginia Nyambura Mwega • Jun 29

Ha, "a test that isn't you" I'm stealing that. Sums it up better than my whole post did.
One thing I'd add: that empty result can lie to you. A broken query comes back empty too, so the test goes green whether isolation held or you just fat-fingered the tenant id. So I've started pairing them in the same case tenant A has to see its own row, tenant B has to come back empty for A's. If only one side passes, something's off. Otherwise you're one typo from a false pass that looks exactly like the real thing.
And the pgvector angle makes your decay point worse. The ANN index covers every tenant's rows, so the similarity search is already pulling other tenants' rows back as candidates the only thing holding them out is the RLS predicate. Someone drops in a SECURITY DEFINER function six months later to smooth over permissions, those candidates come right back, nothing filtering them. No red, no warning. Standing test or it rots, like you said.

Pon • Jun 30

Stealing it right back -- the two-sided case is the version I should have written. One thing I'd add underneath it, same trap one level down: tenant B coming back empty only proves isolation if B owns a row your query would pull in the first place. If all of B's rows sit far from A's query vector, B comes back empty whether RLS holds or not, and you're green on nothing. pgvector makes that easy to do by accident, because the negative fixture has to be a near-neighbor -- B needs a row that ranks high for A's query, close enough that the index would hand it back the moment the predicate drops. So seed B with a deliberate look-alike, not just any row. Otherwise the deny passes because nothing was in range to leak, not because the boundary held. Same shape as yours: the test has to be able to fail, or it's decoration that happens to be green.