The Four Questions I Ask Before Any Team Gets LLM Access: Cost, Safety, Access, Audit

#security #ai #llm #devops

TL;DR

LLM governance comes down to four questions per request: what's it costing (FinOps), is it safe (guardrails), who's allowed to make it (access control), and can we reconstruct it later (audit).
Budgets and guardrails both support an audit-first rollout: run in audit/observe mode to see what would have been blocked before you actually start blocking anything.
Policies should get stricter as you go from dev to staging to prod, not uniform everywhere - permissive dev environments and locked-down prod environments solve different problems.

Cost: can you cap it before it caps you

It's genuinely easy to blow up an LLM bill with a bug - an agent stuck in a retry loop, a prompt that keeps growing, a batch job someone forgot was scheduled. Budget limiting with a hard block is one option, but there's a middle step worth knowing about: audit mode, where you deploy a budget rule, watch real traffic against it for a full period, and only flip on enforcement once you trust the number. Useful when you genuinely don't know what a reasonable cap looks like yet and don't want to find out by breaking something.

Safety: is sensitive data leaving, and is the model behaving

This is the guardrails half of the equation: PII/PHI detection, prompt injection and jailbreak defense, content moderation, and general DLP, applied to what goes in and what comes back. TrueFoundry's built-in guardrails cover content moderation, PII detection, and prompt injection out of the box on the SaaS gateway (backed by Azure AI Content Safety and Azure AI Language under the hood) - worth noting these three specific ones aren't available if you're running the gateway fully self-hosted, where you'd reach for OpenAI Moderations, Bedrock Guardrails, or a bring-your-own-key option instead.

There's also a layer most people don't think of as a "guardrail" at first: fine-grained access control over which tools an agent can call, using something like Cedar or OPA policy language with default-deny. That matters more as agents start calling MCP tools that can actually do things, not just answer questions.

Access: who's allowed to call which models, how often

This is RBAC and rate limiting - scoped API keys per team or application, and per-user/per-model/per-application throttles. One pattern I like from here: pinning a production gateway so it only accepts requests tagged env: prod, using metadata that's injected automatically from virtual accounts rather than something a developer has to remember to set by hand. It means a dev build pointed at the wrong URL gets rejected instead of quietly running against prod.

Audit: can you answer "who did this" six months later

Every AI call, logged: who made it, what they sent, what came back. Not glamorous, but it's the difference between "we think this is what happened" and "here's exactly what happened" when someone asks during a compliance review or an incident postmortem.

The pattern across all four: audit before enforce, tighten as you go to prod

Two things show up repeatedly in how these are meant to be rolled out. First, both budgets and guardrails support an audit-only mode - deploy the rule, watch what it would have done against real traffic, then switch on enforcement once you trust it. That's a much less painful way to introduce a new policy than shipping it straight to blocking and finding out in production that your thresholds were wrong.

Second, policies should get progressively stricter from dev to staging to prod, not applied uniformly. A permissive dev gateway lets engineers iterate without guardrails getting in the way of testing edge cases; a locked-down prod gateway is where you actually want PII redaction and prompt injection defense running on every request. Applying prod-strictness everywhere just trains people to route around it.

What TrueFoundry's AI Gateway Provides Here

Everything in this post - virtual keys, RBAC, budgets, rate limits, audit logs, residency rules, and guardrails as enforced policy — is something TrueFoundry's AI Gateway expresses as configuration in one control plane. Access control defines who (users, teams, virtual accounts) may call which provider accounts and models; Personal Access Tokens and Virtual Account Tokens are how applications authenticate to the gateway instead of holding raw provider keys; rate-limit and budget configs apply per user, team, virtual account, model, or any custom metadata key; and guardrails — including Cedar and OPA as policy-as-code at the MCP-tool boundary — run as enforced rules at four lifecycle hooks.

TrueFoundry's AI Gateway is an enterprise-grade control plane that sits between your applications and 1,600+ models — across OpenAI, Anthropic, Google, AWS Bedrock, Azure OpenAI, and your own self-hosted models — behind a single OpenAI-compatible API. It turns the governance controls in this post into configuration rather than per-service code: Virtual Accounts for non-human production identity, Personal Access Tokens for development, RBAC scoped per provider account, rate-limit and budget rules expressed as YAML with per-user/per-team/per-model/per-metadata scopes, and policy-as-code guardrails at the MCP tool boundary.

Because the gateway already sits on every request and emits a complete trace for every call, it is also where compliance-grade audit becomes practical. Identity, policy decisions, guardrail outcomes, model choice, token counts, and cost all land on the same trace ID — visible in Request Traces in the UI, exportable through OpenTelemetry, and accessible by API for SIEM integration. The same gateway adds exact and semantic caching, fallbacks and retries, and observability dashboards, deploys as SaaS or in your VPC, on-prem, or air-gapped with SOC 2, HIPAA, and ITAR compliance, and is recognized in Gartner's Market Guide for AI Gateways.

Which of the four - cost, safety, access, or audit is the one your org has actually solved, and which one is still duct tape? I'd guess audit is the one most teams skip until they need it.