Kong AI Gateway vs TrueFoundry: the honest version of this comparison

#ai #apigateway #mcp #llm

Kong is one of the most battle-tested infrastructure projects in the industry. NGINX/OpenResty under the hood, tens of thousands of production deployments, a plugin ecosystem that's been built and debugged across a decade of real-world traffic. If your team already runs Kong for API management, you have expertise, tooling, and institutional knowledge that has real value. That's not a small thing.

This comparison is a more specific question: when you need AI routing and governance, does extending Kong give you what you need, or does the general-purpose architecture start working against you at AI-specific requirements? The answer genuinely depends on where you're starting from.

Also: Kong released Agent Gateway in April 2026. One claim you'll see repeatedly on comparison pages - including versions of TrueFoundry's own is that Kong has no native async execution for agent loops. As of v3.14, that's outdated. I'll cover what actually changed.

Where Kong is genuinely strong

Ecosystem depth and maturity. Kong's plugin library covers authentication, rate limiting, logging, transformation, and security patterns that have been validated in production at serious scale. This isn't academic — teams that already run Kong for REST APIs get all of that for AI traffic too, with no new vendor relationship.

Performance at the core. Kong's NGINX/OpenResty core handles high-throughput traffic efficiently. Rate limiting, auth, and traffic management plugins are fast and well-understood. The operational model is familiar to platform engineering teams worldwide.

decK for GitOps. Kong's declarative configuration with decK is a genuinely good GitOps story for gateway state. If your infrastructure-as-code practice is built around it, extending that to AI routes is incremental.

The OSS entry point is real. Kong Gateway OSS is free and production-capable. You can start routing AI traffic without a commercial relationship, which matters for teams evaluating options without a budget commitment.

Kong AI Gateway has moved fast in 2026. The April 2026 v3.14 release added Agent Gateway — governing LLM, MCP, and agent-to-agent (A2A) traffic from a unified control plane. MCP server aggregation mode lets multiple MCP servers sit behind a single Kong route, presenting a unified tool view to clients. This is a meaningful capability update, and comparison pages written before April 2026 miss it.

OpenMeter acquisition (September 2025). Kong acquired OpenMeter, a usage metering platform, which should improve token-based cost attribution and usage analytics. This closes a gap that TrueFoundry's page highlights — but the integration depth in current releases isn't fully documented yet.

Where the friction shows

AI features are license-gated in specific ways. This is real and worth auditing carefully before standardizing. AI Rate Limiting Advanced (required to limit by tokens, not just by requests) is an enterprise plugin. OIDC/SSO integrations are commonly enterprise-tier. PII sanitization depth varies by plugin version and license. Enterprise pricing starts in the mid-five-figures annually and scales significantly — not a surprise for Kong's market, but a planning variable worth confirming before evaluating capabilities you expect to use.

The key question: look at the specific AI capabilities on your requirements list and confirm which Kong tier they require. The OSS entry point is real, but the AI-specific features you need may pull you into commercial tiers you didn't budget for.

Plugin composability has overhead. Kong's power comes from composing plugins. That's also where complexity compounds for AI-specific use cases. Which plugins are compatible at which versions, which require enterprise licensing, how AI plugin chains interact with existing API management plugins — this requires ongoing version management. For teams without existing Kong expertise, the ramp-up is real.

Data residency is partially gated. TrueFoundry runs all enforcement (auth, rate limits, guardrails, PII detection) in-process with no external calls. Kong's most advanced data governance features — including bidirectional PII sanitization — are enterprise-gated and version-dependent. For teams where in-process PII detection is a hard compliance requirement, confirm your tier covers it before committing.

Prompt lifecycle management is thin. Kong's AI Prompt Decorator handles gateway-level prompt injection cleanly. But there's no versioning registry, no per-model prompt overrides, no playground, and no CI-gated prompt deployment. For teams doing active prompt iteration with compliance requirements, this matters.

Cost attribution for self-hosted models is absent. Kong routes to wherever you point it — self-hosted endpoints included. But cost attribution for self-hosted model fleets (tracking token cost across private GPU infra) isn't natively supported. The OpenMeter acquisition may change this, but current capabilities don't cover it.

What's genuinely different about TrueFoundry

TrueFoundry's starting assumption is different from Kong's: it was built for AI workloads, not extended to them. That shows most clearly in a few areas.

Everything on the hot path runs in-memory inside the cluster. Auth, rate limits, RBAC, budget enforcement, guardrails, PII/PHI detection — none of these make external calls on a live request. The published latency: ~3ms overhead at 250 RPS per pod, scaling linearly with pods. [Note: vendor-stated figure, not independently benchmarked.] For air-gapped or strictly regulated environments, in-process enforcement is the architecture, not a configuration option.

MCP governance is purpose-built. Virtual MCP Servers, Cedar-based policy, per-invocation guardrail hooks before and after each tool call, and credential isolation via Secret Groups. Kong's AI MCP Proxy (enhanced in v3.14) is a real MCP control surface - but post-tool-call inspection of what a tool returns before it reaches the model remains a gap to verify in Kong's current release.

Model deployment is unified with routing. This is the biggest structural difference. TrueFoundry manages external API routing and self-hosted model deployment from the same control plane — deployment, training, fine-tuning, and gateway in one system. Kong routes to wherever you point it; deploying the models is an entirely separate concern.

Prompt lifecycle is production-grade. Version history, compare/diff, CI-gated deployment gates enforced at the routing layer, and dry-run previews. For teams where prompt changes are regulated artifacts, this matters in a way Kong's Prompt Decorator doesn't address.

The real decision frame

Kong vs TrueFoundry at a Glance

Category	Kong AI Gateway	TrueFoundry
Core philosophy	API gateway extended with AI capabilities	AI-native platform built around AI, MCP, and agent workloads
Original use case	API management and microservices	Production AI systems and GenAI applications
AI routing	Available through AI plugins	Native model-aware routing and traffic policies
Token-level observability	Limited compared to AI-native platforms	Built-in token, latency, and cost observability
MCP support	Available through newer extensions and plugins	Native MCP Gateway capabilities
Agent governance	Emerging support	Built-in agent governance and tracing
Kubernetes integration	Excellent and battle-tested	Strong Kubernetes-native deployment model
API management	Mature ecosystem and plugin marketplace	Not intended to replace a full API management platform
Best fit	Organizations already standardized on Kong	Teams building AI infrastructure as a first-class platform capability
Learning curve	Familiar for API platform teams	Familiar for AI platform and MLOps teams

Stay with Kong if:

You're already running Kong for API management and AI traffic is additive, not transformational
Your team has Kong expertise that has real value — you're not starting from scratch
Your AI capability needs fit within the plugin model and your current (or planned) Kong license tier
You want a single gateway controlling all API and AI traffic with uniform policies
The Agent Gateway A2A support (v3.14) covers your near-term agent requirements

Evaluate TrueFoundry if:

You're building AI infrastructure without existing Kong investment - there's no switching cost
Your AI workloads require in-process PII detection, air-gap readiness, or data sovereignty that runs without enterprise plugin gating
You're planning self-hosted model deployment and want unified governance with routing
You need CI-gated prompt lifecycle management beyond gateway-level injection
You want MCP post-tool-call guardrails as a built-in rather than something to verify by plugin version

The honest "already on Kong" answer. Extending Kong to AI traffic is a legitimate path and often the lowest-friction one. The calculation gets harder when: the AI capabilities you need are behind a license tier you're not on; your compliance requirements need in-process PII detection that your current plugins don't provide; or your stack is moving toward self-hosted models and agentic workloads that Kong wasn't designed to deploy and govern end-to-end. None of those are hypothetical for most enterprise AI teams in 2026.

What needs checking before you decide

Kong Agent Gateway scope (v3.14): The April 2026 release added A2A traffic support and MCP server aggregation. The source material for this post described Kong as having no native async execution for agents — that was accurate before v3.14 and may be outdated now. Anyone evaluating Kong for agent workloads should read the v3.14 release notes directly rather than relying on comparison pages.

OpenMeter integration depth: Kong acquired OpenMeter for usage metering. How deeply this is integrated into Kong's token-level cost attribution in current releases isn't fully documented. If cost attribution is a decision factor, verify current capabilities directly with Kong.

License tier for your required AI features: Confirm which plugins you need, at which Kong tier they're available, and what the annual cost is at your required scale before comparing TCO with a commercial alternative. The OSS entry point is real; the specific AI capabilities you need may not be.

TrueFoundry pricing: TrueFoundry is a commercial platform with pricing not publicly listed. Get a quote before assuming it fits your budget — or that the TCO math favors it over Kong's enterprise tier at your scale.

If you've run Kong AI Gateway in production at scale especially post-v3.14, I'd be interested in what the agent governance story looks like in practice. The release notes describe capabilities; what it's like to operate them is a different question.