Omkar padwale

Posted on May 29

How This 2-Person Y-combinator backed Team Built "AWS for AI Agents" - $500k raised - Tech breakdown

#typescript #startup #yc

Hey I'm Omkar yoooo

Owner of FounderDeepDives and I'm doing a series where we introduce YC backed startups and their founders and the core technology they're working on

so let's get started lesgooo

TL;DR

Truffle Al (YC W25) is a Bengaluru/SF-based developer platform that turns Al agents into plug-and-play APIs, letting engineers integrate customer support agents into WhatsApp or analytics agents into Slack with just a few lines of TypeScript code. The company raised $500K seed from Y Combinator in February 2025 and is tackling the reality that developers spend 60% of their time on infrastructure "glue code" instead of building actual Al solutions. Founded by Shaunak Srivastava (ex-CMU GenAl and 3D Research, Hyperverge) and Rahul Karajgikar (ex-AWS OpenSearch SDE-2), Truffle abstracts state management, memory, vector databases, tool integration, and deployment infrastructure.

*The Problem Space: Why Al Agent Production is Actually Hard
*
Most people think "just call an LLM API and you're done." If you've built production Al agents, you know that's bullshit.

The real bottleneck isn't model access it's the infrastructure overhead. Developers need to handle state management with checkpointing and thread persistence, memory systems for both short-term sliding windows and long-term vector storage, vector databases with embedding pipelines and indexing, tool integrations with OAuth flows for Gmail, GitHub, Notion, Slack with rate limiting and error handling, and deployment infrastructure for containerization, scaling, monitoring, and cost tracking.

This is why companies burn $100K to $200K per year on DevOps overhead just to manage Al agent infrastructure. A $100K vendor quote typically translates to $140K to $160K in actual Year 1 costs when you account for hidden maintenance, model retraining, and security audits.

The industry data confirms this: developers spend 60% of their time on glue code instead of building actual Al solutions. Most Al projects never make it to production because of this engineering complexity, not because the models aren't good enough.

Deloitte's 2025 survey found that 48% of organizations see data searchability as a challenge to their Al automation strategy, and only 11% are actively using Al agents despite this being supposed "the year of Al agents

Architecture Breakdown: How Truffle Al Actually Works

Truffle's architecture is built around a simple principle: expose agents as HTTP APIs that can be called from anywhere, with all the heavy lifting handled by the platform.

The system flows like this. Your application (whether it's WhatsApp, Slack, a web app, mobile app, or any HTTP client) sends requests to Truffle's API gateway which handles authentication and rate limiting. The request then goes to the agent orchestration layer which manages tool routing and prompt templating. From there, the state and memory layer handles checkpointing to a database, vector store retrieval for RAG, and workflow state persistence.

The tool integration layer sits beneath this with pre-built OAuth connections for Gmail, Google Drive, Sheets, Docs, Calendar, GitHub, Notion, Twitter, Reddit, Tavily, Exa, Slack, and WhatsApp. Authentication is dashboard-managed via OAuth so developers don't handle tokens directly.

At the bottom is the managed LLM runtime that supports gpt-40-mini as the default, plus gpt-40 and Claude 3.5. Truffle handles model versioning, failover, and cost optimization so you don't need to manage GPU clusters or inference servers.

The TypeScript SDK is where developers actually interact with the system. You initialize the client with your API key, deploy an agent in three lines by specifying the name, instruction, model, and tool, then run one-off tasks or start persistent chat sessions with context memory. Conversation history is accessible through the chat object.

Engineering Challenges and How They Solved Them

Challenge 1: Memory Architecture That Doesn't Suck

Most people think "just dump everything into a vector DB." If you've built enduring agents, you know that creates a debugging nightmare.

The problem is that mixing conversational context, persistent workflow status, user historical data, and knowledge retrieval into a single vector database makes the system hard to reason about later.

Truffle's solution separates memory into distinct layers. Structured data goes into a traditional database for workflow state and user metadata. The vector store handles semantic retrieval for RAG and long-term memory. Checkpoints persist workflow state so threads can be resumed.

This approach matches what experienced engineers do in production. Keeping structured data in a database, using vectors for retrieval, and implementing checkpoints for workflows makes debugging much simpler.

*Challenge 2: OAuth Hell at Scale
*
The problem is that each tool integration requires handling OAuth flows, token refresh, permission scopes, and error handling. For 12+ integrations, that's hundreds of lines of brittle code per integration.

Truffle's solution centralizes OAuth management in the dashboard. Token refresh is handled automatically by the platform. Permission scopes are pre-configured for each integration, like Reddit needing Identity, Read, Submit, and Edit permissions. Developers just toggle permissions and follow OAuth prompts with no code required.

*Challenge 3: State Persistence for Multi-Step Agents
*
The problem is that agents that perform multi-step tasks need to checkpoint progress so workflows can resume after failures. Without checkpointing, a 5-step task that fails at step 3 means starting over from scratch.

Truffle's solution uses built-in checkpointer with LangGraph-style state management. State is persisted to a database after each step. Threads can be resumed at any point. Conversation history is accessible through the chat object.

Challenge 4: Context Management Without Noise

The problem is that a significant challenge is determining which context is genuinely valuable for the next phase. Combining everything creates noise that diminishes retrieval accuracy.

Truffle's solution uses a sliding window for recent context, a vector store summary for past interactions, and best practices built into the SDK. The documentation explicitly recommends keeping related conversations in the same chat session and clearing chat history when starting new topics.

**The Business Context: Funding, Customers, Market Timing

Funding and Traction
**
Truffle Al is in YC Winter 2025, raised a $500K seed round from Y Combinator in February 2025, and was founded in 2024. The team consists of 2 founders based in Bengaluru, India with a presence in SF. Harj Taggar is the primary YC partner working with them.

The $500K seed from YC in February 2025 gives them approximately 12 to 18 months of runway at a lean 2-person team.

Why Now? Market Timing

Al Agent Infrastructure in 2024 to 2025 shows a clear gap. LLM APIs became commodity with GPT-4, Claude, and open-source models widely available. Frameworks emerged like LangChain and Llamalndex but still require 60% glue code. Only 11% of organizations actively use Al agents despite the hype. And 48% of companies struggle with data searchability for Al automation.

The window is open because companies are actively looking to deploy Al agents but are blocked by infrastructure complexity. Truffle's timing aligns with model commoditization shifting focus to the application layer, developer fatigue with glue code creating demand for managed solutions, and YC's Al push providing strong distribution for Al infrastructure plays.

*Customers and Use Cases
*
Developers have already built customer support agents for WhatsApp and Slack automation, analytics agents that fetch and analyze spreadsheet data, and multi-agent workflows coordinating different Al tasks.

Specific examples from the documentation include virtual assistants with calendar integration, research assistants using Tavily for web research, and content assistants repurposing YouTube videos into blog posts.

*Open Source Strategy
*
Truffle maintains a public GitHub repository with examples. The repo is called truffle-ai slash truffle-examples and requires Node.js version 16 or higher with npm or yarn. The content includes example applications showcasing different agent capabilities.

They're not open-sourcing the core platform since it's a managed service, but they're being generous with examples and documentation to reduce friction for adoption.

The TypeScript SDK is their primary developer touchpoint, following a developer-first philosophy with clean APIs.

The Founders and Team

Shaunak Srivastava - Co-Founder

Shaunak has a background in GenAl and 3D Research at CMU with publications at NeurIPS 2022 and WACV 2023 in collaboration with Meta. He previously built Al products for face recognition and health est
imation at Hyperverge and co-founded Perz Al as a previous startup. He studied at CMU and BITS.

Shaunak brings Al research credibility with published work at top venues. NeurlPS is one of the most competitive ML conferences. His Hyperverge experience shows he's shipped Al products in production since face recognition is a hard computer vision problem.

Rahul Karajgikar - Co-Founder and CTO

Rahul is a former SDE-2 at AWS OpenSearch from April 2023 to January 2025, and was an SDE at AWS from July 2021 to April 2023. He's a contributor to the OpenSearch Project from July 2021 to November 2024 and studied at BITS Pilani.

Rahul brings production infrastructure expertise. Working on OpenSearch at AWS means he's dealt with distributed search at scale, which directly translates to the vector database and retrieval challenges Truffle solves. His open source contribution to OpenSearch shows he understands the developer experience side of infrastructure tools.

The combination is strong: Shaunak handles the Al research and product vision while Rahul owns the infrastructure and platform engineering. This is the classic founder split that works when building developer platforms.

Top comments (1)

Harjot Singh • May 31

"AWS for AI agents" is the land-grab everyone's racing for right now, and the 2-person + $500k detail is the real story: infra for agents is buildable by a tiny team precisely because the hard part is orchestration logic, not datacenters. The bet is that agents need a managed runtime (sandboxing, state, tool access, billing-per-run) the way web apps needed managed hosting - and whoever becomes the default substrate wins big.

The open question I'd love the breakdown to answer: is the moat the infra, or the developer experience on top of it? "AWS for X" companies win on DX and lock-in more than raw capability. I'm building adjacent (Moonshift - a multi-agent pipeline that takes a prompt to a shipped SaaS on your own GitHub + Vercel, ~$3 flat per build) and the thing I keep concluding is that for agents, owning the orchestration + cost story matters more than owning the metal. First run's free, no card. Great breakdown - did they build their own sandboxing/runtime, or wrap existing cloud primitives? That build-vs-wrap call decides whether they're infra or a thick wrapper.