I cut my LLM API costs by 71% — here's the open-source SDK I built

#llm #ai #python #opensource

After my LangChain agent cost me $12 in one afternoon, I built AgentFuse.

What it does

Semantic caching: semantically similar prompts return cached results without hitting the API (87.5% hit rate in benchmarks)

Per-run budget enforcement: a hard spending cap on each agent run stops it before it blows up your bill

Zero infrastructure: no proxy server, just pip install and two lines of code
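
To make the first two ideas concrete, here is a minimal sketch of how semantic caching and a per-run budget cap can fit together. This is not AgentFuse's actual API or implementation. The names `embed`, `CachedBudgetedClient`, and `BudgetExceeded` are hypothetical, the bag-of-words `embed` is a toy stand-in for a real embedding model, and the flat per-call cost is a simplification of token-based pricing.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class BudgetExceeded(RuntimeError):
    """Raised when the next paid call would push a run past its cap."""


class CachedBudgetedClient:
    """Hypothetical wrapper: check the cache first, then the budget, then call."""

    def __init__(self, call_model, budget_usd, cost_per_call_usd, threshold=0.8):
        self.call_model = call_model            # any function: prompt -> response
        self.budget_usd = budget_usd            # hard cap for this run
        self.cost_per_call_usd = cost_per_call_usd  # assumed flat cost per call
        self.threshold = threshold              # minimum similarity for a hit
        self.spent_usd = 0.0
        self.entries = []                       # (embedding, response) pairs

    def complete(self, prompt):
        query = embed(prompt)

        # Linear scan for the most similar cached prompt.
        best_score, best_response = 0.0, None
        for cached_embedding, cached_response in self.entries:
            score = cosine(query, cached_embedding)
            if score > best_score:
                best_score, best_response = score, cached_response
        if best_response is not None and best_score >= self.threshold:
            return best_response                # cache hit: no spend

        # Cache miss: refuse the call if it would exceed the cap.
        if self.spent_usd + self.cost_per_call_usd > self.budget_usd:
            raise BudgetExceeded(
                f"spent ${self.spent_usd:.2f} of ${self.budget_usd:.2f} budget"
            )

        response = self.call_model(prompt)
        self.spent_usd += self.cost_per_call_usd
        self.entries.append((query, response))
        return response
```

With a cap of $0.05 and an assumed $0.02 per call, two distinct prompts go through, a near-duplicate of either is served from the cache for free, and a third distinct prompt raises `BudgetExceeded` instead of calling the model. A real version would swap in an embedding model and a vector index for the linear scan, and would price each call from its token counts.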

pip install agentfuse-runtime

Works with LangChain, CrewAI, LangGraph, OpenAI Agents SDK, MCP, and Pydantic AI.

Would love feedback from anyone building agents.