<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Sean Markwei</title>
    <description>The latest articles on DEV Community by Sean Markwei (@seanmarkwei).</description>
    <link>https://dev.to/seanmarkwei</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3957006%2F153fa963-391e-459d-b383-a8194e8eaa9d.jpg</url>
      <title>DEV Community: Sean Markwei</title>
      <link>https://dev.to/seanmarkwei</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/seanmarkwei"/>
    <language>en</language>
    <item>
      <title>How I built AgentRAM: a memory API for AI agents without a vector DB</title>
      <dc:creator>Sean Markwei</dc:creator>
      <pubDate>Thu, 28 May 2026 18:18:36 +0000</pubDate>
      <link>https://dev.to/seanmarkwei/how-i-built-agentram-a-memory-api-for-ai-agents-without-a-vector-db-281</link>
      <guid>https://dev.to/seanmarkwei/how-i-built-agentram-a-memory-api-for-ai-agents-without-a-vector-db-281</guid>
      <description>&lt;p&gt;I'm a solo developer in Accra, Ghana, and I just shipped my first real product. It's called AgentRAM (&lt;a href="https://agentram.dev" rel="noopener noreferrer"&gt;agentram.dev&lt;/a&gt;), and it's a memory API for AI agents. This is the build story and the stack.&lt;/p&gt;

&lt;h2&gt;
  
  
  The problem I kept seeing
&lt;/h2&gt;

&lt;p&gt;Over the last year, AI agents have gone from research toys to actual things people ship. But every agent that needs to remember anything across sessions runs into the same wall: where does the memory go?&lt;/p&gt;

&lt;p&gt;The existing answers all felt heavy for what they were doing:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Mem0, Zep, Letta&lt;/strong&gt; want you to set up embedding pipelines and vector databases. Powerful for RAG-style semantic search, but overkill if you just need "remember that user X likes dark mode."&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;OpenAI's Assistants API memory&lt;/strong&gt; is locked to their platform and billed per-token, which means costs are unpredictable as conversation length grows.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Rolling your own with Postgres or Redis&lt;/strong&gt; works, but it's a real chunk of infrastructure to maintain for each agent project, including auth, multi-tenancy, TTLs, and an HTTP layer.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I wanted something that handled the 70% case ("remember this fact about this agent") without the 100% solution's setup cost. So I built it.&lt;/p&gt;

&lt;h2&gt;
  
  
  What AgentRAM actually does
&lt;/h2&gt;

&lt;p&gt;One HTTP call to store. One to retrieve. Scoped by agent ID, with optional TTLs and shared namespaces.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Store a memory&lt;/span&gt;
curl &lt;span class="nt"&gt;-X&lt;/span&gt; POST https://api.agentram.dev/memory &lt;span class="se"&gt;\&lt;/span&gt;
  &lt;span class="nt"&gt;-H&lt;/span&gt; &lt;span class="s2"&gt;"x-api-key: YOUR_KEY"&lt;/span&gt; &lt;span class="se"&gt;\&lt;/span&gt;
  &lt;span class="nt"&gt;-d&lt;/span&gt; &lt;span class="s1"&gt;'{"agent_id":"my-agent","key":"user_pref","value":"dark mode"}'&lt;/span&gt;

&lt;span class="c"&gt;# Retrieve it&lt;/span&gt;
curl &lt;span class="s2"&gt;"https://api.agentram.dev/memory?agent_id=my-agent&amp;amp;key=user_pref"&lt;/span&gt;
&lt;span class="c"&gt;# {"value":"dark mode"}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;That's the whole interaction model. No embeddings, no vector similarity, no semantic chunking, no token accounting. Just durable key-value memory scoped per agent.&lt;/p&gt;

&lt;p&gt;Other endpoints fill in the practical needs: list all memories for an agent, full-text search across them, shared namespaces so multiple agents can read from a common pool, and atomic credit-based usage tracking so cost is predictable.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why not just Redis or Postgres?
&lt;/h2&gt;

&lt;p&gt;This is the question I keep wrestling with, so let me be honest about it.&lt;/p&gt;

&lt;p&gt;If you're already running infrastructure for your product, you should absolutely just add a memory table to your existing database. AgentRAM isn't for you.&lt;/p&gt;

&lt;p&gt;But for everyone else, and there are a lot of "everyone else" right now in the vibe-coding and agent-prototyping era, AgentRAM removes some real friction:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;No new infra to provision&lt;/li&gt;
&lt;li&gt;No auth layer to write for your agents&lt;/li&gt;
&lt;li&gt;No HTTP wrapper to build around your DB&lt;/li&gt;
&lt;li&gt;No multi-tenant logic to get right&lt;/li&gt;
&lt;li&gt;Built-in TTLs, search, and shared namespaces&lt;/li&gt;
&lt;li&gt;Predictable per-operation pricing (no per-token surprises)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It's a Twilio-style argument: yes, you could roll your own SMS gateway, but for most people, paying per-message is cheaper than the time cost.&lt;/p&gt;

&lt;h2&gt;
  
  
  The stack
&lt;/h2&gt;

&lt;p&gt;Nothing exotic, all standard boring tech:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;API:&lt;/strong&gt; Node.js with Express, deployed on Railway. Auto-deploys from GitHub.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Database:&lt;/strong&gt; Supabase (Postgres), with atomic credit-update logic so concurrent payments and reads don't race.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Payments:&lt;/strong&gt; Paystack. I'm in Ghana, Stripe doesn't operate here yet, but Stripe acquired Paystack in 2020. Paystack handles cards globally plus mobile money and Apple Pay, so coverage is actually broader than Stripe-alone for some users.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Email:&lt;/strong&gt; Resend, with Cloudflare Email Routing for inbound on &lt;a href="mailto:hello@agentram.dev"&gt;hello@agentram.dev&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;DNS:&lt;/strong&gt; Cloudflare, with WAF and rate limiting.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Frontend:&lt;/strong&gt; Static HTML on Netlify, with Cabinet Grotesk self-hosted. No framework, no build step, just hand-written HTML and CSS.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The whole thing is six HTML pages, one Node server, one Supabase project. It's not a lot. That's intentional.&lt;/p&gt;

&lt;h2&gt;
  
  
  The pricing model: credits, not subscriptions
&lt;/h2&gt;

&lt;p&gt;After agonising over this, I went with credit-based pricing instead of a monthly subscription.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;100 free credits on signup, no card required&lt;/li&gt;
&lt;li&gt;1 credit per operation (read, write, delete, search all count as one)&lt;/li&gt;
&lt;li&gt;Top-ups: $5 for 50,000 ops, $15 for 200,000 ops, $40 for 600,000 ops&lt;/li&gt;
&lt;li&gt;Founding member tier: $249 one-time for 500,000 ops plus 20% off all future top-ups, for as long as the account is active&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Why credits over subscriptions:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Aligns with how AI agent usage actually varies (bursty, unpredictable)&lt;/li&gt;
&lt;li&gt;No "wasted" subscription months for users who weren't building that month&lt;/li&gt;
&lt;li&gt;No churn anxiety on my side&lt;/li&gt;
&lt;li&gt;The unit price is easy to reason about: 1¢ per 100 operations at the Starter tier&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The downside is it's slightly weirder to project revenue against. But I'd rather have a model my users actually feel good about.&lt;/p&gt;

&lt;h2&gt;
  
  
  The "did it actually work" moment
&lt;/h2&gt;

&lt;p&gt;I shipped this today. The full deployment took about a day, including:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Pushing the API to Railway&lt;/li&gt;
&lt;li&gt;Pointing api.agentram.dev at Railway via Cloudflare CNAME&lt;/li&gt;
&lt;li&gt;Deploying the static site to Netlify&lt;/li&gt;
&lt;li&gt;Pointing agentram.dev at Netlify via Cloudflare CNAME&lt;/li&gt;
&lt;li&gt;Verifying Let's Encrypt SSL provisioned on both domains&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Then I made my first real $5 test charge to my own account. Watched the credit count tick from 100 to 50,100. Confirmation email landed. The whole pipeline worked end to end.&lt;/p&gt;

&lt;p&gt;That's the moment that justifies all the work that came before.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I'm building next
&lt;/h2&gt;

&lt;p&gt;The build is done. Now the harder thing: distribution. Things I'm planning over the next few weeks:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;MCP server wrapper.&lt;/strong&gt; Anthropic's Model Context Protocol is becoming the standard for how AI tools discover and use external services. An MCP server for AgentRAM means Claude Desktop and Cline users can add persistent memory with one config line.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;LangChain memory backend.&lt;/strong&gt; Implement &lt;code&gt;BaseMemory&lt;/code&gt; so AgentRAM works as a drop-in memory layer for any LangChain agent.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;LlamaIndex memory module&lt;/strong&gt; and &lt;strong&gt;AutoGen / CrewAI integrations&lt;/strong&gt; for the same reason.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Official SDKs&lt;/strong&gt; for Python and TypeScript so the curl examples become idiomatic library calls.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  What I'd love feedback on
&lt;/h2&gt;

&lt;p&gt;Genuinely, not as a marketing-friendly closer. If you've built agents that need memory, I want to know:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Does the API surface feel complete enough, or is something missing?&lt;/li&gt;
&lt;li&gt;What would make you pick this over rolling your own with Postgres?&lt;/li&gt;
&lt;li&gt;Which integration would actually move the needle for you?&lt;/li&gt;
&lt;li&gt;What would make you suspicious of this as a solo dev's project?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://agentram.dev" rel="noopener noreferrer"&gt;agentram.dev&lt;/a&gt; if you want to poke at it. 100 free credits if you want to try the API. Comments welcome here, or email me directly at &lt;a href="mailto:hello@agentram.dev"&gt;hello@agentram.dev&lt;/a&gt;.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>showdev</category>
      <category>api</category>
      <category>buildinpublic</category>
    </item>
  </channel>
</rss>
