<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Sourabh Mourya</title>
    <description>The latest articles on DEV Community by Sourabh Mourya (@sourabh_48c218e07674a2af0).</description>
    <link>https://dev.to/sourabh_48c218e07674a2af0</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3950516%2F047f76ef-7223-4608-98af-a2056c1969dc.png</url>
      <title>DEV Community: Sourabh Mourya</title>
      <link>https://dev.to/sourabh_48c218e07674a2af0</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/sourabh_48c218e07674a2af0"/>
    <language>en</language>
    <item>
      <title>You Don’t Need More Tokens, You Need Better Thinking</title>
      <dc:creator>Sourabh Mourya</dc:creator>
      <pubDate>Thu, 28 May 2026 12:52:42 +0000</pubDate>
      <link>https://dev.to/sourabh_48c218e07674a2af0/you-dont-need-more-tokens-you-need-better-thinking-4dk1</link>
      <guid>https://dev.to/sourabh_48c218e07674a2af0/you-dont-need-more-tokens-you-need-better-thinking-4dk1</guid>
      <description>&lt;p&gt;I got my first Anthropic bill and genuinely thought it was a mistake.&lt;/p&gt;

&lt;p&gt;It wasn't. I was just being wasteful without realizing it.&lt;/p&gt;

&lt;p&gt;After a week of obsessing over token usage, I cut my costs by nearly 60% with &lt;em&gt;better&lt;/em&gt; results. Here's what I learned.&lt;/p&gt;

&lt;h2&gt;
  
  
  What even is a token?
&lt;/h2&gt;

&lt;p&gt;A token isn't a word. It's closer to a word-chunk.&lt;/p&gt;

&lt;p&gt;As a rough rule: &lt;strong&gt;1 token ≈ 4 characters&lt;/strong&gt; in English. So "unbelievable" is about 3 tokens. A full paragraph might be 80–120 tokens.&lt;/p&gt;

&lt;p&gt;Every API call counts both directions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Input tokens&lt;/strong&gt; — everything you send (your prompt + system prompt + conversation history)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Output tokens&lt;/strong&gt; — everything the model sends back&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Output tokens cost more. Always. Keep that in mind.&lt;/p&gt;

&lt;h2&gt;
  
  
  Where most people bleed tokens without knowing
&lt;/h2&gt;

&lt;p&gt;When I audited my prompts, I found the same mistakes every time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Bloated system prompts.&lt;/strong&gt; I had one that was 800 words of "be helpful, be concise, be professional..." repeated five different ways. That runs on &lt;em&gt;every single call&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Dumping full context when partial context works.&lt;/strong&gt; I was sending entire documents and saying "answer this question about section 3." The model reads all of it. Every time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Asking for long outputs when short ones do the job.&lt;/strong&gt; "Explain this concept" gets you 400 words. "Explain this in 2 sentences" gets you 40 words and usually a cleaner answer.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Conversation history that never gets trimmed.&lt;/strong&gt; In multi-turn chats, every previous message gets re-sent. A 20-turn conversation can be 3,000 tokens before you've even typed your next message.&lt;/p&gt;

&lt;h2&gt;
  
  
  What actually works to cut costs
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;1. Be specific about output length.&lt;/strong&gt;&lt;br&gt;
Add "in 1 paragraph" or "in under 100 words" to your prompt. Models respect this and it forces tighter answers.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Chunk your context.&lt;/strong&gt;&lt;br&gt;
Instead of sending a full document, extract and send only the relevant section. A focused 200-token excerpt beats a 2,000-token dump every time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Shrink your system prompt ruthlessly.&lt;/strong&gt;&lt;br&gt;
Cut anything that's vague or repeated. "Be concise" said once beats "please ensure your responses are as concise as possible and avoid unnecessary verbosity" said once. Less is genuinely more.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4. Use a cheaper model for simple tasks.&lt;/strong&gt;&lt;br&gt;
Not every call needs your most powerful model. Routing classification, summarization, or formatting tasks to a smaller model can cut costs 5–10x with zero quality loss.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;5. Cache repeated context.&lt;/strong&gt;&lt;br&gt;
If you're building something and sending the same instructions or documents repeatedly, look into prompt caching. Anthropic, OpenAI, and others support it — cached tokens cost a fraction of fresh ones.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;6. Ask for structured output.&lt;/strong&gt;&lt;br&gt;
"Return JSON with keys: summary, action, confidence" is shorter to process and parse than "please write a detailed explanation followed by the recommended action and your confidence level."&lt;/p&gt;

&lt;h2&gt;
  
  
  The mindset shift that helped most
&lt;/h2&gt;

&lt;p&gt;I stopped thinking "how do I write better prompts" and started thinking &lt;strong&gt;"what's the minimum context this model actually needs to answer correctly?"&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;That question changed everything.&lt;/p&gt;

&lt;p&gt;The model doesn't need your life story. It needs the right facts, a clear goal, and a defined output format. Everything else is noise you're paying for.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Now I want to hear from you:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Have you ever actually looked at your token usage per call, or do you just watch the bill?&lt;/li&gt;
&lt;li&gt;What's the dumbest token waste you've caught in your own prompts?&lt;/li&gt;
&lt;li&gt;Have you found a trick that cut costs without hurting quality?&lt;/li&gt;
&lt;li&gt;Are you using prompt caching yet — and did it actually help?
Drop your answer below. Even "I had no idea tokens worked this way" counts — we've all been there.&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>webdev</category>
      <category>productivity</category>
      <category>automation</category>
    </item>
    <item>
      <title>AI Agents Changed My Workflow</title>
      <dc:creator>Sourabh Mourya</dc:creator>
      <pubDate>Tue, 26 May 2026 11:33:11 +0000</pubDate>
      <link>https://dev.to/sourabh_48c218e07674a2af0/ai-agents-changed-my-workflow-1d8d</link>
      <guid>https://dev.to/sourabh_48c218e07674a2af0/ai-agents-changed-my-workflow-1d8d</guid>
      <description>&lt;p&gt;I'll be honest I thought "agentic AI" was just a fancy way of saying chatbot.&lt;/p&gt;

&lt;p&gt;Then I tried it. And I haven't gone back.&lt;/p&gt;

&lt;h2&gt;
  
  
  What even is an AI agent?
&lt;/h2&gt;

&lt;p&gt;Here's the simplest way I can explain it.&lt;/p&gt;

&lt;p&gt;A normal AI tool waits for you to ask something, answers, and stops. You're still driving every step.&lt;/p&gt;

&lt;p&gt;An agent takes a &lt;strong&gt;goal&lt;/strong&gt; not a task and figures out the steps itself. It reads files, runs commands, hits errors, self-corrects, and keeps going.&lt;/p&gt;

&lt;p&gt;That's a completely different thing.&lt;/p&gt;

&lt;h2&gt;
  
  
  The moment it clicked for me
&lt;/h2&gt;

&lt;p&gt;I had a broken API integration. Wrong payload format, 400 errors, the usual headache.&lt;/p&gt;

&lt;p&gt;Normally that's 20–30 minutes of my life gone. Log pulling, doc checking, trace and patch and repeat.&lt;/p&gt;

&lt;p&gt;I handed the goal to an agentic AI instead.&lt;/p&gt;

&lt;p&gt;It read the stack trace, checked recent commits, found the exact line where the payload structure broke, wrote a fix, and ran the test.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Under four minutes.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I just sat there. Not impressed &lt;em&gt;unsettled&lt;/em&gt;. Like watching someone else parallel park your car perfectly on the first try.&lt;/p&gt;

&lt;h2&gt;
  
  
  The tool I've been using
&lt;/h2&gt;

&lt;p&gt;I went deep on &lt;strong&gt;OpenClaw&lt;/strong&gt; specifically it's open-source, runs locally, brings your own API key, and it can read files, control your browser, send messages, and run shell commands autonomously.&lt;/p&gt;

&lt;p&gt;180,000+ GitHub stars in three months. That's not hype. That's developers voting with their attention.&lt;/p&gt;

&lt;p&gt;I wrote up my full experience with it what it did well, where it failed badly, and whether I'd actually trust it in a real workflow:&lt;/p&gt;

&lt;p&gt;👉 &lt;a href="https://www.themodernblog.com/openclaw-agentic-ai/" rel="noopener noreferrer"&gt;OpenClaw and the Rise of Agentic AI for Faster Coding&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  But it's not magic let me be real
&lt;/h2&gt;

&lt;p&gt;Agents break in weird ways.&lt;/p&gt;

&lt;p&gt;I watched OpenClaw loop on the same wrong fix seven times without realizing it. I watched it "solve" a bug by deleting the test catching it. I watched it make a three-second architectural decision I'd have thought about for three days and get it completely wrong.&lt;/p&gt;

&lt;p&gt;The demos you see online are cherry-picked. Real usage is messier.&lt;/p&gt;

&lt;p&gt;Right now you still need a human who understands what's happening not to write every line, but to &lt;strong&gt;catch the agent when it confidently goes off the rails.&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  The real shift I'm noticing
&lt;/h2&gt;

&lt;p&gt;The skill that's mattering now isn't "can you code fast."&lt;/p&gt;

&lt;p&gt;It's "can you think clearly enough about a problem to give an agent a goal it won't misinterpret."&lt;/p&gt;

&lt;p&gt;That's harder than it sounds. And most of us are figuring it out in real time.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;I want to hear from you:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Have you tried any agentic AI tools in your actual workflow yet?&lt;/li&gt;
&lt;li&gt;What's the most impressive thing one did for you?&lt;/li&gt;
&lt;li&gt;What's the most embarrassingly wrong thing it did that you caught just in time?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Drop it below. Even one line counts.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>productivity</category>
      <category>agents</category>
    </item>
    <item>
      <title>I Let an AI Agent Loose on My Codebase. Here's What Actually Happened.</title>
      <dc:creator>Sourabh Mourya</dc:creator>
      <pubDate>Mon, 25 May 2026 11:19:34 +0000</pubDate>
      <link>https://dev.to/sourabh_48c218e07674a2af0/i-let-an-ai-agent-loose-on-my-codebase-heres-what-actually-happened-4en3</link>
      <guid>https://dev.to/sourabh_48c218e07674a2af0/i-let-an-ai-agent-loose-on-my-codebase-heres-what-actually-happened-4en3</guid>
      <description>&lt;p&gt;Okay, real talk.&lt;/p&gt;

&lt;p&gt;I kept seeing "agentic AI" everywhere Twitter, YouTube, every second DEV post. And honestly? I thought it was just another buzzword people were using to feel ahead of the curve.&lt;/p&gt;

&lt;p&gt;Then I actually tried it.&lt;/p&gt;

&lt;h2&gt;
  
  
  Wait, what even &lt;em&gt;is&lt;/em&gt; an agentic AI?
&lt;/h2&gt;

&lt;p&gt;Here's how I'd explain it to past-me.&lt;/p&gt;

&lt;p&gt;A regular AI tool (think: Copilot autocomplete) waits for you to ask something, gives you a suggestion, and stops. You're still driving.&lt;/p&gt;

&lt;p&gt;An &lt;em&gt;agent&lt;/em&gt; is different. You give it a &lt;strong&gt;goal&lt;/strong&gt; not a task, a &lt;em&gt;goal&lt;/em&gt; and it figures out the steps itself. It reads your files, runs commands, hits errors, self-corrects, and keeps going until it's done or stuck.&lt;/p&gt;

&lt;p&gt;It's less "smart autocomplete" and more "intern who actually reads the whole repo before touching anything."&lt;/p&gt;

&lt;h2&gt;
  
  
  The moment it clicked for me
&lt;/h2&gt;

&lt;p&gt;I had a broken webhook integration. Nothing catastrophic, but annoying wrong payload format, 400 errors, the usual.&lt;/p&gt;

&lt;p&gt;I would normally spend 20–30 minutes on it. Open logs, check docs, trace the request, patch, test, repeat.&lt;/p&gt;

&lt;p&gt;I pointed an agent at it instead.&lt;/p&gt;

&lt;p&gt;It read the stack trace. Checked the last three commits. Found the exact line where the payload structure changed. Wrote a fix. Ran the test. Done.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Under four minutes.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I just sat there staring at my screen. Not excited &lt;em&gt;unsettled&lt;/em&gt;. Like watching someone else parallel park your car perfectly on the first try.&lt;/p&gt;

&lt;h2&gt;
  
  
  But here's where I need to be honest with you
&lt;/h2&gt;

&lt;p&gt;Agents are not magic. They're more like a very confident junior dev who sometimes has no idea what they don't know.&lt;/p&gt;

&lt;p&gt;I've also watched an agent:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Loop on the same broken fix &lt;em&gt;seven times&lt;/em&gt; without realizing it was wrong&lt;/li&gt;
&lt;li&gt;"Solve" a bug by deleting the test that was catching it&lt;/li&gt;
&lt;li&gt;Make an architectural decision in 3 seconds that I'd have thought about for 3 days and get it completely wrong
The demos you see online are cherry-picked. Production reality is messier.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Right now, according to Anthropic's own data, developers can fully hand off only &lt;strong&gt;0–20% of tasks&lt;/strong&gt; to agents without supervision. The rest still needs a human in the loop.&lt;/p&gt;

&lt;p&gt;So no, agents aren't replacing you. But they &lt;em&gt;are&lt;/em&gt; changing what "your job" actually means.&lt;/p&gt;

&lt;h2&gt;
  
  
  The real question I keep asking myself
&lt;/h2&gt;

&lt;p&gt;If an agent can handle the &lt;em&gt;doing&lt;/em&gt; part of coding the mechanical execution what exactly is the skill that matters now?&lt;/p&gt;

&lt;p&gt;I think it's &lt;strong&gt;judgment&lt;/strong&gt;. Knowing what to build. Knowing when the agent is confidently wrong. Knowing which 20% of decisions actually matter and can't be delegated.&lt;/p&gt;

&lt;p&gt;That's not a junior skill. That's not even a mid-level skill. That's the stuff that takes years to develop.&lt;/p&gt;

&lt;p&gt;Which makes me wonder are we about to see a massive gap open up between developers who can &lt;em&gt;think clearly about problems&lt;/em&gt; and developers who are just really fast at typing code?&lt;/p&gt;

&lt;h2&gt;
  
  
  I genuinely want to hear from you
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Have you used an AI agent in your actual workflow yet or just played with it?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;And if you have what's the task it handled best? What's the most embarrassing thing it got wrong?&lt;/p&gt;

&lt;p&gt;Drop it in the comments. Genuinely curious whether my experience is typical or if I just got unlucky with my first few tries.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Follow me if you want more unfiltered takes on building with AI no hype, no doom, just what's actually happening day-to-day.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>beginners</category>
      <category>discuss</category>
      <category>productivity</category>
    </item>
  </channel>
</rss>
