<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: surajdev9</title>
    <description>The latest articles on DEV Community by surajdev9 (@surajdev9).</description>
    <link>https://dev.to/surajdev9</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1105683%2Fc8fc4c58-44d1-40ec-800a-50a3178d8037.png</url>
      <title>DEV Community: surajdev9</title>
      <link>https://dev.to/surajdev9</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/surajdev9"/>
    <language>en</language>
    <item>
      <title>Dawa Saathi: I finally finished the part of my medicine bot that actually mattered</title>
      <dc:creator>surajdev9</dc:creator>
      <pubDate>Fri, 05 Jun 2026 12:47:51 +0000</pubDate>
      <link>https://dev.to/surajdev9/dawa-saathi-i-finally-finished-the-part-of-my-medicine-bot-that-actually-mattered-l99</link>
      <guid>https://dev.to/surajdev9/dawa-saathi-i-finally-finished-the-part-of-my-medicine-bot-that-actually-mattered-l99</guid>
      <description>&lt;p&gt;AI medicine-awareness bot worked — but only in English, the one language most of the people I built it for can't read. Here's the problem, the one-day sprint to fix it, and how GitHub Copilot got me through it.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;GitHub Finish-Up-A-Thon submission.&lt;/strong&gt; I shipped this project months ago, but it was never truly &lt;em&gt;finished&lt;/em&gt; — it was missing the one feature that decided whether a real person in my own community could use it. This challenge was the push that made me sit down and close that gap in a single night.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  The problem we face
&lt;/h2&gt;

&lt;p&gt;In India, you can walk into most pharmacies and buy prescription drugs without a prescription. We've quietly normalized something genuinely dangerous, and the research is blunt about it.&lt;/p&gt;

&lt;p&gt;A simulated-patient study in Bengaluru sent two researchers into &lt;strong&gt;261 pharmacies&lt;/strong&gt; with fake symptoms. Antibiotics were handed over &lt;strong&gt;without any prescription at roughly two-thirds (66.7%) of them.&lt;/strong&gt; Not a single pharmacy warned about side effects. Only about one in five even mentioned that a doctor's prescription was needed. The "guidance" most people walked out with was &lt;em&gt;"take it twice a day"&lt;/em&gt; — and nothing else. (&lt;a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676185/" rel="noopener noreferrer"&gt;study&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;Here is the part that kept me up at night: it's not because people are careless. The shopkeeper isn't a doctor. The label is printed in tiny English. A proper consultation costs time and money many families simply don't have. So people take what they're handed — and hope it's fine.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why it's important
&lt;/h2&gt;

&lt;p&gt;This isn't a small inconvenience. Taking the wrong medicine, or the right medicine the wrong way, is one of the biggest drivers of &lt;strong&gt;antibiotic resistance&lt;/strong&gt; — the slow disaster where medicines stop working.&lt;/p&gt;

&lt;p&gt;The landmark 2019 Lancet GRAM study estimated &lt;strong&gt;1.27 million deaths worldwide were directly attributable&lt;/strong&gt; to drug-resistant infections in a single year. In India alone, an estimated &lt;strong&gt;297,000 deaths were directly attributable to AMR, and about 1.04 million were associated with it.&lt;/strong&gt; (&lt;a href="https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(21)02724-0/fulltext" rel="noopener noreferrer"&gt;Lancet GRAM 2019&lt;/a&gt;, &lt;a href="https://journals.lww.com/pmrr/fulltext/2024/01040/antimicrobial_resistance__an_emerging_global.1.aspx" rel="noopener noreferrer"&gt;India figures&lt;/a&gt;)&lt;/p&gt;

&lt;p&gt;I'm not a doctor and I can't change how medicines are sold. But I'm a developer, and I kept circling one small thing I might actually be able to build:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;What if a person could at least understand what they're holding — before they take it?&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;So I built &lt;strong&gt;Dawa Saathi&lt;/strong&gt;. You send a photo of a medicine label, and it reads it with AI and explains, in plain language: what the medicine is commonly used for, its common and serious side effects, and the key warnings (pregnancy, liver, kidney, alcohol, children, the elderly). It does &lt;strong&gt;not&lt;/strong&gt; diagnose. It does &lt;strong&gt;not&lt;/strong&gt; give a dose. It never says a medicine is "safe to take." Because with medicine, a confident wrong answer is more dangerous than no answer at all.&lt;/p&gt;

&lt;h2&gt;
  
  
  The gap I shipped with (the "before")
&lt;/h2&gt;

&lt;p&gt;Here's the uncomfortable truth about the first version: &lt;strong&gt;it only spoke English.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I built Dawa Saathi for people in rural and small-town India — the elderly, people with little formal schooling, people who grew up reading Kannada, Hindi, or Marathi, not English. I come from that world myself; I went to a Marathi-medium government school in a village before I ever wrote a line of code. And yet the tool I built &lt;em&gt;for&lt;/em&gt; those people answered every single question in fluent English.&lt;/p&gt;

&lt;p&gt;It &lt;em&gt;worked&lt;/em&gt;. The AI read labels accurately, the safety guardrails held, caching and rate limits were solid. But it was "finished" the way a bridge is finished when it stops halfway across the river. The one feature that decided whether a real person in my community could actually use it — language — was the thing I'd left undone.&lt;/p&gt;

&lt;p&gt;The Finish-Up-A-Thon prompt ("revive a project you started but never completed") landed on exactly that nerve.&lt;/p&gt;

&lt;h2&gt;
  
  
  One day, under pressure
&lt;/h2&gt;

&lt;p&gt;I gave myself the length of the challenge to fix it, and it came down to a single overnight session — about &lt;strong&gt;six hours straight&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;That kind of pressure cuts both ways. On one hand, a deadline is the best forcing function I know; the project had sat "good enough" for months precisely because nothing forced me to finish it. On the other hand, language support is not a cosmetic change — it touches the welcome flow, every button, every error message, the medicine response itself, and the storage layer. At 2 a.m., with a medical app where a sloppy mistake isn't just a bug, that's a lot of surface area to get right while tired.&lt;/p&gt;

&lt;p&gt;There were stretches where I went down the wrong path — I first started building a translation dictionary, realized it would silently drop medical words, and had to back it out and rethink the whole approach. Hours like that are where solo projects usually die.&lt;/p&gt;

&lt;h2&gt;
  
  
  How GitHub Copilot helped me through it
&lt;/h2&gt;

&lt;p&gt;I enabled GitHub Copilot specifically for this sprint, and I'll be honest about exactly what it did — good and bad — because that's the point of this challenge.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Where it genuinely sped me up:&lt;/strong&gt; the repetitive scaffolding. The new i18n layer has dozens of string entries across two languages; the Telegram handler needed every reply rewired to read from it; the callback router needed a whole new branch for language selection. From a comment and the surrounding code, Copilot autocompleted big chunks of that pattern-heavy work. Tasks that would have been an hour of careful typing became a few minutes of accepting and fixing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Where it helped me think under fatigue:&lt;/strong&gt; when the bot stubbornly kept replying in English even after I'd changed the code, Copilot Chat helped me reason through the deploy pipeline and spot that I was running an old compiled build (&lt;code&gt;npm start&lt;/code&gt; runs &lt;code&gt;dist/&lt;/code&gt;, not &lt;code&gt;src/&lt;/code&gt; — I'd forgotten to rebuild). At 3 a.m., that saved me from chasing a ghost for an hour. It also caught a sloppy callback branch where my "language" and "purpose" buttons could collide.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Where I had to stay in charge:&lt;/strong&gt; Copilot is confidently wrong sometimes. It suggested code that &lt;em&gt;looked&lt;/em&gt; right but quietly dropped a safety guardrail, and once invented an API shape that didn't exist. In a normal app that's an annoyance; in a &lt;strong&gt;medical&lt;/strong&gt; app, an unreviewed AI suggestion is exactly the "confident wrong answer" my whole project exists to prevent. So I read every line it gave me. The speed was real — but the judgment stayed mine.&lt;/p&gt;

&lt;p&gt;Honestly, Copilot didn't replace that six-hour night. It made the difference between finishing it and giving up around hour three.&lt;/p&gt;

&lt;h2&gt;
  
  
  The major update: what actually changed
&lt;/h2&gt;

&lt;p&gt;The first thing the bot now does on &lt;code&gt;/start&lt;/code&gt; is ask which language you're comfortable in:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;🌐 Please choose your language
ದಯವಿಟ್ಟು ನಿಮ್ಮ ಭಾಷೆಯನ್ನು ಆಯ್ಕೆಮಾಡಿ
[ English ]   [ ಕನ್ನಡ ]
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Tap a button, and &lt;strong&gt;everything afterward&lt;/strong&gt; — welcome, buttons, questions, and the full medicine explanation — comes back in that language. The choice is remembered, and you can change it anytime with &lt;code&gt;/language&lt;/code&gt;.&lt;/p&gt;

&lt;p&gt;Under the hood, I split the text into two kinds and treated them very differently:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Fixed UI text&lt;/strong&gt; (buttons, labels, errors, and the all-important medical disclaimer) lives in a small &lt;strong&gt;i18n table&lt;/strong&gt;. It's instant, costs zero tokens, and — critical for health content — the disclaimer in each language can be reviewed word-for-word instead of machine-guessed.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The medicine explanation is generated by the LLM directly in the user's language.&lt;/strong&gt; I deliberately did &lt;em&gt;not&lt;/em&gt; translate the English afterward. The awareness model writes its answer in Kannada from the start, while keeping JSON keys in English (so the parser never changes) and drug names in Latin script (so "Tranexamic Acid 500mg" is never mangled). Every safety rule — no dosage, no diagnosis, no "safe to take," always "consult a doctor" — is repeated for the target language.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;That choice means &lt;strong&gt;one model call instead of two&lt;/strong&gt;, no brittle dictionary dropping medical words, and a reply that reads like it was written &lt;em&gt;for&lt;/em&gt; a Kannada speaker rather than awkwardly converted into Kannada. Each language is cached separately, so the next person who scans the same strip gets an instant answer.&lt;/p&gt;

&lt;h2&gt;
  
  
  What this means for people in Karnataka
&lt;/h2&gt;

&lt;p&gt;This is the part I actually care about.&lt;/p&gt;

&lt;p&gt;Kannada is the first language of around &lt;strong&gt;44 million people&lt;/strong&gt;, and roughly &lt;strong&gt;two-thirds of Karnataka's ~68 million residents&lt;/strong&gt; speak it natively (&lt;a href="https://en.wikipedia.org/wiki/Kannada" rel="noopener noreferrer"&gt;2011 census / overview&lt;/a&gt;). Meanwhile, by census-based estimates only about &lt;strong&gt;one in ten Indians&lt;/strong&gt; use English at all — and a far smaller share read it comfortably enough to parse a medicine label or a list of side effects.&lt;/p&gt;

&lt;p&gt;Put those two facts together and the old version's flaw is stark: an English-only health tool quietly excluded the majority of the exact people it was meant to help. The grandmother in a Belagavi village holding a strip of tablets, the daily-wage worker in Kalaburagi who left school early — they could open the bot, but they couldn't &lt;em&gt;read&lt;/em&gt; it.&lt;/p&gt;

&lt;p&gt;With this update, that same grandmother can now ask in Kannada and get an answer she can actually understand — what the medicine is for, what to watch out for, and a clear, repeated nudge to see a doctor. It doesn't make the bot a doctor. It makes the bot &lt;em&gt;legible&lt;/em&gt; to the people who need it most. For a medicine-awareness tool, legibility in the user's own language isn't a feature — it's the whole job.&lt;/p&gt;

&lt;h2&gt;
  
  
  The before → after, at a glance
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;Before&lt;/th&gt;
&lt;th&gt;After&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Languages&lt;/td&gt;
&lt;td&gt;English only&lt;/td&gt;
&lt;td&gt;English + ಕನ್ನಡ (more in ~2 lines)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;First interaction&lt;/td&gt;
&lt;td&gt;Dumped an English welcome&lt;/td&gt;
&lt;td&gt;Asks your language, then speaks it&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Who could actually use it&lt;/td&gt;
&lt;td&gt;English readers (~1 in 10)&lt;/td&gt;
&lt;td&gt;Tens of millions of Kannada speakers&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Translation approach&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;td&gt;LLM generates in-language; i18n for fixed UI&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Reachable by my own family&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Yes&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Future plan
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Hindi and Marathi next&lt;/strong&gt; — the architecture already supports them; each new language is roughly a two-line change plus a reviewed disclaimer.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Voice output&lt;/strong&gt;, for users who can't read at all — arguably the people who need this most.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Reviewed translations&lt;/strong&gt; of the fixed disclaimer with help from native-speaker volunteers, so the safety text is never just my best guess.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Pharmacy and NGO partnerships&lt;/strong&gt; — the long-term home for something like this is alongside health workers and CSR programs, not as a lone bot.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  The honest impact
&lt;/h2&gt;

&lt;p&gt;I want to be careful not to overclaim. This won't fix the bigger problem. That needs doctors, pharmacists, and real regulation. One Telegram bot does not move a national health statistic.&lt;/p&gt;

&lt;p&gt;But the gap it closes is real and specific: a person can now understand, in their own language, what they're about to put in their body — before they take it. If that makes even a few people pause and think, or take one question to a doctor they'd otherwise have skipped, it was worth the night.&lt;/p&gt;

&lt;p&gt;If you work in public health or medicine, I'd genuinely value your guidance on doing this responsibly.&lt;/p&gt;

&lt;h2&gt;
  
  
  Try it / code
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Bot:&lt;/strong&gt; TELEGRAM_BOT_LINK — &lt;a href="https://t.me/dawasaathi_bot" rel="noopener noreferrer"&gt;https://t.me/dawasaathi_bot&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Source:&lt;/strong&gt; &lt;a href="https://github.com/SurajKhonde/Dawa_saathi_mediAssist" rel="noopener noreferrer"&gt;https://github.com/SurajKhonde/Dawa_saathi_mediAssist&lt;/a&gt; (MIT licensed)&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Demo:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Before_challenge&lt;/strong&gt;:-&lt;a href="https://drive.google.com/file/d/1UW5HsSfBZKWx93P-flJ9sUKVrg03JflS/view?usp=sharing" rel="noopener noreferrer"&gt;https://drive.google.com/file/d/1UW5HsSfBZKWx93P-flJ9sUKVrg03JflS/view?usp=sharing&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;After_challenge&lt;/strong&gt; :- &lt;a href="https://drive.google.com/file/d/1pDaeMIelnb9ymHF9UygBmR4DY6lT7Vh1/view?usp=sharing" rel="noopener noreferrer"&gt;https://drive.google.com/file/d/1pDaeMIelnb9ymHF9UygBmR4DY6lT7Vh1/view?usp=sharing&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;




&lt;p&gt;&lt;em&gt;Built solo. Not a substitute for a doctor — by design.&lt;/em&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Sources
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Non-prescription antimicrobial dispensing in Bangalore pharmacies (261-pharmacy simulated-client study): &lt;a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676185/" rel="noopener noreferrer"&gt;https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676185/&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Global burden of bacterial AMR, 2019 (Lancet, GRAM): &lt;a href="https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(21)02724-0/fulltext" rel="noopener noreferrer"&gt;https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(21)02724-0/fulltext&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;India-specific AMR mortality (297,000 attributable / 1,042,500 associated, 2019): &lt;a href="https://journals.lww.com/pmrr/fulltext/2024/01040/antimicrobial_resistance__an_emerging_global.1.aspx" rel="noopener noreferrer"&gt;https://journals.lww.com/pmrr/fulltext/2024/01040/antimicrobial_resistance__an_emerging_global.1.aspx&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Kannada speakers and Karnataka share (2011 census overview): &lt;a href="https://en.wikipedia.org/wiki/Kannada" rel="noopener noreferrer"&gt;https://en.wikipedia.org/wiki/Kannada&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>devchallenge</category>
      <category>githubchallenge</category>
    </item>
    <item>
      <title>Uncle, I burned ₹1000 in 4 runs — what did I do wrong?</title>
      <dc:creator>surajdev9</dc:creator>
      <pubDate>Fri, 05 Jun 2026 07:17:23 +0000</pubDate>
      <link>https://dev.to/surajdev9/uncle-i-burned-1000-in-4-runs-what-did-i-do-wrong-59il</link>
      <guid>https://dev.to/surajdev9/uncle-i-burned-1000-in-4-runs-what-did-i-do-wrong-59il</guid>
      <description>&lt;p&gt;"An uncle-nephew conversation about the new line item on every developer's bill: LLM tokens."&lt;/p&gt;




&lt;h1&gt;
  
  
  Uncle, I burned ₹1000 in 4 runs
&lt;/h1&gt;

&lt;p&gt;&lt;em&gt;An uncle-nephew conversation on why some AI calls cost paise, and others cost thousands.&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Uncle, I have a problem. I put ₹1000 of my own money into the AI API this month and it finished in &lt;em&gt;four runs&lt;/em&gt;. I don't even know what happened.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Sit down, beta. Drink chai first. Then tell me everything you built this month, from the beginning.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Okay. I built four things. Three of them were fine. One of them ate the whole month.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Good. We'll go one by one. Start with the easy one.&lt;/p&gt;




&lt;h2&gt;
  
  
  Project one: image text extraction
&lt;/h2&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; First one was simple. User uploads a photo — invoice, prescription, screenshot, whatever. I send it to the vision model. It returns the text. Done.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; How many calls per user?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; One. Maybe two if the image is bad and they retry.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; And the cost?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Honestly, kaka, almost nothing. A few paise per image. I scanned around 200 images during testing and barely touched ₹50.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Now stop and think. Why was it so cheap?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Aha… because it's just one image in, some text out. One round trip.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Exactly. &lt;strong&gt;One input, one output, one transaction.&lt;/strong&gt; Like sending one SMS. You know what you're paying before you press send. Keep this model in your head. It's the cheapest pattern that exists.&lt;/p&gt;




&lt;h2&gt;
  
  
  Project two: a customer support bot
&lt;/h2&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Second one was a support bot for a client. User types a question. Bot replies.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Where does the bot get its answers from?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; I wrote a system prompt — around 800 words — explaining the product, common questions, tone of voice. Every user message goes in with that prompt. Bot replies.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; And the cost?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; A bit higher than OCR. Each conversation was maybe ₹2 to ₹3. Total for the month, around ₹200 across hundreds of conversations.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Tell me — why is this one costlier than the image project?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; (thinking) …because the system prompt is going every time?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Yes. &lt;strong&gt;You're paying for the same 800 words of instructions on every single message.&lt;/strong&gt; The user pays for one message, you pay for 800 + their message. But conversations are short, so it doesn't get out of hand.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; And the bot replies are also small — 100, 150 words.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Small input, small output, slightly bigger envelope than OCR. Still safe. Now the next project.&lt;/p&gt;




&lt;h2&gt;
  
  
  Project three: document Q&amp;amp;A with RAG
&lt;/h2&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Third one was a document Q&amp;amp;A. User uploads a 50-page PDF, asks questions, gets answers from inside the document.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Smart. How did you handle the document?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; I chunked the PDF, embedded each chunk, stored them in a vector database. When the user asks, I pull the top 5 relevant chunks and send them with the question to the LLM.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Beta, that's actually a sensible design. Cost?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; ₹3 to ₹4 per question. The project as a whole cost me around ₹300 across some testing and a few real users.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Why higher than the support bot?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Because each question carries 5 chunks of document context. Maybe 3000 words of input on every call.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Correct. &lt;strong&gt;Bigger input on every call.&lt;/strong&gt; But notice what you didn't do — you didn't send the entire 50-page PDF every time. You sent only the 5 most relevant pieces. That's a smart RAG.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; What's a dumb one?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; A dumb RAG sends the whole 50-page document every time the user asks anything. Some people actually build this. They pay for 50 pages of input when they could have paid for 5. Don't be that person.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Noted.&lt;/p&gt;




&lt;h2&gt;
  
  
  Project four: the disaster
&lt;/h2&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Now tell me about the project that ate the rest of your money.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; (sighs) Job search agent. Idea was — user uploads their resume, my agent searches the web for matching jobs, returns the top ones with reasoning. Like a personal recruiter.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; How does the agent search?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; It has a web search tool. Searches job boards, company career pages — whatever comes up. Clicks into postings, reads them, decides if they match the resume, then either suggests them or searches again with refined keywords.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; How many of these search-and-read cycles per user?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; I didn't think about it carefully. The agent does around 5 searches. Each search returns 10 results. Then it fetches the most promising 3 or 4 pages in full. Reasons over them. Maybe searches again.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Cost per user?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; …₹200 to ₹250 per user, kaka.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; (silence)&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; I tested it four times to debug. ₹1000 gone. And it didn't even get anyone a job. It just &lt;em&gt;suggested&lt;/em&gt; some.&lt;/p&gt;




&lt;h2&gt;
  
  
  Where the money actually went
&lt;/h2&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Beta, listen carefully now. I want you to understand what really happened, because it will change how you think about every AI project from now on.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Tell me.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Look at the four projects we discussed.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;OCR:                [image] → [text]                          one call
Support bot:        [prompt + query] → [answer]               one call
RAG:                [prompt + chunks + query] → [answer]      one call
Web search agent:   [prompt + query] → [search results]
                  → [prompt + results] → [decide to fetch]
                  → [prompt + results + FULL PAGE] → [decide next]
                  → [prompt + results + page + ANOTHER PAGE] → ...
                  → and on, and on
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Oh… the last one is a loop.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Yes. And here is the part that breaks beginners. &lt;strong&gt;Every call inside that loop includes everything that came before it.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Wait, what?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; When the agent makes its second decision, it must see what the first search returned. So the second call carries the first search's results. When it makes the third decision, it must see the first two searches plus whatever pages were fetched in between. By the fifth step, the input on each call has grown to thousands and thousands of words.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; So the context keeps stacking.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; And web pages are &lt;em&gt;huge&lt;/em&gt;. One stripped-down job posting page is easily 2000 to 5000 words. Multiply by 3 or 4 pages, your agent is reasoning over 15,000 to 20,000 words on every step. Five steps of looping — you're at 80,000+ words of input billed for &lt;strong&gt;one user session.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Compared to OCR…&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Compared to OCR, your web search agent cost you roughly &lt;strong&gt;400 times more per user&lt;/strong&gt;. For the same kind of API.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; (quietly) I see.&lt;/p&gt;




&lt;h2&gt;
  
  
  The one rule to remember
&lt;/h2&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Pick up your pen. Write this down. Stick it above your laptop.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;The cost of an LLM application is not the cost of one call.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;It's the cost of one call × how many times you call × how big the input has grown by then.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Three things multiply together. Most beginner cost disasters come from forgetting one of them.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Which one bit me?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; All three at once. &lt;strong&gt;Many calls&lt;/strong&gt; because of the agent loop. &lt;strong&gt;Growing input&lt;/strong&gt; because of accumulated page content. And &lt;strong&gt;each call seemed cheap when you tested in isolation&lt;/strong&gt;, so you never saw the multiplication coming.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Kaka, this is like electricity. Each appliance seems small. The bill at the end of the month tells the truth.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Yes! Exactly that. AI tokens are electricity, not water connection. Variable, not fixed. Some operations are fans. Some are air conditioners running all day.&lt;/p&gt;




&lt;h2&gt;
  
  
  How to build the same agent without going broke
&lt;/h2&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; So how do I build a job search agent that doesn't bankrupt me?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; I'll give you five rules. Learn them properly.&lt;/p&gt;

&lt;h3&gt;
  
  
  Rule 1: Match the model to the job
&lt;/h3&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Not every step needs your most expensive model. There's a hierarchy.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Like what?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Extracting text from an image? Use a smaller, cheaper vision model. The task is mechanical. Classifying whether a job posting matches a resume? A small model is more than enough. Writing the final personalized recommendation letter to the user? &lt;em&gt;Now&lt;/em&gt; use your best model. That's the part where quality is visible.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; I was using the smartest, most expensive model for &lt;em&gt;every&lt;/em&gt; step.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; I know. Most people do. It's like using a JCB to plant a tulsi plant.&lt;/p&gt;

&lt;h3&gt;
  
  
  Rule 2: Don't pass full pages — pass extracts
&lt;/h3&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; If your agent fetches a 5000-word page, do not shove all 5000 words into the next reasoning step.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Then what?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Run a &lt;em&gt;cheap&lt;/em&gt; model first to pull only the relevant 200 words. Pass those 200 words to your expensive reasoning model. Two-stage processing. You can cut 80–90% of cost this way.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Bad:    [agent — best model] reads 5000-word page → decides
Good:   [extractor — cheap model] pulls 200 relevant words
     → [agent — best model] reads 200 words → decides
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Rule 3: Cap the loop
&lt;/h3&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Set a maximum number of iterations on your agent. Three searches. Four. Not "keep going until satisfied."&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Why?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Because an agent with no cap will happily run forever if you let it. Polite, hard-working, expensive servants need supervision.&lt;/p&gt;

&lt;h3&gt;
  
  
  Rule 4: Cache like your wallet depends on it
&lt;/h3&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; If two users have similar resumes, you're searching for the same jobs and reading the same pages. Cache the search results. Cache the extracted summaries. Cache the model's interpretations.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Disk is cheaper than tokens?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Disk is cheaper than tokens by a factor of a thousand. Always.&lt;/p&gt;

&lt;h3&gt;
  
  
  Rule 5: Use specialized APIs, not generic crawling
&lt;/h3&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; This was the biggest mistake in your job project.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Tell me.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; You were doing general web search and crawling job board HTML. That is the most expensive path that exists. Job boards have APIs. LinkedIn, Naukri, Indeed, Glassdoor — many of them have official APIs or structured data feeds.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Even paid?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Even paid. Pay ₹1000 once for an API subscription instead of ₹250 per user for crawling. The math turns positive at the fourth user.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; I thought building my own crawler was the clever, free way.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; It was the &lt;em&gt;expensive&lt;/em&gt; way wearing a free costume.&lt;/p&gt;




&lt;h2&gt;
  
  
  The bigger lesson
&lt;/h2&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Kaka, I think I've been treating the AI API like a database. Like a fixed cost.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; And what is it actually?&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; More like electricity. The more you do, the more it draws. Some operations draw a lot more than others.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Now you're thinking like an engineer, not just a developer.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; What's the difference?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; A developer asks, "does this work?" An engineer asks, "does this work, and what does it cost to run for ten thousand users?" Both questions matter. But only one of them will tell you whether your product survives past the third month.&lt;/p&gt;




&lt;h2&gt;
  
  
  The math before you build
&lt;/h2&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; So before I build my next AI project, what do I do?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Estimate four numbers for your worst-case user.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Tell me.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;ol&gt;
&lt;li&gt;How many model calls per session.&lt;/li&gt;
&lt;li&gt;How big the input is on each call (in words — roughly 1 word ≈ 1.3 tokens).&lt;/li&gt;
&lt;li&gt;How big the output is on each call.&lt;/li&gt;
&lt;li&gt;What you're paying per million input and per million output tokens for your chosen model.&lt;/li&gt;
&lt;/ol&gt;
&lt;/blockquote&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Multiply them out. If the per-user cost is more than what the user pays you — or more than you can absorb as a free tier — you don't have a product. You have a charity that will close in two months.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Or worse, a bill that goes to my father.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; (laughs) Don't make me explain this conversation to him.&lt;/p&gt;




&lt;h2&gt;
  
  
  A short summary for the next nephew
&lt;/h2&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Kaka, can you give me a small table I can keep in my notes?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Take this.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Project type&lt;/th&gt;
&lt;th&gt;Pattern&lt;/th&gt;
&lt;th&gt;Cost behaviour&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Image OCR&lt;/td&gt;
&lt;td&gt;One image → one output&lt;/td&gt;
&lt;td&gt;Cheap, predictable&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Support bot&lt;/td&gt;
&lt;td&gt;Prompt + question → answer&lt;/td&gt;
&lt;td&gt;Cheap, mildly accumulating&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Document RAG&lt;/td&gt;
&lt;td&gt;Retrieved chunks + question&lt;/td&gt;
&lt;td&gt;Moderate, scales with chunk size&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Agent + web search&lt;/td&gt;
&lt;td&gt;Many loops with growing context&lt;/td&gt;
&lt;td&gt;Expensive, scales explosively&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; The hierarchy is not about technology being "advanced" or "basic." It is about &lt;strong&gt;how many tokens flow through the model per user, per session.&lt;/strong&gt; Some patterns naturally produce a small flow. Others produce a flood.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; Know which one I'm building. Do the math first. Pick the right model for each step. Cache aggressively.&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; And one last thing.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; What?&lt;/p&gt;

&lt;p&gt;👨‍🦳 &lt;strong&gt;Uncle:&lt;/strong&gt; Next time, build a small calculator before you build the agent. One spreadsheet cell would have saved you a thousand rupees.&lt;/p&gt;

&lt;p&gt;👦 &lt;strong&gt;Nephew:&lt;/strong&gt; (laughs) Noted, kaka.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;If you found this useful, I write about backend engineering, system design, and these kinds of practical lessons from real projects. Less noise, more action.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;— until the next nephew project goes sideways.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>llm</category>
      <category>softwareengineering</category>
      <category>systemdesign</category>
    </item>
    <item>
      <title>Long ChatGPT chats kept breaking my copy-paste. So I built an exporter.</title>
      <dc:creator>surajdev9</dc:creator>
      <pubDate>Fri, 05 Jun 2026 06:01:08 +0000</pubDate>
      <link>https://dev.to/surajdev9/long-chatgpt-chats-kept-breaking-my-copy-paste-so-i-built-an-exporter-3bej</link>
      <guid>https://dev.to/surajdev9/long-chatgpt-chats-kept-breaking-my-copy-paste-so-i-built-an-exporter-3bej</guid>
      <description>&lt;p&gt;A free Chrome extension I built to save full ChatGPT conversations as Markdown in one click — because the existing ways were genuinely painful.&lt;/p&gt;

&lt;h2&gt;
  
  
  tags: chrome, productivity, opensource, chatgpt
&lt;/h2&gt;

&lt;p&gt;If you've ever had one of those long ChatGPT conversations — the kind where you spent half a day going deep into system design, or debugging something tricky, or learning a new concept end to end — you've probably run into this moment:&lt;/p&gt;

&lt;p&gt;You finish. You realize the chat is gold. You want to save it.&lt;/p&gt;

&lt;p&gt;You hit &lt;code&gt;Ctrl + A&lt;/code&gt;.&lt;/p&gt;

&lt;p&gt;Your browser freezes for ten seconds.&lt;/p&gt;

&lt;p&gt;You finally manage to copy. You paste it into a doc. Half the formatting is gone. Code blocks are mangled. The structure that made the conversation useful is just… not there anymore.&lt;/p&gt;

&lt;p&gt;That's where this story starts.&lt;/p&gt;




&lt;h2&gt;
  
  
  The problem (and why it's worse than it sounds)
&lt;/h2&gt;

&lt;p&gt;I learn a lot from ChatGPT. Not in a passive "ask one question, move on" way — I have &lt;strong&gt;multi-day conversations&lt;/strong&gt; going on system design, RabbitMQ internals, architecture decisions, debugging sessions. Some of my chats have &lt;strong&gt;800+ messages&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;These conversations are genuinely valuable. They're notes I keep coming back to. They're the work.&lt;/p&gt;

&lt;p&gt;But getting them &lt;em&gt;out&lt;/em&gt; of ChatGPT is shockingly bad:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Copy-paste&lt;/strong&gt;: freezes the browser on long chats, mangles formatting&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Screenshot&lt;/strong&gt;: useless if you ever need to search or re-read it&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;ChatGPT's own export feature&lt;/strong&gt;: emails you a giant ZIP… sometimes days later… of &lt;em&gt;everything you've ever written&lt;/em&gt;. No per-chat option.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Manual cleanup&lt;/strong&gt;: spending an hour formatting one conversation into Markdown is not what I want to be doing at 11 PM&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I checked if a tool already existed. Some did — most were either broken on long chats, badly designed, full of trackers, or only saved the messages currently visible on screen (which is a &lt;em&gt;fraction&lt;/em&gt; of a long conversation, because ChatGPT only keeps the visible messages loaded in memory).&lt;/p&gt;

&lt;p&gt;So I built my own.&lt;/p&gt;




&lt;h2&gt;
  
  
  What it actually does
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;ChatGPT Exporter&lt;/strong&gt; is a Chrome extension that does one thing:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Save your entire ChatGPT conversation as a clean Markdown file in one click.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That's it. No account, no signup, no upload, no cloud. You click a button, your file downloads.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkoofodr882o9be4mtkfw.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkoofodr882o9be4mtkfw.png" alt=" " width="800" height="480"&gt;&lt;/a&gt;&lt;br&gt;
It has three modes depending on the situation:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Export Full Chat&lt;/strong&gt; — the main one. Works on chats of any length, including ones with thousands of messages. One click, full conversation, done.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Export Visible&lt;/strong&gt; — when you only want to save the part you're looking at right now.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Scroll Capture&lt;/strong&gt; — a fallback that captures messages as you scroll through the chat. Useful if the first mode ever hiccups.&lt;/p&gt;

&lt;p&gt;The output is clean Markdown with proper headings, code blocks (with the right language), bold, italics, links, lists — everything that made the conversation structured stays structured.&lt;/p&gt;




&lt;h2&gt;
  
  
  Who it's for
&lt;/h2&gt;

&lt;p&gt;It started as a tool for me, but it turns out a lot of people have this problem:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Code learners&lt;/strong&gt; taking notes from long debugging or learning sessions&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Writers and researchers&lt;/strong&gt; preserving drafts and idea threads&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Developers&lt;/strong&gt; archiving architectural discussions they want to reference later&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Anyone&lt;/strong&gt; who realized too late that an important conversation got buried in their chat history&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If you've ever thought "I should save that conversation somewhere I can actually find it" and then didn't, because the friction was too high — this is for you.&lt;/p&gt;




&lt;h2&gt;
  
  
  Try it
&lt;/h2&gt;

&lt;p&gt;🔗 &lt;strong&gt;Chrome Web Store&lt;/strong&gt;:&lt;br&gt;
&lt;a href="https://chromewebstore.google.com/detail/pmkhjmgnkohjdgninjfkdcegfkmnkfca" rel="noopener noreferrer"&gt;https://chromewebstore.google.com/detail/pmkhjmgnkohjdgninjfkdcegfkmnkfca&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;📦 &lt;strong&gt;Source code&lt;/strong&gt;: &lt;a href="https://github.com/SurajKhonde/chatgpt-exporter" rel="noopener noreferrer"&gt;https://github.com/SurajKhonde/chatgpt-exporter&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;It's open source under MIT. Free forever. No ads, no tracking, no account.&lt;/p&gt;




&lt;h2&gt;
  
  
  What I learned building it
&lt;/h2&gt;

&lt;p&gt;A few things I picked up that might be useful to anyone building browser extensions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Manifest V3 is much stricter than V2.&lt;/strong&gt; Permissions get audited automatically — request only what you actually use, or your submission gets rejected. (Ask me how I know.)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;DOM virtualization is a real problem.&lt;/strong&gt; ChatGPT only mounts the messages currently near the viewport, so naively scraping the page misses 90% of a long conversation. Solving this properly took more iteration than I expected.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Stable IDs matter.&lt;/strong&gt; Every message in ChatGPT has a unique ID — using that as the dedup key (instead of position) is the difference between a working exporter and a broken one.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Shipping a Chrome extension is a process.&lt;/strong&gt; Icons, screenshots, promo tiles, privacy practices, justifications for every permission… expect to spend almost as much time on the store listing as the code.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  What's next
&lt;/h2&gt;

&lt;p&gt;A few things I'd like to add:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;PDF export (Markdown → PDF in one click)&lt;/li&gt;
&lt;li&gt;JSON export (raw conversation tree, for people who want to do their own processing)&lt;/li&gt;
&lt;li&gt;Batch export — saving multiple conversations at once&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If you have ideas, PRs welcome on GitHub.&lt;/p&gt;




&lt;h2&gt;
  
  
  One last thing
&lt;/h2&gt;

&lt;p&gt;If you've also been losing important conversations in your ChatGPT history, give this a try. And if it saves you time, drop a ⭐ on the repo or share this with someone who needs it.&lt;/p&gt;

&lt;p&gt;Built out of a real problem I face daily. Hope it helps you too.&lt;/p&gt;

&lt;p&gt;— Suraj&lt;/p&gt;

</description>
      <category>webdev</category>
      <category>programming</category>
      <category>productivity</category>
    </item>
  </channel>
</rss>
