<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Ismail Hossen</title>
    <description>The latest articles on DEV Community by Ismail Hossen (@ihossen016).</description>
    <link>https://dev.to/ihossen016</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3457126%2Fcd271d20-b1e2-46cc-a5a4-f11006082c32.jpeg</url>
      <title>DEV Community: Ismail Hossen</title>
      <link>https://dev.to/ihossen016</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/ihossen016"/>
    <language>en</language>
    <item>
      <title>Micro-Business Digital Assistant — Track Sales &amp; Expenses with AI</title>
      <dc:creator>Ismail Hossen</dc:creator>
      <pubDate>Thu, 11 Sep 2025 04:55:40 +0000</pubDate>
      <link>https://dev.to/ihossen016/micro-business-digital-assistant-track-sales-expenses-with-ai-86j</link>
      <guid>https://dev.to/ihossen016/micro-business-digital-assistant-track-sales-expenses-with-ai-86j</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/google-ai-studio-2025-09-03"&gt;Google AI Studio Multimodal Challenge&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Built
&lt;/h2&gt;

&lt;p&gt;I built a &lt;strong&gt;Micro-Business Digital Assistant&lt;/strong&gt; that helps small business owners keep track of their daily sales and expenses with minimal effort.&lt;/p&gt;

&lt;p&gt;The app provides:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Sales &amp;amp; Expense Tracking&lt;/strong&gt; with manual entry and AI automation.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Persistent storage&lt;/strong&gt; in the browser (IndexedDB) so records survive page reloads and browser restarts.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Summaries &amp;amp; Charts&lt;/strong&gt; to visualize financial health.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Multi-language support&lt;/strong&gt; (English + Bengali) with local currency customization.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The goal was to create a lightweight, offline-first tool that works in any modern browser without requiring sign-ups, servers, or external databases.&lt;/p&gt;

&lt;h2&gt;
  
  
  Demo
&lt;/h2&gt;

&lt;p&gt;🔗 &lt;strong&gt;Live Applet on Cloud Run:&lt;/strong&gt; &lt;a href="https://micro-business-assistant-263910167686.us-west1.run.app/" rel="noopener noreferrer"&gt;https://micro-business-assistant-263910167686.us-west1.run.app/&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  🖼️ Screenshots:
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5ajwwdj5sc7wbp83hjzj.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5ajwwdj5sc7wbp83hjzj.png" alt="Sales"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fsd70cdexjyvkd4opaiie.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fsd70cdexjyvkd4opaiie.png" alt="Expenses"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9d730wn54invsbrosw82.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9d730wn54invsbrosw82.png" alt="Summary"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  How I Used Google AI Studio
&lt;/h2&gt;

&lt;p&gt;I used &lt;strong&gt;Google AI Studio Build mode&lt;/strong&gt; with &lt;strong&gt;Gemini 2.5 Flash&lt;/strong&gt; to handle all multimodal inputs directly in the browser:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Image input&lt;/strong&gt;: Photos of receipts or handwritten notes are parsed into structured JSON.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Audio input&lt;/strong&gt;: Voice memos describing sales/expenses are transcribed and extracted into records.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Text summarization&lt;/strong&gt;: Daily summaries are generated in English or Bengali with totals and a short narrative.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Multimodal Features
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Image → Data (OCR)&lt;/strong&gt;: Upload a receipt/invoice/handwritten note → AI extracts type, item, quantity, amount, and date.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Audio → Data (Speech-to-Text + NLP)&lt;/strong&gt;: Record a quick note like “Sold 3 coffees for 150 taka today” → AI transcribes and saves it as a sale.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Summaries → Narrative&lt;/strong&gt;: Daily totals computed locally, then AI generates a localized business summary in English or Bengali.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These multimodal features make the app accessible for business owners who may not have time to type every transaction manually. They can snap a photo or record a quick voice note.&lt;/p&gt;
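
&lt;p&gt;The "daily totals computed locally" step can be sketched as below. This is a minimal illustration, not the app's actual code: the &lt;code&gt;BusinessRecord&lt;/code&gt; shape is an assumption built from the fields listed above (type, item, quantity, amount, date), and &lt;code&gt;dailyTotals&lt;/code&gt; is a hypothetical helper name.&lt;/p&gt;

```typescript
// Hypothetical record shape, based on the fields the post says the AI extracts.
type RecordType = "sale" | "expense";

interface BusinessRecord {
  type: RecordType;
  item: string;
  quantity: number;
  amount: number; // total amount for the entry, in the user's currency
  date: string;   // ISO date string, e.g. "2025-09-10" (assumed format)
}

interface DailyTotals {
  sales: number;
  expenses: number;
  net: number;
}

// Sum sales and expenses for a single calendar day.
function dailyTotals(records: BusinessRecord[], day: string): DailyTotals {
  let sales = 0;
  let expenses = 0;
  for (const r of records) {
    if (r.date !== day) continue; // skip records from other days
    if (r.type === "sale") sales += r.amount;
    else expenses += r.amount;
  }
  return { sales, expenses, net: sales - expenses };
}

const sample: BusinessRecord[] = [
  { type: "sale", item: "coffee", quantity: 3, amount: 150, date: "2025-09-10" },
  { type: "expense", item: "milk", quantity: 2, amount: 80, date: "2025-09-10" },
  { type: "sale", item: "tea", quantity: 1, amount: 30, date: "2025-09-11" },
];

const totals = dailyTotals(sample, "2025-09-10");
console.log(totals);
```

&lt;p&gt;Computing the numbers in the browser and sending only the totals to the model for the narrative keeps the arithmetic deterministic; the model is then responsible for wording only.&lt;/p&gt;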

&lt;h2&gt;
  
  
  Core Features
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Sales &amp;amp; Expense Tracking&lt;/strong&gt;: Log and categorize all transactions.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Persistent Storage&lt;/strong&gt;: Records are saved in the browser's IndexedDB.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Manual Data Entry&lt;/strong&gt;: Add/edit records by hand.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;AI Automation&lt;/strong&gt;: OCR for receipts, speech-to-text for audio.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Interactive Dashboard&lt;/strong&gt;: Summaries with charts and filters.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Data Table&lt;/strong&gt;: Paginated, with live search.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Exports&lt;/strong&gt;: CSV and Excel reports.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Customization&lt;/strong&gt;: English/Bengali, custom currency.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Toast Notifications&lt;/strong&gt; and &lt;strong&gt;Sample Data&lt;/strong&gt; for smooth onboarding.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
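
&lt;p&gt;As one illustration of the CSV export listed above, here is a rough sketch of turning records into CSV text. The column set and the names &lt;code&gt;ExportRow&lt;/code&gt;, &lt;code&gt;csvField&lt;/code&gt;, and &lt;code&gt;toCsv&lt;/code&gt; are assumptions for this example; the app's real export code is not shown in this post.&lt;/p&gt;

```typescript
// Hypothetical row shape; the app's real export columns may differ.
interface ExportRow {
  date: string;
  type: string;
  item: string;
  quantity: number;
  amount: number;
}

// Quote a field only when it contains a comma, a double quote, or a line break.
// Embedded double quotes are doubled, following common CSV practice.
function csvField(value: string | number): string {
  const text = String(value);
  if (/[",\r\n]/.test(text)) {
    return '"' + text.replace(/"/g, '""') + '"';
  }
  return text;
}

// Build the CSV text: one header line, then one line per record.
function toCsv(rows: ExportRow[]): string {
  const header = ["date", "type", "item", "quantity", "amount"];
  const lines = [header.join(",")];
  for (const row of rows) {
    const fields = [row.date, row.type, row.item, row.quantity, row.amount];
    lines.push(fields.map((f) => csvField(f)).join(","));
  }
  return lines.join("\r\n");
}

const csv = toCsv([
  { date: "2025-09-10", type: "sale", item: 'Coffee, "large"', quantity: 3, amount: 150 },
]);
console.log(csv);
```

&lt;p&gt;In a browser, the resulting string could then be wrapped in a &lt;code&gt;Blob&lt;/code&gt; and offered as a download.&lt;/p&gt;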

&lt;p&gt;This project shows how &lt;strong&gt;Google AI Studio’s multimodal&lt;/strong&gt; capabilities can be used to make business tools more natural and human-friendly — turning everyday receipts and voice notes into actionable financial data.&lt;/p&gt;

</description>
      <category>devchallenge</category>
      <category>googleaichallenge</category>
      <category>ai</category>
      <category>gemini</category>
    </item>
    <item>
      <title>AI Health Companion — Making healthcare information accessible for everyone.</title>
      <dc:creator>Ismail Hossen</dc:creator>
      <pubDate>Tue, 09 Sep 2025 04:43:52 +0000</pubDate>
      <link>https://dev.to/ihossen016/ai-health-companion-making-healthcare-information-accessible-for-everyone-4nbi</link>
      <guid>https://dev.to/ihossen016/ai-health-companion-making-healthcare-information-accessible-for-everyone-4nbi</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/google-ai-studio-2025-09-03"&gt;Google AI Studio Multimodal Challenge&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Built
&lt;/h2&gt;

&lt;p&gt;I built AI Health Companion, an accessibility-focused applet designed to support patients and caregivers with three core modes:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Visual Aid&lt;/strong&gt;: Users upload an image, and the app describes the scene in plain language, highlighting important objects and potential hazards. In addition to the text description, users can &lt;strong&gt;listen to the description as an audio file&lt;/strong&gt;, making it even more accessible for people with visual impairments.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Symptom Recorder&lt;/strong&gt;: Users record or upload a short audio clip of their symptoms. The app transcribes the speech and summarizes the key symptoms in simple terms.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Report Simplifier&lt;/strong&gt;: Users upload a PDF or image of a lab report, and the app provides a plain-language explanation of key information with a glossary of terms. To make the experience more helpful, users can &lt;strong&gt;listen to the simplified explanation&lt;/strong&gt; and also &lt;strong&gt;download the simplified report as a PDF&lt;/strong&gt; for sharing or offline use.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Together, these modes address real problems faced by patients with vision loss, elderly users who communicate better verbally, and anyone struggling with complex medical documents.&lt;/p&gt;

&lt;h3&gt;
  
  
  Key Features
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Visual Aid with audio playback&lt;/strong&gt;: Scene description plus text-to-speech output.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Symptom Recorder&lt;/strong&gt;: Audio-to-text transcription and symptom summarization.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Report Simplifier with audio + PDF download&lt;/strong&gt;: Plain-language explanations that can be listened to or saved as a PDF.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Strict input limits&lt;/strong&gt;: 5 MB max for images and 2 MB max for PDFs to stay within the Gemini free tier.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Deployed directly from Google AI Studio to Cloud Run&lt;/strong&gt; for a seamless build-and-deploy pipeline.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
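
&lt;p&gt;The input limits above could be enforced before any request is sent to Gemini. The sketch below is an assumption about how such a check might look, not the applet's actual code: &lt;code&gt;UploadInfo&lt;/code&gt; stands in for the browser's &lt;code&gt;File&lt;/code&gt; object, &lt;code&gt;validateUpload&lt;/code&gt; is a hypothetical name, and 1 MB is taken to be 1024 × 1024 bytes.&lt;/p&gt;

```typescript
// Limits quoted in the post: 5 MB for images, 2 MB for PDFs.
// Assumption: 1 MB is treated as 1024 * 1024 bytes.
const MB = 1024 * 1024;
const MAX_IMAGE_BYTES = 5 * MB;
const MAX_PDF_BYTES = 2 * MB;

// Minimal stand-in for the two browser File fields used here (hypothetical shape).
interface UploadInfo {
  type: string; // MIME type, e.g. "image/png" or "application/pdf"
  size: number; // size in bytes
}

// Returns an error message, or null when the upload is acceptable.
function validateUpload(file: UploadInfo): string | null {
  if (file.type.indexOf("image/") === 0) {
    return file.size > MAX_IMAGE_BYTES ? "Image exceeds the 5 MB limit." : null;
  }
  if (file.type === "application/pdf") {
    return file.size > MAX_PDF_BYTES ? "PDF exceeds the 2 MB limit." : null;
  }
  return "Unsupported file type.";
}

console.log(validateUpload({ type: "image/png", size: 3 * MB }));
console.log(validateUpload({ type: "application/pdf", size: 3 * MB }));
```

&lt;p&gt;Rejecting oversized files on the client avoids spending a request on input that would be refused anyway.&lt;/p&gt;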

&lt;h2&gt;
  
  
  Demo
&lt;/h2&gt;

&lt;p&gt;🔗 &lt;strong&gt;Live Applet on Cloud Run:&lt;/strong&gt; &lt;a href="https://ai-health-companion-390658277222.us-west1.run.app/" rel="noopener noreferrer"&gt;https://ai-health-companion-390658277222.us-west1.run.app/&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  🖼️ Screenshots:
&lt;/h3&gt;

&lt;p&gt;👁️ &lt;strong&gt;Visual Aid&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F426ucyc6f78wg1g8owdw.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F426ucyc6f78wg1g8owdw.png" alt="Visual Aid"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftc9dpk7so5n127u1j6nu.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftc9dpk7so5n127u1j6nu.png" alt="Visual Aid result"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;📝 &lt;strong&gt;Symptom Recorder&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyzuav4kl4qp4axfwzvhb.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyzuav4kl4qp4axfwzvhb.png" alt="Symptom Recorder"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo1pc3tbsz5mjlpk5uhsv.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo1pc3tbsz5mjlpk5uhsv.png" alt="Symptom Recoder Result"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;📊 &lt;strong&gt;Report Simplifier&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyb5n9cepirq5rk6cbfuk.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyb5n9cepirq5rk6cbfuk.png" alt="Report Simplifier"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fon2tpajiyhw3g3ji48s0.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fon2tpajiyhw3g3ji48s0.png" alt="Report Simplifier Result"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  How I Used Google AI Studio
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;I created the applet in &lt;strong&gt;Google AI Studio’s Build mode&lt;/strong&gt;, where I designed and refined the prompts for each mode.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;I specified &lt;strong&gt;structured JSON outputs&lt;/strong&gt; directly in the system instructions to ensure reliable parsing.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Once the applet was stable, I used &lt;strong&gt;AI Studio’s one-click deployment&lt;/strong&gt; to deploy it directly to &lt;strong&gt;Google Cloud Run&lt;/strong&gt;.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;
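
&lt;p&gt;Even with a JSON format requested in the system instructions, it is safer to validate the reply before using it. The sketch below shows one way to do that; the &lt;code&gt;SymptomSummary&lt;/code&gt; shape and the &lt;code&gt;parseSymptomSummary&lt;/code&gt; helper are hypothetical, since the applet's real schema is not shown in this post.&lt;/p&gt;

```typescript
// Hypothetical output shape for the Symptom Recorder; the real schema is not shown in the post.
interface SymptomSummary {
  transcript: string;
  symptoms: string[];
}

// Parse the model's text reply and check that it matches the expected shape.
// Returns null instead of throwing so the UI can show a retry message.
function parseSymptomSummary(raw: string): SymptomSummary | null {
  let data: unknown;
  try {
    data = JSON.parse(raw);
  } catch {
    return null; // reply was not valid JSON
  }
  if (typeof data !== "object" || data === null) return null;
  const obj = data as { transcript?: unknown; symptoms?: unknown };
  if (typeof obj.transcript !== "string") return null;
  if (!Array.isArray(obj.symptoms)) return null;
  if (!obj.symptoms.every((s) => typeof s === "string")) return null;
  return { transcript: obj.transcript, symptoms: obj.symptoms as string[] };
}

const ok = parseSymptomSummary('{"transcript":"I have a headache","symptoms":["headache"]}');
const bad = parseSymptomSummary("not json");
console.log(ok, bad);
```

&lt;p&gt;Returning &lt;code&gt;null&lt;/code&gt; for anything unexpected keeps malformed model output from reaching the rendering code.&lt;/p&gt;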

&lt;h3&gt;
  
  
  Models Used
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Gemini 2.5 Flash&lt;/strong&gt; was the default model, chosen for speed and efficiency.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;For more complex reasoning tasks (like analyzing detailed reports), the applet can optionally use &lt;strong&gt;Gemini 2.5 Pro&lt;/strong&gt;.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Both models’ multimodal capabilities were leveraged for image understanding, audio transcription, and text simplification.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Multimodal Features
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Image Understanding (Visual Aid)&lt;/strong&gt;: Gemini processes uploaded images to describe content, list objects, and identify hazards. Added feature: users can &lt;strong&gt;listen to the description as speech&lt;/strong&gt;.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Audio Understanding (Symptom Recorder)&lt;/strong&gt;: Gemini transcribes patient voice notes and summarizes them into key symptoms.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Document + Image Understanding (Report Simplifier)&lt;/strong&gt;: Gemini explains lab reports in everyday language with glossary terms. Added features: users can &lt;strong&gt;listen to the simplified explanation&lt;/strong&gt; and &lt;strong&gt;download the result as a PDF&lt;/strong&gt;.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;By combining &lt;strong&gt;image, audio, and text processing&lt;/strong&gt; through &lt;strong&gt;Gemini 2.5 Flash/Pro&lt;/strong&gt;, the applet delivers a practical, real-world healthcare companion experience that is lightweight, privacy-friendly, and accessible.&lt;/p&gt;

</description>
      <category>devchallenge</category>
      <category>googleaichallenge</category>
      <category>ai</category>
      <category>gemini</category>
    </item>
  </channel>
</rss>
