<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Anushka Singh</title>
    <description>The latest articles on DEV Community by Anushka Singh (@anushka_singh09).</description>
    <link>https://dev.to/anushka_singh09</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3651595%2Fec9c8945-8078-4c67-9705-e16b0bcae11b.png</url>
      <title>DEV Community: Anushka Singh</title>
      <link>https://dev.to/anushka_singh09</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/anushka_singh09"/>
    <language>en</language>
    <item>
      <title>The Agent that grows with you</title>
      <dc:creator>Anushka Singh</dc:creator>
      <pubDate>Sun, 31 May 2026 01:24:50 +0000</pubDate>
      <link>https://dev.to/anushka_singh09/the-agent-that-grows-with-you-3ne8</link>
      <guid>https://dev.to/anushka_singh09/the-agent-that-grows-with-you-3ne8</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/hermes-agent-2026-05-15"&gt;Hermes Agent Challenge&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;A very good evening everyone!&lt;br&gt;
Hermes agent latest version is new in town and I can't keep it to myself that I am very glad that I could complete the &lt;em&gt;build challenge&lt;/em&gt; yesterday by showcasing personal AI newspaper in which the user receives personalised news according to their cup of tea everyday.&lt;br&gt;
Well! Let's not go there. It's been more than 5 months learning Agentic AI and I barely scratched the surface with making useful projects in this particular domain.&lt;br&gt;
If you know you know that Hermes agent is the self improving AI agent. Plus its session search is 4,500x faster this time and the agent is absolutely free. It is having built-in learning loop yes I am discussing about GEPA memory.&lt;/p&gt;

&lt;h3&gt;
  
  
  What is GEPA
&lt;/h3&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;              Generate → Evaluate → Prune → Accumulate
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;Generate — an agent attempt to do some task and learns skill based on it&lt;br&gt;
Evaluate — assigns grades to the skill whether it is helpful or not&lt;br&gt;
Prune — it deletes the unnecessary skill which won't be used likely&lt;br&gt;
Accumulate — it saves the skills for the next runs&lt;/p&gt;

&lt;p&gt;Basically it is a learning loop which learns as a human does.After each run the agent gets smarter because it retains the memory of the job done in the past. &lt;/p&gt;

&lt;h3&gt;
  
  
  The Memory System
&lt;/h3&gt;

&lt;p&gt;It does not only utilise skill memory but also uses FTS5 full-text search on past discourse which means the context can be found at ease from the long corpus of given text. &lt;br&gt;
It works as a curator agent running in the background and uses GEPA to make the context stronger and relevant. In other words I should say The agent itself discards the memory which is considered obsolete.&lt;/p&gt;

&lt;h3&gt;
  
  
  Hermes agent's Core Architecture
&lt;/h3&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Figrxvljblm5gg3ysz6mi.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Figrxvljblm5gg3ysz6mi.png" alt="Image d" width="800" height="776"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Interface layer — user communicates with the agent (CLI, Telegram, WhatsApp etc.) supports 20+ platforms.&lt;br&gt;
Agent core — LLM, planner, and tool dispatcher reside in the agent core&lt;br&gt;
Tool layer — 40+ built-in tools such as web search, browser automation, vision, file system, code execution which the agent uses&lt;br&gt;
Memory system — GEPA loop &lt;br&gt;
Output layer — final result in the form of text, files, emails and code&lt;/p&gt;

&lt;p&gt;It costs nothing when you keep it idle and provides MCP support as well.You can run it on a $5 VPS, a GPU cluster, or serverless infrastructure through Daytona or Modal. Even if you are not working for example using phone for some research task your work will be finished after you come back without keeping local machine running. That's the power of Hermes Agent&lt;/p&gt;

&lt;p&gt;Courtesy: Claude for Architecture Diagram&lt;/p&gt;

</description>
      <category>hermesagentchallenge</category>
      <category>devchallenge</category>
      <category>agents</category>
      <category>ai</category>
    </item>
    <item>
      <title>your personal AI newspaper</title>
      <dc:creator>Anushka Singh</dc:creator>
      <pubDate>Fri, 29 May 2026 16:24:19 +0000</pubDate>
      <link>https://dev.to/anushka_singh09/your-personal-ai-newspaper-3ibk</link>
      <guid>https://dev.to/anushka_singh09/your-personal-ai-newspaper-3ibk</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/hermes-agent-2026-05-15"&gt;Hermes Agent Challenge&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Built
&lt;/h2&gt;

&lt;p&gt;Well we are always in a hurry in the morning catching up on that favourite mocha and toast in our hand. Realising the need to give time to the news at that time of the day sounds unrealistic for the professionals in rush and struggling to make time for themselves even. Recognising a typical physical newspaper to read and mug up the information to find insights would be time-consuming and  AI news apps to the least demand subscription and display detailed summary.What do I bring among you all is &lt;strong&gt;Daily Digest Agent&lt;/strong&gt; a self-hosted, fully autonomous AI agent that researches the news on your chosen topics every morning and delivers a structured briefing straight to your inbox.&lt;br&gt;
This is not a usual news digest tool which has a fixed prompt and hardcoded URLs. Hermes Agent runs a real multi-step loop — it plans its own searches, decides which articles are worth reading, fetches and extracts full article content, spots cross-topic patterns, and writes a clean digest. Then it emails it to you. Every single day, automatically, for free.The agent also gets smarter over time. After each run, it updates a skill memory log — tracking which topics returned rich results and which ran dry. By run 5, it's already adjusting its own search strategy without any input from you.&lt;br&gt;
The problem it solves is information overload. Rather clicking on multiple tabs your mail is the ultimate stoppage to get news highlights compiled by an agent that knows your interests and not which is optimizing based on the clicks.&lt;/p&gt;
&lt;h2&gt;
  
  
  Demo
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;How is mail displayed!&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9y8i6wg67hg371itciqn.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9y8i6wg67hg371itciqn.png" alt="Image1" width="800" height="744"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjvedi9caqpqnl0ihec51.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fjvedi9caqpqnl0ihec51.png" alt="Image2" width="800" height="731"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvaf1yz94yv911s5ge9jm.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvaf1yz94yv911s5ge9jm.png" alt="Image3" width="799" height="718"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdaz8mah7vbqjsls0t25b.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdaz8mah7vbqjsls0t25b.png" alt="Image4" width="623" height="248"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;
  
  
  Code
&lt;/h2&gt;


&lt;div class="ltag-github-readme-tag"&gt;
  &lt;div class="readme-overview"&gt;
    &lt;h2&gt;
      &lt;img src="https://assets.dev.to/assets/github-logo-5a155e1f9a670af7944dd5e12375bc76ed542ea80224905ecaf878b9157cdefc.svg" alt="GitHub logo"&gt;
      &lt;a href="https://github.com/s17anushka" rel="noopener noreferrer"&gt;
        s17anushka
      &lt;/a&gt; / &lt;a href="https://github.com/s17anushka/daily-digest" rel="noopener noreferrer"&gt;
        daily-digest
      &lt;/a&gt;
    &lt;/h2&gt;
    &lt;h3&gt;
      
    &lt;/h3&gt;
  &lt;/div&gt;
  &lt;div class="ltag-github-body"&gt;
    
&lt;div id="readme" class="md"&gt;
&lt;div&gt;
&lt;div class="markdown-heading"&gt;
&lt;h1 class="heading-element"&gt;📰 Daily Digest Agent&lt;/h1&gt;
&lt;/div&gt;
&lt;p&gt;&lt;strong&gt;Your personal AI newspaper, delivered every morning.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Powered by &lt;a href="https://nousresearch.com/hermes/" rel="nofollow noopener noreferrer"&gt;Hermes Agent&lt;/a&gt; · Built on &lt;a href="https://openrouter.ai" rel="nofollow noopener noreferrer"&gt;OpenRouter&lt;/a&gt; · Zero paid APIs&lt;/p&gt;
&lt;p&gt;&lt;a href="https://python.org" rel="nofollow noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/b43cbee196e104f1912e1e1f08745aac72ee904fe95aa463d7b246cc2ccfe691/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f507974686f6e2d332e31302b2d3337373641423f7374796c653d666c61742d737175617265266c6f676f3d707974686f6e266c6f676f436f6c6f723d7768697465" alt="Python"&gt;&lt;/a&gt;
&lt;a href="https://openrouter.ai" rel="nofollow noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/c67d8f26226fd6a9d97f0dabf1012e707b1f83bfe6443832db5500e40fb9160f/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f4f70656e526f757465722d467265655f546965722d3642344546463f7374796c653d666c61742d737175617265" alt="OpenRouter"&gt;&lt;/a&gt;
&lt;a href="https://github.com/s17anushka/daily-digest/LICENSE" rel="noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/8e016b16e4a3d13fa9fc00c210f17cfa49f93f0c4731b47c24155d4729e6b31e/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f4c6963656e73652d4d49542d3232433535453f7374796c653d666c61742d737175617265" alt="License"&gt;&lt;/a&gt;
&lt;a href="https://dev.to/challenges/hermes-agent-2026-05-15" rel="nofollow"&gt;&lt;img src="https://camo.githubusercontent.com/1c18c9a65f3fe88f91cce8d1308a80e26f7377852abbf881191f60d99ff8e5ac/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f4865726d65735f4167656e742d4368616c6c656e67655f323032362d4646364233353f7374796c653d666c61742d737175617265" alt="Challenge"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/div&gt;

&lt;p&gt;Every morning, this agent wakes up, researches the news on topics &lt;em&gt;you&lt;/em&gt; care about, reads the actual articles, spots the patterns, and sends you a clean digest — straight to your inbox. No subscriptions. No paywalls. No noise.&lt;/p&gt;
&lt;p&gt;It doesn't just call an LLM once. Hermes runs a real agentic loop — planning, searching, reading, reasoning — and gets smarter with every run.&lt;/p&gt;

&lt;div class="markdown-heading"&gt;
&lt;h2 class="heading-element"&gt;What it looks like&lt;/h2&gt;
&lt;/div&gt;
&lt;div class="snippet-clipboard-content notranslate position-relative overflow-auto"&gt;
&lt;pre class="notranslate"&gt;&lt;code&gt;=== Daily Digest Agent starting — Friday, May 29 2026 ===
Hermes iteration 1/20
  Tool call: web_search → "artificial intelligence news May 29 2026" → 10 results
Hermes iteration 2/20
  Tool call: web_fetch  → reading techcrunch.com/...
Hermes iteration 3/20
  Tool call: web_search → "Indian startup funding round May 2026" → 10 results
...
Digest saved to output/digest_Friday_May_29_2026.md
Email sent to&lt;/code&gt;&lt;/pre&gt;…&lt;/div&gt;
&lt;/div&gt;
  &lt;/div&gt;
  &lt;div class="gh-btn-container"&gt;&lt;a class="gh-btn" href="https://github.com/s17anushka/daily-digest" rel="noopener noreferrer"&gt;View on GitHub&lt;/a&gt;&lt;/div&gt;
&lt;/div&gt;


&lt;h3&gt;
  
  
  My Tech Stack
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Hermes Agent (NousResearch) — agentic loop architecture&lt;/li&gt;
&lt;li&gt;OpenRouter (auto routing) — LLM backbone, free tier&lt;/li&gt;
&lt;li&gt;Python 3.10+ — core language&lt;/li&gt;
&lt;li&gt;httpx — async HTTP requests&lt;/li&gt;
&lt;li&gt;Google News RSS — real-time news, no API key needed&lt;/li&gt;
&lt;li&gt;trafilatura — article text extraction&lt;/li&gt;
&lt;li&gt;Gmail SMTP — email delivery&lt;/li&gt;
&lt;li&gt;Windows Task Scheduler / cron — daily automation&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  How I Used Hermes Agent
&lt;/h2&gt;

&lt;p&gt;I repeat it's an engine not a wrapper, A multistep agentic loop inspired by Hermes agent core architecture.&lt;br&gt;
&lt;strong&gt;Tool Calling&lt;/strong&gt;-Hermes receives two tools (web_search and web_fetch) and autonomously decides which queries to run, which articles are worth reading, and when it has gathered enough to write the digest. No hardcoded steps.&lt;br&gt;
&lt;strong&gt;Self-termination&lt;/strong&gt; — The agent runs until it emits DIGEST_COMPLETE on its own. It decides when it's done.&lt;br&gt;
&lt;strong&gt;GEPA-style skill memory&lt;/strong&gt; — After every run, a skill log is updated with what topics were searched and what patterns worked. On the next run, this context is fed back to the agent — so it gets progressively sharper at finding relevant stories. After 5+ runs you can see it adjusting its own search queries.&lt;br&gt;
&lt;strong&gt;Scheduled autonomy&lt;/strong&gt; — Runs every morning via cron/Task Scheduler with zero human input. Fully autonomous end-to-end.&lt;/p&gt;

&lt;p&gt;What makes this different from a simple LLM call is that the agent plans, executes, reads, reasons, and self-improves — exactly the capabilities Hermes Agent helps with.&lt;/p&gt;

</description>
      <category>hermesagentchallenge</category>
      <category>devchallenge</category>
      <category>agents</category>
    </item>
    <item>
      <title>UttarCheck — AI-based Handwritten Answer Evaluator Built with Gemma 4</title>
      <dc:creator>Anushka Singh</dc:creator>
      <pubDate>Thu, 21 May 2026 17:05:13 +0000</pubDate>
      <link>https://dev.to/anushka_singh09/uttarcheck-ai-based-handwritten-answer-evaluator-built-with-gemma-4-3kge</link>
      <guid>https://dev.to/anushka_singh09/uttarcheck-ai-based-handwritten-answer-evaluator-built-with-gemma-4-3kge</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/google-gemma-2026-05-06"&gt;Gemma 4 Challenge: Build with Gemma 4&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Built
&lt;/h2&gt;

&lt;p&gt;I dedicate this project to the students studying between 6th and 12th grade in India. My thoughts during the project ideation emerged from my extensive use of AI whether they are chatbots or RAG applications to understand the concepts but the scenario changes for the school going students. They write exams and need personalised evaluation for their handwritten answers because many students are still not able to access the premium education for example those living in remote areas, and as pupils in this age bracket are having developing minds, I decided to build something which can point their mistakes out in graceful manner while suggesting improvements so that they feel confident and prepared before examination, can write accurate answers in their exams.&lt;br&gt;
It is a simple concept to implement Gemma 4 model in this project. If I add personal story to it. Apps like these were needed to guide to write my UG exams let alone a drive for me to write in legible handwriting. I believe this app will genuinely be helpful for secondary school students.&lt;br&gt;
Student clicks the picture of his/her answer to any question and gets feedback on it, gives the score on scale 1 to 10 and generates detailed feedback in both Hindi and English language as well. It works for both Hindi and English language written answersheet and I wish to expand the evaluator to multilingual support because my country is diversified and each student on the land must get benefitted.&lt;/p&gt;
&lt;h2&gt;
  
  
  Demo
&lt;/h2&gt;

&lt;p&gt;I gave UttarCheck application Photosynthesis answer.Exposing my localhost on ngrok as of now and recorded it. I fetched GEMMA 4 model from Google AI studio. &lt;br&gt;
&lt;/p&gt;
&lt;div class="crayons-card c-embed text-styles text-styles--secondary"&gt;
    &lt;div class="c-embed__content"&gt;
      &lt;div class="c-embed__body"&gt;
        &lt;h2 class="fs-xl lh-tight"&gt;
          &lt;a href="https://drive.google.com/file/d/1VBmR8ZmgCu0-Xb0tDgyPvn0AwiX905hT/view?usp=sharing" rel="noopener noreferrer" class="c-link"&gt;
            AI handwritten answer evaluator - Google Drive
          &lt;/a&gt;
        &lt;/h2&gt;
        &lt;div class="color-secondary fs-s flex items-center"&gt;
            &lt;img alt="favicon" class="c-embed__favicon m-0 mr-2 radius-0" src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fssl.gstatic.com%2Fimages%2Fbranding%2Fproduct%2F1x%2Fdrive_2020q4_32dp.png" width="32" height="32"&gt;
          drive.google.com
        &lt;/div&gt;
      &lt;/div&gt;
    &lt;/div&gt;
&lt;/div&gt;


&lt;h2&gt;
  
  
  Code
&lt;/h2&gt;

&lt;p&gt;I provide GITHUB repo for detailed explanation of my project&lt;br&gt;
&lt;/p&gt;
&lt;div class="ltag-github-readme-tag"&gt;
  &lt;div class="readme-overview"&gt;
    &lt;h2&gt;
      &lt;img src="https://assets.dev.to/assets/github-logo-5a155e1f9a670af7944dd5e12375bc76ed542ea80224905ecaf878b9157cdefc.svg" alt="GitHub logo"&gt;
      &lt;a href="https://github.com/s17anushka" rel="noopener noreferrer"&gt;
        s17anushka
      &lt;/a&gt; / &lt;a href="https://github.com/s17anushka/UttarCheck" rel="noopener noreferrer"&gt;
        UttarCheck
      &lt;/a&gt;
    &lt;/h2&gt;
    &lt;h3&gt;
      UttarCheck is an AI-powered handwritten answer evaluation system built using Gemma 4. It analyzes student answer sheets, generates scores, detects mistakes, and provides detailed feedback in both Hindi and English. The platform is designed for Indian classrooms and supports intelligent evaluation of handwritten responses across multiple subjects. 
    &lt;/h3&gt;
  &lt;/div&gt;
  &lt;div class="ltag-github-body"&gt;
    
&lt;div id="readme" class="md"&gt;
&lt;div&gt;
&lt;div class="snippet-clipboard-content notranslate position-relative overflow-auto"&gt;&lt;pre class="notranslate"&gt;&lt;code&gt;██╗   ██╗████████╗████████╗ █████╗ ██████╗  ██████╗██╗  ██╗███████╗ ██████╗██╗  ██╗
██║   ██║╚══██╔══╝╚══██╔══╝██╔══██╗██╔══██╗██╔════╝██║  ██║██╔════╝██╔════╝██║ ██╔╝
██║   ██║   ██║      ██║   ███████║██████╔╝██║     ███████║█████╗  ██║     █████╔╝
██║   ██║   ██║      ██║   ██╔══██║██╔══██╗██║     ██╔══██║██╔══╝  ██║     ██╔═██╗ 
╚██████╔╝   ██║      ██║   ██║  ██║██║  ██║╚██████╗██║  ██║███████╗╚██████╗██║  ██╗
 ╚═════╝    ╚═╝      ╚═╝   ╚═╝  ╚═╝╚═╝  ╚═╝ ╚═════╝╚═╝  ╚═╝╚══════╝ ╚═════╝╚═╝  ╚═╝
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;
&lt;p&gt;&lt;strong&gt;AI-Powered Handwritten Answer Evaluator for Indian Students&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Built for the &lt;a href="https://dev.to/devteam/join-the-gemma-4-challenge-3000-prize-pool-for-ten-winners-23in" rel="nofollow"&gt;Gemma 4 Challenge&lt;/a&gt; on DEV.to&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://python.org" rel="nofollow noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/a4598a7970ccfa3ddfd1cdddb36487c83152a9e4a53f0dd7f66654d4ba785821/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f507974686f6e2d332e31312b2d3337373641423f7374796c653d666c61742d737175617265266c6f676f3d707974686f6e266c6f676f436f6c6f723d7768697465" alt="Python"&gt;&lt;/a&gt;
&lt;a href="https://flask.palletsprojects.com" rel="nofollow noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/d6c751c40d68958c6e4d32f996360cc30e37286f374424b785ab2e3c17272b36/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f466c61736b2d332e302d3030303030303f7374796c653d666c61742d737175617265266c6f676f3d666c61736b266c6f676f436f6c6f723d7768697465" alt="Flask"&gt;&lt;/a&gt;
&lt;a href="https://ai.google.dev" rel="nofollow noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/3b556b2b736986f3fe68767805459346bd2b1d7ee06da745d3986e3bf5610efa/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f47656d6d615f342d3236425f4d6f452d3432383546343f7374796c653d666c61742d737175617265266c6f676f3d676f6f676c65266c6f676f436f6c6f723d7768697465" alt="Gemma"&gt;&lt;/a&gt;
&lt;a href="https://github.com/s17anushka/UttarCheck/LICENSE" rel="noopener noreferrer"&gt;&lt;img src="https://camo.githubusercontent.com/152aa2a37725b9fd554b28ff24d270f6071c67927a63e6d635a55c8e188e20c7/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f4c6963656e73652d4d49542d677265656e3f7374796c653d666c61742d737175617265" alt="License"&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/div&gt;

&lt;div class="markdown-heading"&gt;
&lt;h2 class="heading-element"&gt;The Problem&lt;/h2&gt;

&lt;/div&gt;
&lt;p&gt;India has &lt;strong&gt;250 million school students&lt;/strong&gt;. Most write handwritten answers for board exams — CBSE, UP Board, ICSE. Getting feedback means waiting days for a teacher.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;UttarCheck changes that.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Photograph your handwritten answer. Get instant AI evaluation — score, mistakes, improvement tips — in &lt;strong&gt;Hindi and English&lt;/strong&gt;. Powered by Gemma 4 running multimodal vision inference.&lt;/p&gt;

&lt;div class="markdown-heading"&gt;
&lt;h2 class="heading-element"&gt;Demo&lt;/h2&gt;

&lt;/div&gt;
&lt;div class="snippet-clipboard-content notranslate position-relative overflow-auto"&gt;
&lt;pre class="notranslate"&gt;&lt;code&gt;Student photographs answer sheet
           ↓
    UttarCheck processes image
           ↓
  Gemma 4 reads handwriting
           ↓
  ┌─────────────────────────┐
  │  Score: 9/10  Grade: A+ │
  │  Subject: Science       │
  │                         │
  │  ✅&lt;/code&gt;&lt;/pre&gt;…&lt;/div&gt;
&lt;/div&gt;
  &lt;/div&gt;
  &lt;div class="gh-btn-container"&gt;&lt;a class="gh-btn" href="https://github.com/s17anushka/UttarCheck" rel="noopener noreferrer"&gt;View on GitHub&lt;/a&gt;&lt;/div&gt;
&lt;/div&gt;


&lt;h2&gt;
  
  
  How I Used Gemma 4
&lt;/h2&gt;

&lt;p&gt;The intent to choose Gemma 4 API for UttarCheck is its native multimodal capability. It is not a normal text model where we add OCR step to read content from the handwritten text while Gemma 4 does read directly from the text &lt;br&gt;
I specifically chose gemma-4-26b-a4b-it (the Mixture-of-Experts variant) for three reasons:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Multimodal vision — Gemma 4 receives the answer sheet photo as base64-encoded image data alongside the evaluation prompt. It reads the handwriting, identifies the subject, detects the question from context, and evaluates content quality — all in a single inference call.&lt;/li&gt;
&lt;li&gt;MoE efficiency — The 26B MoE model activates only ~4B parameters per inference. For an educational tool expecting many concurrent students, this means faster response times and lower API cost compared to a dense model of equivalent quality.&lt;/li&gt;
&lt;li&gt;Bilingual reasoning — Gemma 4 generates feedback simultaneously in Hindi and English without any translation layer. Indian students think in Hindi but study in English — having both in one response is genuinely useful.
The Multimodal Payload&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;payload = {&lt;br&gt;
    "contents": [{&lt;br&gt;
        "parts": [&lt;br&gt;
            {"inline_data": {"mime_type": "image/jpeg", "data": image_b64}},&lt;br&gt;
            {"text": "Evaluate the handwritten answer in this image..."}&lt;br&gt;
        ]&lt;br&gt;
    }],&lt;br&gt;
    "generationConfig": {"temperature": 0.2, "maxOutputTokens": 1024}&lt;br&gt;
}&lt;/p&gt;

&lt;p&gt;Gemma 4 does not always return in JSON response. It thinks through to evaluate step by step which produces better results but wraps the final JSON in prose and markdown fences.I wrote about it that how I handled in the Github Readme. I sought this strategy from utilising various LLMs because it was something new to me.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Edge Deployment&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The architecture is designed with Gemma 4 E4B in mind. When the edge variant becomes available via API, UttarCheck will run fully on-device — no internet, no server, student data never leaving the phone. The gemma_service.py already has an Ollama backend wired in for local inference today.&lt;/p&gt;

&lt;p&gt;A great success to the application, I hope it is ahead of it's time when the project comes out from its viability to its effective usability.&lt;br&gt;
Thank you !&lt;/p&gt;

</description>
      <category>devchallenge</category>
      <category>gemmachallenge</category>
      <category>gemma</category>
    </item>
    <item>
      <title>Local AI on the move</title>
      <dc:creator>Anushka Singh</dc:creator>
      <pubDate>Mon, 18 May 2026 01:11:07 +0000</pubDate>
      <link>https://dev.to/anushka_singh09/local-ai-on-the-move-3j2d</link>
      <guid>https://dev.to/anushka_singh09/local-ai-on-the-move-3j2d</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/google-gemma-2026-05-06"&gt;Gemma 4 Challenge: Write About Gemma 4&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Of late, Google has released intelligent open AI models-Gemma 4.&lt;br&gt;
For the first time AI behind cloud infrastructure is supposably being accessible on the edge realistically. While exploring browser privacy concepts in my recent project, I understood the importance of AI running directly on the devices however since the advent of AI, the powerful open models felt distant to be used locally. I used media pipe for the local privacy shield but there were too many restrictions which I was unable to tackle and as an AI practitioner and dealing with implementation of ML pipelines often I am not someone who has expertise in security and if billions of parameters during inference preserved on the hardware can work effectively then there is possibility of acceleration of rise in Personal AI.&lt;br&gt;
I know many of my colleagues will argue with me that Am I strange to capabilities of Cloud? No! &lt;br&gt;
Since we know edge computing reduces latency in enabling important communication such as in relation to defense and medicine, i should say we should rely a bit less on Cloud infrastructure and make our edge computing paradigm stronger. &lt;/p&gt;

&lt;p&gt;Personal AI can be thought as analogous to Personalised Medicine. In traditional medicine where each person has different response to medications people need medicine and healthcare exclusive and individualistic. Scientists, Researchers, hospitals and industries all of them are working on Personalised medicine similarly Personalised AI will enhance the human creativity manifold which will not only train on the users' context but will remain capable of fetching world's information relevant, useful and novel. From security to educating on the ground level  AI enabled IOT using open models like Gemma 4 will reach the remote and rural areas. The invention of AI technology has itself increased the educated masses informed, creative and enthusiastic to new things. When knowledge was far fetched concept for people who were not belonging to the premium locations in the country/countries we found online content spreading like lightening speed a few years back but AI and cloud has done wonders to the human productivity. AI can help you with anything no matter which field you are in!&lt;/p&gt;

&lt;h3&gt;
  
  
  What about Gemma 4 and what it can do in the upcoming times
&lt;/h3&gt;

&lt;p&gt;Gemma 4 is trained on billions of parameters having longer context and superb mathematical capabilities &lt;br&gt;
&lt;strong&gt;26B and 31B models&lt;/strong&gt; for your personal computers&lt;br&gt;
26B is great for lightweight multimodal experiments, local assistants, summarization and RAG systems effectively&lt;br&gt;
Developers can choose their models according to their use case and get most of it.&lt;/p&gt;

&lt;p&gt;Let's come to 31B which is having better reasoning than it's previous counterpart can be used for complex coding, advanced reasoning which we need [(sarcastic): do not trade off your reasoning for real.&lt;br&gt;
&lt;em&gt;both on 80GB NVIDIA H100 GPU&lt;/em&gt; &lt;/p&gt;

&lt;p&gt;&lt;strong&gt;E2B and E4B models&lt;/strong&gt;: The concept of AI everywhere &lt;br&gt;
The peculiarity is that they do not waste battery life because of their lightweight nature and I found it interesting to bypass sensitive cloud processing, good use case is speech recognition and understanding OCR because of best native audio and video processing. &lt;/p&gt;

&lt;p&gt;These models are optimized by the technique of quantization which reduces the memory and computational requirements of large models capable of running on the consumer hardware...ok now I can get the inferences locally wonderful!&lt;/p&gt;

&lt;h3&gt;
  
  
  Future scope
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;low resource environments will get benefitted&lt;/li&gt;
&lt;li&gt;security shall be redefined&lt;/li&gt;
&lt;li&gt;local healthcare assistance&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Challenges
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Deployment complexity of lightweight open models still persist but you can refer Google Keyword for the new releases for the sake of simplicity.&lt;/li&gt;
&lt;li&gt;Energy usage is real as well as hardware cost will be high in initial years&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It is a good news for researchers you do not need to rely on API, research labs and gives you more capabilities to build prototype and study model behaviour. Infrastructure dependency is getting minimal here.&lt;/p&gt;

</description>
      <category>devchallenge</category>
      <category>gemmachallenge</category>
      <category>gemma</category>
    </item>
    <item>
      <title>Sentinel AI privacy shield- a failed project</title>
      <dc:creator>Anushka Singh</dc:creator>
      <pubDate>Sun, 17 May 2026 09:34:32 +0000</pubDate>
      <link>https://dev.to/anushka_singh09/sentinel-ai-privacy-shield-a-failed-project-1eho</link>
      <guid>https://dev.to/anushka_singh09/sentinel-ai-privacy-shield-a-failed-project-1eho</guid>
      <description>&lt;p&gt;Failures come unexpected and it amazes you that how well you can push your thresholds but cannot keep up with the debugging. For 5 days I have been churning my brain onto building a chrome extension made for security such as when students attempt exams online or if one is doing it's confidential work on the system then the screen blurs and nobody but the authorized user when returned to the screen gets access to it. Let me show you what am I giving my time to!&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Chrome Extension · Manifest V3 · Local Face Detection&lt;br&gt;
Zero cloud. Zero data leakage. 100% local MediaPipe inference.&lt;/strong&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftcherneiylepyru8qzbs.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftcherneiylepyru8qzbs.png" alt="Stranger/Empty" width="799" height="382"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fww26alaxyhxrh8mnfrps.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fww26alaxyhxrh8mnfrps.png" alt="A zoomed view" width="411" height="505"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fek7qq41wqxttact6rm0s.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fek7qq41wqxttact6rm0s.png" alt="authorized user" width="800" height="396"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  How did the ideation begin
&lt;/h2&gt;

&lt;p&gt;Lately I have been studying &lt;strong&gt;Tensorflow.js (tf.js)&lt;/strong&gt; an open source java script library for machine learning that helps to run AI models directly in the browser or on node.js. I got the idea to create a project related to the security and not just a basic object detection project. I am really frustrated that after taking help from AI, I am not able to give this project an end.&lt;/p&gt;

&lt;h3&gt;
  
  
  Problem
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;I never studied core cybersecurity courses but I was trying to combine AI and privacy and holding them together as a novice without any expert advice. -- &lt;em&gt;half knowledge makes you detour for a long time&lt;/em&gt;
&lt;/li&gt;
&lt;li&gt;It was a headache because web browsers have strict security policy-specifically Chrome Extension Manifest V3 (MV3) and Content Security Policy (CSP) which didn't go with tf.js core architecture.
Let me give you some more reasons, I took it from Gemini AI&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;a) &lt;strong&gt;The unsafe-eval Deception&lt;/strong&gt; (The Biggest Culprit)&lt;br&gt;
To make neural networks run fast in a browser, TensorFlow.js uses dynamic code compilation. Under the hood, it dynamically generates JavaScript code strings at runtime and executes them using functions like eval() or new Function() to optimize mathematical matrix operations for your CPU or GPU.&lt;/p&gt;

&lt;p&gt;The Wall: Chrome Manifest V3 completely bans unsafe-eval inside standard extension scripts to prevent hackers from executing malicious strings hidden inside extensions. The moment tf.js tried to run its optimization scripts, Chrome instantly blocked it, throwing the error: Uncaught EvalError: Evaluating a string as JavaScript violates the following Content Security Policy...&lt;/p&gt;

&lt;p&gt;b) &lt;strong&gt;Missing Core Features when Forced to Fall Back&lt;/strong&gt;&lt;br&gt;
When we tried to force tf.js into a "safe" environment or bypass its initialization errors, the framework automatically disabled its dynamic engine and fell back to a basic CPU execution mode.&lt;/p&gt;

&lt;p&gt;The Wall: Because the engine initialized in a crippled, partial state, complex downstream models like MobileNet couldn't find their required dependencies. This triggered the second error you saw: Uncaught (in promise) TypeError: a.loadGraphModel is not a function. The framework literally failed to construct its own loading sub-routines because the compiler was blocked halfway through execution.&lt;/p&gt;

&lt;p&gt;c) &lt;strong&gt;Remote CDN Injections are Illegal in MV3&lt;/strong&gt;&lt;br&gt;
In older Manifest V2 extensions, developers easily bypassed file size limitations by pointing a script tag to an external link like &lt;a href="https://cdn.jsdelivr.net/" rel="noopener noreferrer"&gt;https://cdn.jsdelivr.net/&lt;/a&gt;....&lt;/p&gt;

&lt;p&gt;The Wall: Manifest V3 strictly mandates that all code executed by the extension must be packaged locally inside the extension zip. It blocks remote scripts to prevent extensions from fetching modified malicious code from the internet after being approved by the Chrome Web Store. When we tried to load tf.js via a CDN, the browser blocked the network request entirely.&lt;/p&gt;

&lt;p&gt;d) &lt;strong&gt;Massive File Size &amp;amp; WebAssembly Constraints&lt;/strong&gt;&lt;br&gt;
TensorFlow.js is a heavyweight library. The minified core library, along with the MobileNet weights and layers, spans several megabytes. When loaded locally in an extension popup, it causes severe latency, making the popup feel sluggish.&lt;/p&gt;

&lt;p&gt;Furthermore, to run properly without eval, TensorFlow.js relies heavily on WebAssembly (WASM) binaries (.wasm files). Chrome Extensions isolate execution spaces so aggressively that passing heavy WASM buffers between a background script, a popup, and an injected webpage, webpage creates a massive data-sharing bottleneck.&lt;/p&gt;

&lt;p&gt;I switched to &lt;strong&gt;Google mediapipe&lt;/strong&gt; which was easier to implement designed to build the ml pipelines that process live video, audio and sensor data. The above snapshots I attached is from using mediapipe which is working fine but there was issue&lt;/p&gt;

&lt;h3&gt;
  
  
  Problem
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;As the screen was locking successfully, I wished to work on the tab but the extension was disappearing as soon as I was clicking somewhere on the screen. I wanted my extension to be useful so that I can scroll, type on the screen and can truly be monitored in case of unauthorized user but no..&lt;/li&gt;
&lt;/ol&gt;

&lt;h3&gt;
  
  
  Primary Architecture overview
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;┌─────────────────────────────────────────────────────────┐
│  Chrome Browser                                         │
│                                                         │
│  ┌──────────────────────────────┐                       │
│  │  popup.html (Extension Page) │                       │
│  │                              │                       │
│  │  ┌─────────┐  getUserMedia   │                       │
│  │  │ Camera  │──────────────►  │                       │
│  │  └─────────┘   &amp;lt;video&amp;gt;       │                       │
│  │       │                      │                       │
│  │  offscreen &amp;lt;canvas&amp;gt;          │                       │
│  │  raw ImageData               │                       │
│  │       │  postMessage         │                       │
│  │       ▼  (Transferable)      │                       │
│  │  ┌──────────────────────┐    │                       │
│  │  │  sandbox.html        │    │                       │
│  │  │  (allow-eval CSP)    │    │                       │
│  │  │                      │    │                       │
│  │  │  MediaPipe FaceMesh  │    │                       │
│  │  │  (local WASM)        │    │                       │
│  │  │  0 faces → LOCK      │    │                       │
│  │  │  1 face  → UNLOCK    │    │                       │
│  │  │  &amp;gt;1 face → LOCK      │    │                       │
│  │  └──────────┬───────────┘    │                       │
│  │             │ postMessage    │                       │
│  │             ▼                │                       │
│  │        popup.js              │                       │
│  │             │ sendMessage    │                       │
│  └─────────────┼────────────────┘                       │
│                ▼                                        │
│  ┌─────────────────────────┐                            │
│  │  content.js             │                            │
│  │  (injected into page)   │                            │
│  │                         │                            │
│  │  "lock"  → blur page    │                            │
│  │           show overlay  │                            │
│  │  "unlock"→ unblur page  │                            │
│  │           hide overlay  │                            │
│  └─────────────────────────┘                            │
└─────────────────────────────────────────────────────────┘
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqd4g7fyu9zulw6mysphf.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqd4g7fyu9zulw6mysphf.png" alt="card" width="578" height="355"&gt;&lt;/a&gt;&lt;br&gt;
I was excited that my chrome extension gets submitted to the chrome web store and I will go gala..haha novice&lt;br&gt;
I asked Claude AI that how can I do this;&lt;/p&gt;

&lt;h3&gt;
  
  
  The suggestion
&lt;/h3&gt;

&lt;p&gt;Chrome MV3 a chrome.offscreen API  — to make hidden background page which provides access to DOM and Camera even w/o popup.&lt;br&gt;
Popup is similar to the regular browser window, the moment it closes all the js files, camera stream are lost.&lt;br&gt;
Again &lt;strong&gt;THE PROBLEM&lt;/strong&gt; arose offscreen API strict in allowing/disallowing the camera permissions but it was not an issue.&lt;br&gt;
I asked about this again and it gave me to create the side-panel so that camera never closes, it was looking ugly!&lt;/p&gt;

&lt;p&gt;I dropped all the ideas and shut the laptop down, I must have understood the facts and have read the security docs and papers. Anyways I deleted the repository because I wanted it to work for me and it didn't, I cannot sell more of this&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxil5g8wz8kfc3or7ao0s.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxil5g8wz8kfc3or7ao0s.png" alt="Logo" width="128" height="128"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;It is kinda failed prototype and I need to work on this project in near future after taking informed decisions and choosing unconventionally right architecture .&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Needs clarification more than perfection&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Any comments?&lt;/p&gt;

</description>
      <category>failure</category>
      <category>extensions</category>
      <category>javascript</category>
      <category>security</category>
    </item>
    <item>
      <title>From prompt to playable</title>
      <dc:creator>Anushka Singh</dc:creator>
      <pubDate>Thu, 14 May 2026 08:05:31 +0000</pubDate>
      <link>https://dev.to/anushka_singh09/from-prompt-to-playable-a3f</link>
      <guid>https://dev.to/anushka_singh09/from-prompt-to-playable-a3f</guid>
      <description>&lt;p&gt;Long time no see ! &lt;br&gt;
A bit of story- Back in 2024 I came across the Google Cloud webinar and I readily joined that session, insightful and edgy.. They invited me to join Google Developer Program to thrive in the fraternity of coders and developers.&lt;br&gt;
To my surprise, Lately I was scrolling my profile looking at earned badges and exploring various tabs what I found was, they provide learnings as well. Truly I hadn't had recalled that they give the benefit of learning various skills especially for developers if they need to incorporate something into their application which complement getting knowledge from classroom learnings.&lt;br&gt;
I searched through the learning (Codelabs) page whether I can rush onto a skill new and demanded nowadays.&lt;br&gt;
I clicked...&lt;br&gt;
&lt;strong&gt;&lt;em&gt;Vibe-code a kids game with Gemini and publish with Firebase!&lt;/em&gt;&lt;/strong&gt; &lt;br&gt;
I was missing my 6 year old, I left her at home without telling her and the guilt arose each time I saw this little child in pictures and even randomly when I see my birds. I was determined to make this project to feel good. &lt;em&gt;This was my first intention&lt;/em&gt;&lt;br&gt;
The project is based on java script specifically on p5.js which is deployed on firebase.&lt;/p&gt;

&lt;h2&gt;
  
  
  What is p5.js?
&lt;/h2&gt;

&lt;p&gt;Even I was curious when I was going through the tutorial pages because I am very new to java script in fact I learned Vue.js very recent for frontend while creating my agentic AI project...&lt;br&gt;
p5.js is an open source java script library, free and designed for coding to hone creativity. It gives you special commands to draw shapes,adding colours and creating animations which is displayed in the web browser simultaneously and easier to implement than the traditional use of java script which is used to make websites work for different use cases.&lt;br&gt;
The p5.js is the great tool for artists and beginners who intend to create beautiful animations and flex their creativity with the help of coding.&lt;/p&gt;

&lt;h2&gt;
  
  
  THE GAME
&lt;/h2&gt;

&lt;p&gt;As per the tutorial, the prompt was given for making pixel dinosaur which needs to hop over every obstacle- the game which we play when the internet is not connected on the google search engine, I loved that prompt and the code worked in first go, however I didn't like the game when I thought about the little kid at home, I asked Gemini about my sharp-minded 6 year old trait and at this age they need to be engaged in a fun activity.&lt;/p&gt;

&lt;p&gt;This is how &lt;em&gt;Clever-Fox-Quest&lt;/em&gt; game came to life a stress-free educational colour matching game for kids in which they can drag their mouse which is moving a fox left to right and match the correct colours displayed on screen.&lt;/p&gt;

&lt;p&gt;The 5 correct matches will change the colours and the game won't bore the children.&lt;br&gt;
When 30 points are achieved then the fireworks are shown with the YOU WIN! message on the screen.&lt;br&gt;
For the controls you can press the spacebar to restart the game.&lt;br&gt;
Gemini suggested kids do not like to be defeated which result in irritable behaviour and get cranky.&lt;br&gt;
 I know nowadays the kids are way more ahead to play simple games but this game has the nice animation..&lt;br&gt;
My second reason to choose this tutorial is to work hands-on with firebase, deployment is pain for me when I leverage AWS services for full stack application, i will come back to this some other day!&lt;/p&gt;

&lt;h2&gt;
  
  
  What is Firebase
&lt;/h2&gt;

&lt;p&gt;Firebase is a platform created by Google that gives developers tools to build, host and manage web and mobile apps. &lt;br&gt;
For now I used Firebase hosting to deploy the game on the internet by utilising firebase deploy command on my vs code terminal. I can use various services and tools provided by Firebase to expand my game like integrating leaderboards by taking help of built-in databases by firebase and any other functionality like creating accounts of the players as well.&lt;br&gt;
&lt;em&gt;You can see the preliminary stage of the game by clicking the link below&lt;/em&gt;&lt;br&gt;
&lt;/p&gt;
&lt;div class="crayons-card c-embed text-styles text-styles--secondary"&gt;
    &lt;div class="c-embed__content"&gt;
      &lt;div class="c-embed__body flex items-center justify-between"&gt;
        &lt;a href="https://clever-fox-quest-anushka-99c65.web.app/" rel="noopener noreferrer" class="c-link fw-bold flex items-center"&gt;
          &lt;span class="mr-2"&gt;clever-fox-quest-anushka-99c65.web.app&lt;/span&gt;
          

        &lt;/a&gt;
      &lt;/div&gt;
    &lt;/div&gt;
&lt;/div&gt;
&lt;br&gt;
If she says she wants something more interesting and engaging then I have to research more that what can make these children locked-in.&lt;br&gt;
The cursor to move fox moves so smoothly, it is because of p5.js library built around 2 main functions&lt;br&gt;
setup() and  draw()&lt;br&gt;
setup() runs only 1 time loads the images and set the starting from the score 0.&lt;br&gt;
draw() is the miraculous part of the code which usually runs 60 times per second.Every fraction of a second, it wipes the screen clean and redraws the fox, the gems, and the score in their slightly new positions. This is what creates the illusion of smooth animation. This is why it looks like an art.

&lt;p&gt;Any questions are welcomed and we can collaborate on the creative projects.&lt;br&gt;
I hope she likes the game so that I can feel low-key talented for some time hehehe!&lt;/p&gt;

</description>
      <category>javascript</category>
      <category>learning</category>
      <category>firebase</category>
      <category>ai</category>
    </item>
    <item>
      <title>Glimpses of Agentic AI practises</title>
      <dc:creator>Anushka Singh</dc:creator>
      <pubDate>Wed, 14 Jan 2026 18:28:51 +0000</pubDate>
      <link>https://dev.to/anushka_singh09/glimpses-of-agentic-ai-practises-1dg9</link>
      <guid>https://dev.to/anushka_singh09/glimpses-of-agentic-ai-practises-1dg9</guid>
      <description>&lt;p&gt;In continuation of my Agentic AI learning &lt;em&gt;I got to make a project&lt;/em&gt; &lt;strong&gt;complaint-triage-system&lt;/strong&gt; by incorporating process automation with the help of LLM API. For database I used Sqlalchemy and when admin intends to check the lodged complaints on separate dashboard, they have to authenticate with their email and password secured by JWT token. It took me so long because first I used Gemini API key and at last I had to revoke because some glitch happened to be there (as usual). Oh I completely forgot to write that why did I use API key because it was helping me to triage the complaint status to high/medium/low and it was responsible for analysis of complaint submitted but the twist came when I wished to create the email send reply button by admin side along with regenerate and  edit button.&lt;/p&gt;

&lt;p&gt;I don't know how but my gemini key crashed, I rushed into changing the models but it didn't work. I switched to grok API because it was free, triage was working correctly and the analysis was a bit short (very specific in keywords).In between all of this It was all pain to use JWT based tokenization , I had to cancel otp based authentication when entering email, the otp was appearing to me in my backend server. I got so confused, my biggest red flag was lack of system design or clear workflow, and I mean it you not only need mere inspiration,but also clarity in your aim...&lt;/p&gt;

&lt;p&gt;LET'S Get back to the topic so i just made a create_admin file and added my email, password in the env. After this, everything was working fine and I finally wanted AI-generated customized editable reply to be sent to the user.. I used APP password of one of my email ids but again it didn't work out. What came as my saviour &lt;em&gt;&lt;strong&gt;Twilio Sendgrid&lt;/strong&gt;&lt;/em&gt;. Sendgrid API key and my application was working without bugs, I edited my reply to add some human touch and specific details to the email however the edited reply was not going to the payload.. I debugged one last time and magic it is running smoothly.&lt;/p&gt;

&lt;p&gt;Deploying is another pain and I tried on AWS Elastic Beanstalk but that http and https mismatch because I was using AWS amplify(using https), I turned to EC2 and tried installing nginx but the timeout in free tier and repeated commands exhausted me...This is how my full-stack application came into the life and the github commits&amp;gt; 11 has other memebase.&lt;br&gt;
Today My aws free tier expired and I have multiple quests to create many projects, to contribute.&lt;br&gt;
P.S. I intend AI to routing the complaint to specific department as per the user needs for faster  issue resolving as my project grows &lt;/p&gt;

</description>
      <category>ai</category>
      <category>vue</category>
      <category>vite</category>
      <category>python</category>
    </item>
    <item>
      <title>Whitepapers, Labs and loads of learning</title>
      <dc:creator>Anushka Singh</dc:creator>
      <pubDate>Mon, 08 Dec 2025 18:47:25 +0000</pubDate>
      <link>https://dev.to/anushka_singh09/whitepapers-labs-and-loads-of-learning-1638</link>
      <guid>https://dev.to/anushka_singh09/whitepapers-labs-and-loads-of-learning-1638</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/googlekagglechallenge"&gt;Google AI Agents Writing Challenge&lt;/a&gt;: Learning Reflections&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Agentic AI is the evolution of automation which is agile in adaptive &lt;br&gt;
decision making.&lt;/p&gt;

&lt;h3&gt;
  
  
  Fellas ! It was truly an amazing event packed with practical implementation
&lt;/h3&gt;

&lt;p&gt;I started with &lt;a href="https://drive.google.com/file/d/1FM7NK-E2O8CTl_pNVTagxBHuQ0CB2dYc/view?usp=drive_link" rel="noopener noreferrer"&gt;Building your first Agent&lt;/a&gt;, it was quite fun and accomplishing to record the working of an agent. The learning is based on utilising ADK and Gemini for API keys and compatible model. As I geared up on day 1, I got to know the very basics of what is Agentic AI and the method it invokes to work with, then the types of agents and clear workflows of architecture in multi-agent systems.&lt;/p&gt;

&lt;p&gt;I caught up with &lt;em&gt;Interoperability with MCP and tools&lt;/em&gt; to get to know the deeper side of orchestration behind an agent's success. The best part was that you can connect the documents to the NotebookLM and learn the core concepts a way much better. They even provided the summary podcast created by NotebookLM.&lt;/p&gt;

&lt;p&gt;I was able to call get tiny image tool from MCP server to test on my local host and it worked. Furthermore I wanted the output of image of an anime girl when asked from the agent just to work with different MCP server, let alone Replicate MCP server. There was a glitch and I moved on to day 3, however on day 2, I worked with agent with approval and definitely wouldn't have missed.&lt;br&gt;
Day 3 and the dawn of context engineering &lt;em&gt;Sessions and Memory&lt;/em&gt; one of my favourite topics, I put pen to paper and dived into how to make the agent stateful and the labs were my only resources cut to the chase for any beginner and I am glad everything was so smooth while I learned.&lt;/p&gt;

&lt;p&gt;Day 4 was the addition to Responsible AI and if the agent is capable to solve problem Should it actually do or not, they must be evaluated too.&lt;br&gt;
From Glassbox and Blackbox evaluations to pillars of observability. There was proper guidance depending on the roles of any professional.&lt;/p&gt;

&lt;p&gt;While Day 5 was all about &lt;em&gt;Prototype to Production&lt;/em&gt; , Deploying the agent.&lt;br&gt;
On a good note, I would say the event was more than worthwhile&lt;br&gt;
The curated playbooks and the steps to develop the agents on &lt;em&gt;kaggle&lt;/em&gt; is the foundation to my next move which is developing Agentic AI project end to end.&lt;/p&gt;

&lt;p&gt;Thank you for the day&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fds504f05q8g9fph50hs7.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fds504f05q8g9fph50hs7.png" alt="Thank you for the day" width="393" height="344"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>googleaichallenge</category>
      <category>ai</category>
      <category>agents</category>
      <category>devchallenge</category>
    </item>
  </channel>
</rss>
