<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: harshini hegde</title>
    <description>The latest articles on DEV Community by harshini hegde (@harshini_hegde_1ab5ee3606).</description>
    <link>https://dev.to/harshini_hegde_1ab5ee3606</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3827603%2F22e6a582-82f2-4406-92dd-fe61527179e3.jpg</url>
      <title>DEV Community: harshini hegde</title>
      <link>https://dev.to/harshini_hegde_1ab5ee3606</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/harshini_hegde_1ab5ee3606"/>
    <language>en</language>
    <item>
      <title>Mentori: Turning Documents Into Interactive AI Tutors with Gemini Live</title>
      <dc:creator>harshini hegde</dc:creator>
      <pubDate>Mon, 16 Mar 2026 23:29:07 +0000</pubDate>
      <link>https://dev.to/harshini_hegde_1ab5ee3606/mentori-turning-documents-into-interactive-ai-tutors-with-gemini-live-26nl</link>
      <guid>https://dev.to/harshini_hegde_1ab5ee3606/mentori-turning-documents-into-interactive-ai-tutors-with-gemini-live-26nl</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftzxwzwbjomp3n3moecxu.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftzxwzwbjomp3n3moecxu.png" alt="🐢 Mentori"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Disclosure:&lt;/strong&gt; This article and the Mentori project were created for the purposes of entering the &lt;strong&gt;Google Gemini Live Agent Challenge hackathon.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  The Problem We Wanted to Solve
&lt;/h2&gt;

&lt;p&gt;Every student studies from documents — PDFs, lecture slides, research papers, technical documentation.&lt;/p&gt;

&lt;p&gt;But the experience is usually the same.&lt;/p&gt;

&lt;p&gt;You sit there staring at a page trying to understand a concept. If it doesn't make sense, you reread the same paragraph again. And if it still doesn’t make sense, you open another tab and start searching for explanations somewhere else.&lt;/p&gt;

&lt;p&gt;That breaks the learning flow.&lt;/p&gt;

&lt;p&gt;Modern AI assistants help a little, but they have another problem: they answer from &lt;strong&gt;general knowledge&lt;/strong&gt;, not from the exact document you're studying.&lt;/p&gt;

&lt;p&gt;That means explanations often don't match the terminology, context, or examples used in the material.&lt;/p&gt;

&lt;p&gt;We started thinking:&lt;strong&gt;What if the document itself could become the tutor?&lt;/strong&gt;&lt;br&gt;
Not a chatbot.  Not a search tool.&lt;br&gt;
A tutor that understands the document and can &lt;strong&gt;teach, explain, and interview you on the material.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;That's how 🐢 &lt;strong&gt;Mentori&lt;/strong&gt; was born.&lt;/p&gt;


&lt;h2&gt;
  
  
  Meet 🐢 Mentori
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Mentori&lt;/strong&gt; turns any document into an &lt;strong&gt;interactive AI tutor&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Upload a textbook chapter, lecture notes, or research paper and Mentori will:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;explain the material conversationally
&lt;/li&gt;
&lt;li&gt;answer questions grounded in the document
&lt;/li&gt;
&lt;li&gt;switch languages when needed for better understanding
&lt;/li&gt;
&lt;li&gt;interview you to test your knowledge
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Instead of passively reading documents, students can &lt;strong&gt;talk to the material and learn interactively.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Mentori uses &lt;strong&gt;Retrieval-Augmented Generation (RAG)&lt;/strong&gt; with &lt;strong&gt;Gemini models&lt;/strong&gt; so answers always stay grounded in the uploaded document.&lt;/p&gt;

&lt;p&gt;And with &lt;strong&gt;Gemini Live API&lt;/strong&gt;, the interaction becomes natural — students can speak to the tutor and receive responses in real time.&lt;/p&gt;


&lt;h2&gt;
  
  
  Why This Matters
&lt;/h2&gt;

&lt;p&gt;Learning from documents is still the backbone of education.&lt;/p&gt;

&lt;p&gt;But the process is inefficient:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Students reread sections repeatedly
&lt;/li&gt;
&lt;li&gt;Concepts remain unclear
&lt;/li&gt;
&lt;li&gt;Questions interrupt learning flow
&lt;/li&gt;
&lt;li&gt;Understanding is rarely tested properly
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;🐢 Mentori addresses these problems directly.&lt;/p&gt;
&lt;h3&gt;
  
  
  Passive Reading
&lt;/h3&gt;

&lt;p&gt;Most studying is passive.&lt;br&gt;
Mentori transforms reading into &lt;strong&gt;interactive conversation-based learning&lt;/strong&gt;.&lt;/p&gt;
&lt;h3&gt;
  
  
  Context Loss
&lt;/h3&gt;

&lt;p&gt;Traditional AI assistants don't know your document.&lt;br&gt;
Mentori uses &lt;strong&gt;RAG&lt;/strong&gt; so responses always come from the uploaded material.&lt;/p&gt;
&lt;h3&gt;
  
  
  No Feedback Loop
&lt;/h3&gt;

&lt;p&gt;Reading alone doesn't confirm understanding.&lt;br&gt;
Mentori's &lt;strong&gt;Interview Mode&lt;/strong&gt; actively tests comprehension.&lt;/p&gt;


&lt;h2&gt;
  
  
  The Two Core Experiences
&lt;/h2&gt;

&lt;p&gt;🐢 Mentori focuses on two key learning workflows.&lt;/p&gt;


&lt;h3&gt;
  
  
  Conversational Learning
&lt;/h3&gt;

&lt;p&gt;After a document is uploaded, Mentori analyzes it and creates a &lt;strong&gt;structured learning session&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Instead of expecting the student to read everything alone, Mentori walks through the material and explains concepts step-by-step.&lt;/p&gt;

&lt;p&gt;The interaction happens through &lt;strong&gt;real-time voice conversation powered by Gemini Live API.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Students can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;interrupt the tutor
&lt;/li&gt;
&lt;li&gt;ask follow-up questions
&lt;/li&gt;
&lt;li&gt;request clarification
&lt;/li&gt;
&lt;li&gt;ask for explanations in another language
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;For example:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;"I didn't understand that. Can you explain it in Spanish?"&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Mentori will immediately switch languages while still explaining the concept &lt;strong&gt;based on the document context&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;The experience feels much closer to learning with a &lt;strong&gt;real tutor&lt;/strong&gt;.&lt;/p&gt;


&lt;h3&gt;
  
  
  Interview Mode
&lt;/h3&gt;

&lt;p&gt;🐢 Mentori also includes an &lt;strong&gt;Interview Mode&lt;/strong&gt; designed to test understanding.&lt;/p&gt;

&lt;p&gt;After processing the document, Mentori generates &lt;strong&gt;a curated set of important questions&lt;/strong&gt; from the material.&lt;/p&gt;

&lt;p&gt;The experience works like a real interview.&lt;/p&gt;

&lt;p&gt;Mentori asks a question and the student answers using &lt;strong&gt;voice or text&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;If the answer is incomplete or incorrect, Mentori &lt;strong&gt;guides the student toward the correct reasoning&lt;/strong&gt; instead of immediately revealing the answer.&lt;/p&gt;

&lt;p&gt;At the end of each question Mentori provides:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;the correct answer
&lt;/li&gt;
&lt;li&gt;feedback on the response
&lt;/li&gt;
&lt;li&gt;suggestions for improvement
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This turns studying into &lt;strong&gt;active knowledge reinforcement&lt;/strong&gt;.&lt;/p&gt;


&lt;h2&gt;
  
  
  How 🐢 Mentori Works
&lt;/h2&gt;

&lt;p&gt;Mentori is built as a &lt;strong&gt;real-time AI learning platform on Google Cloud&lt;/strong&gt; combining document retrieval, conversational AI, and voice interaction.&lt;/p&gt;


&lt;h2&gt;
  
  
  Architecture Overview
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5db5spug78td1nl94v3v.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5db5spug78td1nl94v3v.png" alt="Architecture"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Mentori Architecture:&lt;/strong&gt; RAG-powered document tutoring using Gemini Flash 2.5 for reasoning and Gemini Live API for real-time conversational learning.&lt;/p&gt;

&lt;p&gt;At a high level, Mentori combines document processing, vector retrieval, and live AI interaction to create a responsive tutoring experience.&lt;/p&gt;

&lt;p&gt;The system consists of several key components:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;React Frontend (Firebase Hosting)&lt;/strong&gt;  Handles document uploads, learning sessions, and real-time interaction with the AI tutor.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Cloud Run Services (Python / FastAPI)&lt;/strong&gt;  Backend APIs responsible for document ingestion, session management, and AI orchestration.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Gemini Live API&lt;/strong&gt;  Enables low-latency conversational interaction and streaming voice responses.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Gemini Flash 2.5&lt;/strong&gt;  Generates grounded explanations using retrieved document context.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Vertex AI Embeddings&lt;/strong&gt;  Converts document chunks into semantic vector representations.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Vertex AI Vector Search&lt;/strong&gt;  Retrieves relevant sections of the document during tutoring conversations.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Cloud Storage&lt;/strong&gt;  Stores uploaded documents.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Firestore&lt;/strong&gt;  Maintains session metadata and document references.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;WebSockets&lt;/strong&gt;  Enable real-time communication between the frontend and AI services.&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;


&lt;h2&gt;
  
  
  Document Processing (RAG Pipeline)
&lt;/h2&gt;

&lt;p&gt;When a document is uploaded, Mentori processes it through an ingestion pipeline.&lt;/p&gt;

&lt;p&gt;The system:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;stores the document in &lt;strong&gt;Cloud Storage&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;splits it into semantic chunks
&lt;/li&gt;
&lt;li&gt;generates embeddings using &lt;strong&gt;Vertex AI Embeddings&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;indexes them in &lt;strong&gt;Vertex AI Vector Search&lt;/strong&gt;
&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This creates a searchable knowledge base for the tutor.&lt;/p&gt;

&lt;p&gt;The document is processed &lt;strong&gt;only once&lt;/strong&gt;, and the session is stored so users can return later without uploading the document again.&lt;/p&gt;


&lt;h2&gt;
  
  
  Real-Time Learning Interaction
&lt;/h2&gt;

&lt;p&gt;When students interact with the tutor:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Questions arrive through &lt;strong&gt;WebSockets&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Relevant document sections are retrieved from &lt;strong&gt;Vector Search&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Context is sent to &lt;strong&gt;Gemini Flash 2.5&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Responses are streamed back through &lt;strong&gt;Gemini Live API&lt;/strong&gt;
&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This enables &lt;strong&gt;low-latency conversational tutoring grounded in the document.&lt;/strong&gt;&lt;/p&gt;


&lt;h2&gt;
  
  
  Technology Stack
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Layer&lt;/th&gt;
&lt;th&gt;Technology&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Frontend&lt;/td&gt;
&lt;td&gt;React&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Backend&lt;/td&gt;
&lt;td&gt;Python + FastAPI&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;AI Models&lt;/td&gt;
&lt;td&gt;Gemini Flash 2.5 + Gemini Live API&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Retrieval&lt;/td&gt;
&lt;td&gt;Vertex AI Embeddings + Vector Search&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Storage&lt;/td&gt;
&lt;td&gt;Cloud Storage&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Database&lt;/td&gt;
&lt;td&gt;Firestore&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Hosting&lt;/td&gt;
&lt;td&gt;Google Cloud Run&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Frontend Hosting&lt;/td&gt;
&lt;td&gt;Firebase Hosting&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Infrastructure&lt;/td&gt;
&lt;td&gt;Terraform&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;CI/CD&lt;/td&gt;
&lt;td&gt;GitHub + GitHub Actions&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;


&lt;h2&gt;
  
  
  Lessons Learned
&lt;/h2&gt;
&lt;h3&gt;
  
  
  Context matters
&lt;/h3&gt;

&lt;p&gt;AI responses become far more useful when grounded in the exact material a user is studying.&lt;/p&gt;
&lt;h3&gt;
  
  
  Voice interaction changes the learning experience
&lt;/h3&gt;

&lt;p&gt;Talking to a tutor is much more natural than typing questions.&lt;/p&gt;
&lt;h3&gt;
  
  
  RAG improves reliability
&lt;/h3&gt;

&lt;p&gt;Retrieval ensures responses stay aligned with the document instead of drifting into generic AI explanations.&lt;/p&gt;
&lt;h3&gt;
  
  
  Architecture matters
&lt;/h3&gt;

&lt;p&gt;Combining retrieval, AI reasoning, and live interaction requires careful design — but when done well it creates powerful learning experiences.&lt;/p&gt;


&lt;h2&gt;
  
  
  What's Next
&lt;/h2&gt;

&lt;p&gt;🐢 Mentori is just getting started.&lt;/p&gt;

&lt;p&gt;Future improvements include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;learning progress tracking
&lt;/li&gt;
&lt;li&gt;personalized learning paths
&lt;/li&gt;
&lt;li&gt;diagram and chart understanding
&lt;/li&gt;
&lt;li&gt;collaborative study sessions
&lt;/li&gt;
&lt;li&gt;integration with learning platforms
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The vision is simple:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;every document should become a personalized learning experience.&lt;/strong&gt;&lt;/p&gt;


&lt;h2&gt;
  
  
  Try It
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Project repository&lt;/strong&gt; &lt;br&gt;
&lt;em&gt;&lt;a href="https://github.com/HarshiniHegde/Mentrova" rel="noopener noreferrer"&gt;https://github.com/HarshiniHegde/Mentrova&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Demo video&lt;/strong&gt; &lt;br&gt;


  &lt;iframe src="https://www.youtube.com/embed/RDrLC7E86Yo"&gt;
  &lt;/iframe&gt;


&lt;/p&gt;




&lt;h2&gt;
  
  
  👥 &lt;strong&gt;Creators&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Mentori was built by:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Harshini Hegde&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
🔗 LinkedIn: &lt;a href="https://www.linkedin.com/in/harshini-hegde-9806797a/" rel="noopener noreferrer"&gt;https://www.linkedin.com/in/harshini-hegde-9806797a/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Rishi Muruganandha&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
🔗 LinkedIn: &lt;a href="https://www.linkedin.com/in/rishi-muruganandha/" rel="noopener noreferrer"&gt;https://www.linkedin.com/in/rishi-muruganandha/&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Final Thoughts
&lt;/h2&gt;

&lt;p&gt;We often think about AI as a tool that answers questions.&lt;br&gt;
But in education, the real opportunity is something different.&lt;br&gt;
Not an AI that answers questions.&lt;br&gt;
An AI that &lt;strong&gt;teaches.&lt;/strong&gt;&lt;br&gt;
🐢 Mentori is a small step in that direction.&lt;/p&gt;

</description>
      <category>geminiliveagentchallenge</category>
      <category>googlecloud</category>
      <category>ai</category>
      <category>learning</category>
    </item>
  </channel>
</rss>
