<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Dhruv</title>
    <description>The latest articles on DEV Community by Dhruv (@mycmdhub).</description>
    <link>https://dev.to/mycmdhub</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3740582%2F3e011681-a820-432e-8515-e5471cbefae7.jpeg</url>
      <title>DEV Community: Dhruv</title>
      <link>https://dev.to/mycmdhub</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/mycmdhub"/>
    <language>en</language>
    <item>
      <title>From Voice Demo to Operational Voice Assistant: Reviving Ovela AI</title>
      <dc:creator>Dhruv</dc:creator>
      <pubDate>Wed, 03 Jun 2026 15:17:28 +0000</pubDate>
      <link>https://dev.to/mycmdhub/from-voice-demo-to-operational-voice-assistant-reviving-ovela-ai-2nlm</link>
      <guid>https://dev.to/mycmdhub/from-voice-demo-to-operational-voice-assistant-reviving-ovela-ai-2nlm</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/github-2026-05-21"&gt;GitHub Finish-Up-A-Thon Challenge&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Built
&lt;/h2&gt;

&lt;p&gt;Ovela AI started as a side project driven by a question that kept pulling me back:&lt;/p&gt;

&lt;p&gt;What would it take for a business to genuinely trust a voice AI system?&lt;/p&gt;

&lt;p&gt;At first, I thought the answer was simple: make conversations sound natural.&lt;/p&gt;

&lt;p&gt;The original prototype could answer calls, respond to questions, and carry a conversation reasonably well. From a technical perspective, it looked impressive.&lt;/p&gt;

&lt;p&gt;But after speaking with accommodation providers and small business owners, I realized I was focused on the wrong problem.&lt;/p&gt;

&lt;p&gt;Businesses don't trust a system because it sounds human.&lt;/p&gt;

&lt;p&gt;They trust it because it behaves reliably.&lt;/p&gt;

&lt;p&gt;Can it check availability correctly?&lt;/p&gt;

&lt;p&gt;Can it update reservations safely?&lt;/p&gt;

&lt;p&gt;Can it collect payments?&lt;/p&gt;

&lt;p&gt;Can it transfer a call when confidence is low?&lt;/p&gt;

&lt;p&gt;Can staff see exactly what happened afterward?&lt;/p&gt;

&lt;p&gt;That realization changed the direction of the project completely.&lt;/p&gt;

&lt;p&gt;Ovela AI evolved from a voice demo into an operational voice assistant designed to help businesses handle real customer interactions while keeping humans in control of important decisions.&lt;/p&gt;

&lt;p&gt;Today, Ovela can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Handle inbound phone calls&lt;/li&gt;
&lt;li&gt;Check room availability&lt;/li&gt;
&lt;li&gt;Create reservations&lt;/li&gt;
&lt;li&gt;Process payments through Stripe&lt;/li&gt;
&lt;li&gt;Answer property and local information questions&lt;/li&gt;
&lt;li&gt;Transfer calls when needed&lt;/li&gt;
&lt;li&gt;Keep staff synchronized through a management dashboard&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;More importantly, every improvement is guided by a simple principle:&lt;/p&gt;

&lt;p&gt;AI should support human operations, not blindly replace them.&lt;/p&gt;




&lt;h2&gt;
  
  
  Demo
&lt;/h2&gt;

&lt;p&gt;📞 Live Demo (Australia)&lt;/p&gt;

&lt;p&gt;Phone: +61 3 4823 6219&lt;/p&gt;

&lt;p&gt;Due to abuse protection and testing limits, availability may occasionally be restricted.&lt;/p&gt;

&lt;p&gt;Try asking:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;"Do you have any rooms available this weekend?"&lt;/li&gt;
&lt;li&gt;"Can I make a reservation?"&lt;/li&gt;
&lt;li&gt;"What attractions are nearby?"&lt;/li&gt;
&lt;li&gt;"What's the weather like today?"&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;🌐 Website: &lt;a href="https://ovela.dev" rel="noopener noreferrer"&gt;https://ovela.dev&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;🐙 GitHub Repository:&lt;a href="https://github.com/My-CMDhub/Ovela-AI" rel="noopener noreferrer"&gt;https://github.com/My-CMDhub/Ovela-AI&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  The Comeback Story
&lt;/h2&gt;

&lt;p&gt;Like many side projects, Ovela reached a point where the prototype worked well enough to demonstrate the idea.&lt;/p&gt;

&lt;p&gt;Then it sat untouched not because the project failed but because other priorities took over.&lt;/p&gt;

&lt;p&gt;Months later, after more conversations with business owners and more exposure to real operational challenges, I came back to the project with a very different perspective.&lt;/p&gt;

&lt;p&gt;The biggest lesson was surprisingly non-technical.&lt;/p&gt;

&lt;p&gt;The challenge isn't making AI speak.&lt;/p&gt;

&lt;p&gt;The challenge is making AI behave appropriately within human workflows.&lt;/p&gt;

&lt;p&gt;A real receptionist doesn't simply answer questions.&lt;/p&gt;

&lt;p&gt;They:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Recognize interruptions&lt;/li&gt;
&lt;li&gt;Acknowledge requests before acting&lt;/li&gt;
&lt;li&gt;Handle uncertainty&lt;/li&gt;
&lt;li&gt;Understand when information is missing&lt;/li&gt;
&lt;li&gt;Escalate sensitive situations&lt;/li&gt;
&lt;li&gt;Maintain context across an entire conversation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Most voice demos don't fail because speech recognition is poor.&lt;/p&gt;

&lt;p&gt;They fail because the operational behavior doesn't match what people expect from a trusted assistant.&lt;/p&gt;

&lt;p&gt;That became the focus of the revival.&lt;/p&gt;

&lt;h3&gt;
  
  
  What Changed
&lt;/h3&gt;

&lt;h4&gt;
  
  
  Multi-Agent Architecture
&lt;/h4&gt;

&lt;p&gt;The original system relied on a much simpler flow.&lt;/p&gt;

&lt;p&gt;The new version uses a multi-agent architecture built around Google's Agent Development Kit (ADK), allowing different agents to handle reservations, business operations, and information requests independently.&lt;/p&gt;

&lt;h4&gt;
  
  
  Lower Latency Conversations
&lt;/h4&gt;

&lt;p&gt;Voice interactions are highly sensitive to delays.&lt;/p&gt;

&lt;p&gt;Several architectural bottlenecks were removed to improve response times and reduce awkward pauses during calls.&lt;/p&gt;

&lt;h4&gt;
  
  
  Stronger Context Awareness
&lt;/h4&gt;

&lt;p&gt;One of the most interesting challenges was interruption handling.&lt;/p&gt;

&lt;p&gt;People interrupt constantly during real conversations.&lt;/p&gt;

&lt;p&gt;The system now maintains awareness of what information has already been spoken, allowing it to continue naturally instead of restarting or losing context.&lt;/p&gt;

&lt;h4&gt;
  
  
  Operational Reliability
&lt;/h4&gt;

&lt;p&gt;Reservation workflows, payment handling, availability checks, and dashboard synchronization were rebuilt to behave more like real business processes rather than isolated AI actions.&lt;/p&gt;

&lt;h4&gt;
  
  
  Abuse Protection
&lt;/h4&gt;

&lt;p&gt;Real phone systems attract misuse.&lt;/p&gt;

&lt;p&gt;Rate limits, call protections, and operational safeguards were added to prevent abuse while keeping legitimate usage frictionless.&lt;/p&gt;




&lt;h2&gt;
  
  
  My Experience with GitHub Copilot
&lt;/h2&gt;

&lt;p&gt;Returning to a codebase that has sat inactive for months is often harder than starting a brand new one. You inherit your own past decisions without fully remembering &lt;em&gt;why&lt;/em&gt; you made them. &lt;/p&gt;

&lt;p&gt;For the revival of Ovela AI, I didn't use GitHub Copilot as a simple autocomplete tool to write boilerplate code. Instead, I used it as a high-level engineering partner and data auditor to manage complex architectural shifts and harden my system’s reliability. &lt;/p&gt;

&lt;p&gt;Here are the two major ways Copilot helped me cross the finish line, backed by real-world interaction during my workflow development:&lt;/p&gt;

&lt;h3&gt;
  
  
  1. Translating Complex Systems into Architecture Diagrams
&lt;/h3&gt;

&lt;p&gt;As Ovela AI transitioned to a multi-agent setup, mapping out component connections, telephony triggers, and dashboard synchronization endpoints became a major cognitive bottleneck. I leveraged Copilot within my workspace as a principal solutions architect. By feeding it my core file dependencies, it mapped out a clean, production-ready system workflow directly in &lt;strong&gt;Mermaid.js&lt;/strong&gt; syntax for the repository documentation. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe86amleofdqd1e5x9e0s.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe86amleofdqd1e5x9e0s.png" alt="Architecture Diagrams Breakdown" width="799" height="493"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  2. Eval Hardening (The Supreme Judge)
&lt;/h3&gt;

&lt;p&gt;Building a reliable voice AI requires robust testing. I simulate conversations between two LLMs and dump the evaluation telemetry into local &lt;code&gt;.json&lt;/code&gt; log files. However, default automated grading scripts are notoriously prone to false positives (e.g., grading a hallucinated response highly simply because it sounded polite). &lt;/p&gt;

&lt;p&gt;I utilized Copilot as a &lt;strong&gt;Supreme AI Evaluation Auditor&lt;/strong&gt;. I passed it raw JSON conversation objects, prompting it to critically audit the automated scores, spot misleading feedback, and generate an adjusted "Supreme Score" with a bulleted logical justification. This drastically reduced noise in my evaluation pipeline.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fayr6o4bxcb4adbc3nhsc.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fayr6o4bxcb4adbc3nhsc.png" alt="Eval score finalisation" width="800" height="485"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fx66o9v9sn2of5417sogq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fx66o9v9sn2of5417sogq.png" alt="Solidifying eval score of simulation test" width="799" height="482"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Operational Code Polish &amp;amp; Balancing
&lt;/h3&gt;

&lt;p&gt;Beyond these two core pillars, Copilot served as an excellent "cleanup crew" throughout this journey even after hitting rate limits ✋. It assisted in tracking down legacy typing issues, reviewing asynchronous edge cases, and generating clean inline documentation. &lt;/p&gt;

&lt;p&gt;Ultimately, the biggest value Copilot provided wasn't just writing lines of code faster but it was accelerating complex architectural decisions and data validation when reviving a stale codebase.&lt;/p&gt;




&lt;h2&gt;
  
  
  What I Learned
&lt;/h2&gt;

&lt;p&gt;The most valuable lesson wasn't technical.&lt;/p&gt;

&lt;p&gt;It was understanding the difference between a convincing demo and a useful product.&lt;/p&gt;

&lt;p&gt;A demo succeeds when the AI says the right thing.&lt;/p&gt;

&lt;p&gt;A business system succeeds when the right thing actually happens afterward.&lt;/p&gt;

&lt;p&gt;That distinction changed how I think about voice AI.&lt;/p&gt;

&lt;p&gt;Natural conversation matters.&lt;/p&gt;

&lt;p&gt;Latency matters.&lt;/p&gt;

&lt;p&gt;Speech quality matters.&lt;/p&gt;

&lt;p&gt;But trust matters more.&lt;/p&gt;

&lt;p&gt;Trust comes from reliability, transparency, and knowing when humans should remain part of the process.&lt;/p&gt;

&lt;p&gt;I don't believe current voice AI systems perfectly replicate human interaction, and that's not really the goal.&lt;/p&gt;

&lt;p&gt;What interests me is the space between humans and AI:&lt;/p&gt;

&lt;p&gt;How can AI handle repetitive operational work while humans remain responsible for judgment, relationships, and important decisions?&lt;/p&gt;

&lt;p&gt;Reviving Ovela helped me explore that question far more deeply than when I first started the project.&lt;/p&gt;

&lt;p&gt;And honestly, that's what made finishing it worthwhile.&lt;/p&gt;

</description>
      <category>devchallenge</category>
      <category>githubchallenge</category>
      <category>agents</category>
      <category>github</category>
    </item>
  </channel>
</rss>
