<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Vikas G.</title>
    <description>The latest articles on DEV Community by Vikas G. (@vikas_g_a248a99bee33e4d3).</description>
    <link>https://dev.to/vikas_g_a248a99bee33e4d3</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3871581%2F3d65a451-b64a-4c62-b0de-e1b3d3baf6df.png</url>
      <title>DEV Community: Vikas G.</title>
      <link>https://dev.to/vikas_g_a248a99bee33e4d3</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/vikas_g_a248a99bee33e4d3"/>
    <language>en</language>
    <item>
      <title>I Built a Free Call Quality Audit Tool Using OpenAI Whisper</title>
      <dc:creator>Vikas G.</dc:creator>
      <pubDate>Fri, 10 Apr 2026 10:57:15 +0000</pubDate>
      <link>https://dev.to/vikas_g_a248a99bee33e4d3/i-built-a-free-call-quality-audit-tool-using-openai-whisper-49a7</link>
      <guid>https://dev.to/vikas_g_a248a99bee33e4d3/i-built-a-free-call-quality-audit-tool-using-openai-whisper-49a7</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR:&lt;/strong&gt; I built &lt;a href="https://www.boloaurlikho.com" rel="noopener noreferrer"&gt;boloaurlikho.com&lt;/a&gt; — a free tool that transcribes calls and runs quality audits on them. Supports 99+ languages, no signup required. Here's the technical journey and what I learned about building AI-powered QA tools.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Problem I Was Solving
&lt;/h2&gt;

&lt;p&gt;I run a telecalling operation in India. Our QA process was painful — team leads manually listening to call recordings, filling Excel scorecards, spending 3x the call duration just to audit one conversation. At 500+ calls/day, we were only auditing about 5% of total volume.&lt;/p&gt;

&lt;p&gt;I wanted something that could:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Transcribe every call automatically (Hindi, English, Hinglish — we use all three)&lt;/li&gt;
&lt;li&gt;Flag compliance issues without human review&lt;/li&gt;
&lt;li&gt;Score calls on parameters like greeting, pitch delivery, objection handling, and closure&lt;/li&gt;
&lt;li&gt;Work without expensive enterprise contracts&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  The Tech Stack
&lt;/h2&gt;

&lt;p&gt;The core is OpenAI's Whisper model for speech-to-text. Whisper's multilingual capability was the deciding factor — most transcription APIs choke on code-switched Indian English. Whisper handles "aap ka account number bata dijiye, I'll check the status" without breaking.&lt;/p&gt;

&lt;p&gt;On top of transcription, I built audit layers:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Keyword compliance&lt;/strong&gt;: Checks if mandatory disclosures, greetings, and CTAs were spoken&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Sentiment analysis&lt;/strong&gt;: Tracks tone shifts through the conversation&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Dead air detection&lt;/strong&gt;: Flags excessive silence (usually means agent was scrambling)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Timestamp markers&lt;/strong&gt;: So reviewers can jump to problem spots instead of listening to full calls&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  What Surprised Me
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;1. Whisper's accuracy on Indian accents is genuinely impressive.&lt;/strong&gt; We tested against Google Speech-to-Text and AWS Transcribe — Whisper won on Hindi and mixed-language content by a significant margin.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. The audit layer is more valuable than the transcription.&lt;/strong&gt; Everyone builds transcription tools. The real unlock is what you do with the text after. Automated scoring against custom QA parameters saves 80%+ of manual review time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. People use it for things I never expected.&lt;/strong&gt; Sales teams auditing their own cold calls. Podcast creators checking interview quality. Students verifying lecture transcription accuracy. A lawyer transcribing witness depositions.&lt;/p&gt;

&lt;h2&gt;
  
  
  Try It
&lt;/h2&gt;

&lt;p&gt;The tool is free at &lt;a href="https://www.boloaurlikho.com" rel="noopener noreferrer"&gt;boloaurlikho.com&lt;/a&gt;. No signup, no paywall. Supports MP3, WAV, M4A, OGG, WEBM, FLAC. Currently handles files up to 20 minutes, with longer audio support coming soon.&lt;/p&gt;

&lt;p&gt;If you're building something similar or have questions about the Whisper integration, happy to chat in the comments.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>automation</category>
      <category>openai</category>
      <category>showdev</category>
    </item>
  </channel>
</rss>
