DEV Community

Sonu Kushwaha

Freelancer Transcription Tool - Transform Audio to Text with AssemblyAI 🚀

This is a submission for the AssemblyAI Challenge: Sophisticated Speech-to-Text.

What I Built

The Freelancer Transcription Tool is a user-friendly application that converts audio into timestamped text. Built for the AssemblyAI Challenge, it adds search, export, sentiment analysis, and speaker labeling on top of transcription, with freelancers, researchers, and content creators in mind.

Demo

Live Demo 🎉

GitHub Link

GitHub

Journey

Participating in the AssemblyAI Challenge, I aimed to create a tool that not only transcribes audio but also provides real-time sentiment analysis, speaker labeling, and export options. By leveraging AssemblyAI's Universal-2 Speech-to-Text model, I was able to incorporate powerful transcription and analysis features seamlessly.

Key Features:

  1. File Upload & Playback:
    Drag-and-drop or file picker for easy audio uploads.
    Smooth progress bar indicating upload status.
    Local audio playback with intuitive controls.

  2. Transcription with Timestamps:
    Generate transcription with optional timestamps.
    Real-time manual timestamp addition and clickable timestamps.

  3. Search Functionality:
    Search specific words in the transcription, with results highlighted dynamically.

  4. Export Options:
    Export transcription as .TXT or .PDF, with or without timestamps.

  5. Sentiment Analysis:
    Analyze sentiment (Positive, Neutral, Negative) at the sentence level.

  6. Speaker Labeling:
    Differentiate speakers with AssemblyAI's speaker labeling feature.

  7. Enhanced UI/UX:
    Fully responsive design with dark mode toggle.
    Real-time playback highlighting synced with transcription.

  8. Editable Transcription:
    Edit transcriptions directly in the UI, with the option to remove timestamps.
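
To make a few of the features above concrete, here is a minimal Python sketch of timestamp formatting, search highlighting, text export with or without timestamps, and a sentiment tally. It is not the code behind the tool. The segment shape (`start` in milliseconds, `speaker`, `sentiment`, `text`), the sample data, and every function name are assumptions made for illustration, and the `**` markers stand in for whatever highlight markup a real UI would use.

```python
import re
from collections import Counter

# Hypothetical transcript segments; the field names are assumptions,
# not a confirmed response format.
SEGMENTS = [
    {"start": 0, "speaker": "A", "sentiment": "POSITIVE",
     "text": "Thanks for joining the call."},
    {"start": 4250, "speaker": "B", "sentiment": "NEUTRAL",
     "text": "Let's review the invoice."},
    {"start": 65500, "speaker": "A", "sentiment": "NEGATIVE",
     "text": "The invoice is late again."},
]


def format_timestamp(ms: int) -> str:
    """Convert milliseconds to MM:SS (minutes are not capped at 59)."""
    total_seconds = ms // 1000
    return f"{total_seconds // 60:02d}:{total_seconds % 60:02d}"


def render_transcript(segments, with_timestamps: bool = True) -> str:
    """Build the plain-text body of a .TXT export, one line per segment."""
    lines = []
    for seg in segments:
        prefix = f"[{format_timestamp(seg['start'])}] " if with_timestamps else ""
        lines.append(f"{prefix}Speaker {seg['speaker']}: {seg['text']}")
    return "\n".join(lines)


def highlight(text: str, query: str) -> str:
    """Wrap every case-insensitive match of `query` in ** markers."""
    if not query:
        return text
    # re.escape keeps characters such as "." or "?" in the query literal.
    pattern = re.compile(re.escape(query), re.IGNORECASE)
    return pattern.sub(lambda m: f"**{m.group(0)}**", text)


def sentiment_counts(segments) -> Counter:
    """Count how many segments carry each sentiment label."""
    return Counter(seg["sentiment"] for seg in segments)
```

Calling `render_transcript(SEGMENTS, with_timestamps=False)` gives the export without timestamps, and `highlight(segment["text"], "invoice")` marks the search hits in a single segment. Keeping timestamps as raw milliseconds and formatting them only at render time means the same data can drive clickable timestamps, playback syncing, and both export variants.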

Team:

It's just me. A one-person army.

Project Images:

Home page

Paste your audio

Convert to text

Here is the result
