DEV Community

Cover image for From Audio to Diagram
bsorrentino
bsorrentino

Posted on

From Audio to Diagram

This is a submission for the AssemblyAI Challenge : Sophisticated Speech-to-Text.

What I Built

An Application that allow from an Audio concerning a discussion, a meeting, etc ... to generate a "meaningful mind-map diagram", that represent the touched key points. This representation joined with summary provide a more complete and understandable informations

Demo

The application in available here for access to full functionality you need both an AssemblyAI Api Key and a OpenAI Api Key. Below there are some representative screenshots

Settings

Settings

Upload Audio

Upload Audio

Transcribe Audio

Transcribe Audio

Generate Mindmap Diagram

Generate Mindmap Diagram

Journey

To implement process from audio to diagram I have developed several "skilled agents" described below:

  1. transcribe-from-audio: this agent use AssemblyAI transcripts API to transcribe the provided audio.

  2. keypoints-from-transcript: this Agent use OpenAI (got-4o-mini) to extract the Keypoints inside the given transcription

  3. summary-to-mindmap: this agent use OpenAI (got-4o-mini) to arrange the key points in a kind of ontology providing a hierarchical representation of information

  4. mindmap-to-mermaid: last agent transform the mind-map representation in a mermaid syntax ready for the visualization

Diagram of Agentic Architecture

Image description

Top comments (2)

Collapse
 
bill_ec71da0eaea845fff0d0 profile image
Bill

I can think of several long-winded, recorded meetings in which key ideas presented in a schema like this would be highly beneficial. Nice job

Collapse
 
bsorrentino profile image
bsorrentino

Hi @bill_ec71da0eaea845fff0d0 thank you, however this is just one of the possible use cases of this process, the possibilities are truly limitless