🚀 Turning Videos into Interactive Courses with AI

How I built VidSabio.com by combining multiple LLMs into a robust processing pipeline

I recently launched VidSabio, a platform that transforms ordinary videos into structured, interactive learning experiences — automatically. 🎥➡️📚

But this didn’t come together overnight.

🧪 Trial, Error, and... More Trial

At first, I thought a single LLM could handle everything: analyze the video, transcribe it, generate quizzes, build a course. Easy, right?

Wrong.

It turned out the real breakthrough came when I stopped looking for one model to do it all — and instead built a pipeline of specialized steps, each powered by the best tool for that specific task.

🧠 The Flow

Here’s the high-level breakdown of how it works:

User uploads a video
Video is processed through a multi-step pipeline:

Frame extraction (FFmpeg)
Audio transcription (Google Speech-to-Text)
Visual analysis (Gemini / GPT-4 Vision)
Content summarization + topic generation (Gemini Pro)
Course structure generation (GPT-4)
Interactivity injection (quizzes, accordions, etc.)
1. Final course can be edited and exported as SCORM or HTML

This is all built using Vue 3, Firebase, and various LLMs (OpenAI + Gemini) stitched together through Cloud Functions.

✨ The Magic Was in the Combo

By distributing the heavy lifting across multiple LLMs and APIs, I was able to:

Improve accuracy
Scale more efficiently
Debug and retry isolated steps
Customize outputs based on step-level feedback

🔧 Stack

Frontend: Vue 3 + Tailwind
Backend: Firebase Functions + Firestore + Storage
AI: GPT-4.1, Gemini Pro, Whisper, Google Speech-to-Text
Analytics: PostHog
Export: SCORM, HTML

🔍 Visual Overview

Here's a stylized diagram of the flow:

(Insert your HappyAlien-styled image here)

💬 Final Thoughts

This was built with plenty of detours, dead ends, and "aha" moments. If you're building with LLMs, I encourage you to explore multi-model orchestration — it might be the key to your next breakthrough.

Check it out: https://vidsabio.com
Feel free to reach out if you’re curious or building something similar!