AutoVio is an open-source text-to-video pipeline that connects multiple AI providers to automate video production end-to-end.
How it works:
Write a text prompt or upload a reference video
AI analyzes style, tone, and structure (vision models)
Generates a scene-by-scene scenario with image and video prompts
Creates visuals using Gemini, DALL-E, or other image models
Converts images to video clips with Veo, Runway, or similar
Edit in a timeline editor: add text overlays, transitions, templates
Export the final MP4
Built for integration: Full REST API with OpenAPI docs, plus an MCP server that works with Claude Desktop, Cursor, Claude Code, and automation tools like n8n.
Tech stack: TypeScript monorepo (Express backend, React frontend), MongoDB, FFmpeg for rendering.
Perfect for product teams automating demo videos, developers building video features, or anyone who wants AI-assisted video creation without the complexity.
Auto-Vio
/
autovio
Open-source AI video pipeline. Text prompt → scenario → images → video clips → editor → MP4. Self-hosted, multi-provider, MCP-ready.
English | 简体中文 | 繁體中文 | 한국어 | Deutsch | Español | Français | Italiano | Dansk | 日本語 | Polski | Русский | Bosanski | العربية | Norsk | Português (Brasil) | ไทย | Türkçe | Українська | বাংলা | Ελληνικά | Tiếng Việt | हिन्दी
AutoVio
Open-source AI video generation pipeline for SaaS teams and developers.
From a text prompt to a finished video — scenario, images, clips, editing, export
📖 Docs · 🚀 Quick Start · 📡 API · 🤖 MCP Server
What is AutoVio?
Most AI tools handle one step of video creation. AutoVio handles the whole thing.
You describe what you want — a product, an idea, a story. AutoVio writes the scene-by-scene scenario, generates an image for each scene, animates those images into video clips, and assembles everything in a timeline editor. You export a finished MP4. Especially useful for SaaS product demos, feature…

Top comments (0)