DEV Community

Porus
Porus

Posted on

I Was Tired of Manual Video Editing — So I Built OmniVid Lite

****🚀 OmniVid Lite — Text-to-Video, Without the Complexity

Video creation is painful.

You open CapCut or Premiere…
drag timelines… adjust timing… export… fix something… export again.

It feels outdated — especially when LLMs can already generate text, code, and images from a single prompt.

So I built OmniVid Lite — a lightweight, fast, developer-friendly way to generate simple AI-powered videos using nothing but text instructions.


🎯 The Goal

Not another "all-in-one AI video editor."

Just a clean minimal workflow:

Text prompt ➝ API ➝ Video

No GPU-heavy diffusion models.

Instead, OmniVid Lite focuses on:

05–30 sec explanatory videos

Subtitles + narration

Basic scene transitions

Simple assets (text, shapes, images)

Think:
"Explain Kubernetes in 20 seconds" → Auto-generated animation.


🛠️ Tech Stack

Backend: FastAPI

Frontend: React

Video Rendering: Python + Manim (for animation logic)

Model: LLM (prompt → animation config)

Auth: API key based

I wanted a system that a normal developer can deploy, extend, and hack — without depending on closed SaaS APIs.


🧩 Core Architecture

User Prompt

LLM Prompt Parser (Python)

Scene Config (JSON)

Renderer (Manim)

Final .mp4 Output

The LLM doesn’t generate raw video — that would be slow and unstable.

Instead, it generates a structured scene description, like:

{
"duration": 8,
"scenes": [
{
"type": "text",
"content": "What is Blockchain?",
"animation": "fade_in"
}
]
}

This config is deterministic and editable.

That makes OmniVid Lite programmable and predictable, unlike "random AI video generators."


🧪 Example Prompt

Create a 10-second explanatory video about how DNS works.
Use subtitles, simple animations, and show request flow visually.

Output:
A short animated video showing:

browser → DNS resolver → name server → IP response.

No manual editing. No timeline.


📦 Current Features

Text-to-video via /api/v1/render

Simple Manim-based animations

Subtitles

React UI with prompt builder

API key auth

Async background rendering


🧭 What’s Next?

Voice-over generation

Image → animated explainer

Template library

Drag-and-drop scene editor

Blender-based advanced export mode

Not promising hype — just building.


🔗 Repo

https://github.com/Abhishek-mule/omnivid-lite.git


💡 Final Thought

AI won’t replace video editors.

But tools like OmniVid Lite will replace repetitive, low-creativity editing — so humans can focus on storytelling, not timeline scrubbing.

If you build videos, teach online, or work on developer tooling — there’s a lot to explore here.

Top comments (0)