DEV Community

Abhi
I Was Tired of Manual Video Editing, So I Built OmniVid Lite

🚀 OmniVid Lite: Text-to-Video, Without the Complexity

Video creation is painful.

You open CapCut or Premiere…
drag timelines… adjust timing… export… fix something… export again.

It feels outdated, especially when LLMs can already generate text, code, and images from a single prompt.

So I built OmniVid Lite: a lightweight, fast, developer-friendly way to generate simple AI-powered videos using nothing but text instructions.


🎯 The Goal

Not another "all-in-one AI video editor."

Just a clean minimal workflow:

Text prompt → API → Video

No GPU-heavy diffusion models.

Instead, OmniVid Lite focuses on:

5–30 second explanatory videos

Subtitles + narration

Basic scene transitions

Simple assets (text, shapes, images)

Think:
"Explain Kubernetes in 20 seconds" → Auto-generated animation.


🛠️ Tech Stack

Backend: FastAPI

Frontend: React

Video Rendering: Python + Manim (for animation logic)

Model: LLM (prompt → animation config)

Auth: API key based

I wanted a system that a normal developer can deploy, extend, and hack, without depending on closed SaaS APIs.


🧩 Core Architecture

User Prompt
↓
LLM Prompt Parser (Python)
↓
Scene Config (JSON)
↓
Renderer (Manim)
↓
Final .mp4 Output
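
To make the flow in the diagram concrete, here is a minimal sketch of the stages wired together. The function names, the stage signatures, and the stand-in LLM and renderer are all hypothetical; they only illustrate the hand-offs, not how OmniVid Lite is actually organized.

```python
import json
from typing import Callable

# Hypothetical stage signatures; the real project may organize these differently.
LLMCall = Callable[[str], str]    # prompt text -> raw JSON string
Renderer = Callable[[dict], str]  # scene config -> path of the rendered .mp4


def generate_video(prompt: str, llm: LLMCall, render: Renderer) -> str:
    """Run prompt -> scene config -> video, mirroring the diagram."""
    raw = llm(prompt)              # LLM prompt parser stage
    config = json.loads(raw)       # scene config (JSON) stage
    if not config.get("scenes"):
        raise ValueError("LLM returned a config with no scenes")
    return render(config)          # renderer stage


# Stand-ins so the sketch runs without an LLM or Manim installed.
def fake_llm(prompt: str) -> str:
    return json.dumps({
        "duration": 8,
        "scenes": [{"type": "text", "content": prompt, "animation": "fade_in"}],
    })


def fake_render(config: dict) -> str:
    return f"out_{len(config['scenes'])}_scenes.mp4"


print(generate_video("What is Blockchain?", fake_llm, fake_render))
```

Passing the LLM and the renderer in as plain callables keeps each stage swappable and easy to test on its own.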

The LLM doesn’t generate raw video; that would be slow and unstable.

Instead, it generates a structured scene description, like:

{
  "duration": 8,
  "scenes": [
    {
      "type": "text",
      "content": "What is Blockchain?",
      "animation": "fade_in"
    }
  ]
}

Rendering from this config is deterministic, and the config itself is editable.

That makes OmniVid Lite programmable and predictable, unlike "random AI video generators."
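
Because the config is plain JSON, it can be validated before anything is rendered. The sketch below is one possible way to do that, not the project's actual code: the class names, the animation whitelist (only "fade_in" appears in this post), and the text-only restriction are assumptions. The `render` function uses Manim Community and imports it lazily, so the validation part runs even without Manim installed.

```python
import json
from dataclasses import dataclass

# Assumed whitelist; "fade_in" is the only animation named in the post.
ALLOWED_ANIMATIONS = {"fade_in", "write"}


@dataclass(frozen=True)
class SceneSpec:
    type: str
    content: str
    animation: str


@dataclass(frozen=True)
class VideoConfig:
    duration: float
    scenes: tuple


def parse_config(raw: str) -> VideoConfig:
    """Validate LLM output before it reaches the renderer."""
    data = json.loads(raw)
    duration = float(data["duration"])
    if duration <= 0:
        raise ValueError("duration must be positive")
    # Unknown or missing keys fail here with a TypeError.
    scenes = tuple(SceneSpec(**scene) for scene in data["scenes"])
    if not scenes:
        raise ValueError("at least one scene is required")
    for scene in scenes:
        if scene.type != "text":  # this sketch only handles text scenes
            raise ValueError(f"unsupported scene type: {scene.type}")
        if scene.animation not in ALLOWED_ANIMATIONS:
            raise ValueError(f"unsupported animation: {scene.animation}")
    return VideoConfig(duration=duration, scenes=scenes)


def render(config: VideoConfig) -> None:
    """Render text scenes with Manim Community (imported lazily)."""
    from manim import FadeIn, Scene, Text, Write, tempconfig

    animations = {"fade_in": FadeIn, "write": Write}
    per_scene = config.duration / len(config.scenes)  # equal time per scene

    class ConfigScene(Scene):
        def construct(self):
            for spec in config.scenes:
                text = Text(spec.content)
                intro = min(1.0, per_scene / 2)
                self.play(animations[spec.animation](text), run_time=intro)
                self.wait(per_scene - intro)  # hold for the rest of the slot
                self.remove(text)

    with tempconfig({"quality": "low_quality"}):
        ConfigScene().render()


raw = """{"duration": 8, "scenes": [
  {"type": "text", "content": "What is Blockchain?", "animation": "fade_in"}]}"""
config = parse_config(raw)
print(config.duration, config.scenes[0].content)
```

With Manim installed, `render(config)` would write the video into Manim's media output directory. Rejecting unknown animations up front means a bad LLM response fails fast instead of halfway through a render.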


🧪 Example Prompt

Create a 10-second explanatory video about how DNS works.
Use subtitles, simple animations, and show request flow visually.

Output:
A short animated video showing:

browser → DNS resolver → name server → IP response.

No manual editing. No timeline.


📦 Current Features

Text-to-video via /api/v1/render

Simple Manim-based animations

Subtitles

React UI with prompt builder

API key auth

Async background rendering
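
As a rough idea of what calling the render endpoint could look like from a script, here is a standard-library sketch. Only the `/api/v1/render` path comes from the feature list; the host, the `X-API-Key` header name, and the `{"prompt": ...}` body are assumptions, so check the repo for the real request shape.

```python
import json
import urllib.request

# Assumptions: local dev host, an "X-API-Key" header, and a {"prompt": ...} body.
BASE_URL = "http://localhost:8000"


def build_render_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST to the render endpoint."""
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/api/v1/render",
        data=body,
        method="POST",
        headers={"Content-Type": "application/json", "X-API-Key": api_key},
    )


request = build_render_request("Explain Kubernetes in 20 seconds", "my-key")
print(request.get_method(), request.full_url)

# To send it against a running server:
# with urllib.request.urlopen(request) as response:
#     print(json.load(response))
```

Since rendering runs in the background, the response is probably a job reference rather than the finished video, but the post does not describe the response format, so the sketch only prints whatever comes back.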


🧭 What’s Next?

Voice-over generation

Image → animated explainer

Template library

Drag-and-drop scene editor

Blender-based advanced export mode

Not promising hype, just building.


🔗 Repo

https://github.com/Abhishek-mule/omnivid-lite.git


💡 Final Thought

AI won’t replace video editors.

But tools like OmniVid Lite will replace repetitive, low-creativity editing, so humans can focus on storytelling, not timeline scrubbing.

If you build videos, teach online, or work on developer tooling, there’s a lot to explore here.
