v. Splicer

Posted on Mar 21

I Built a Free AI Pipeline for YouTube Shorts Using FFmpeg

#showdev #sideprojects #ai #ffmpeg

I set a constraint before I had a plan.

No subscriptions. No credits ticking down in the background. No platforms deciding how many videos I was allowed to make this week.

If I was going to produce YouTube Shorts at any real volume, it had to run locally, it had to be repeatable, and it had to cost nothing beyond the machine I was already using. That requirement stripped the landscape down fast.

Most “AI video tools” disappeared the moment you looked closely. What was left wasn’t polished. It wasn’t friendly. But it was honest.

_Raw utilities. _

Things that do exactly what you tell them and nothing more.

That’s how I ended up back at FFmpeg- not as a last resort, but as the only thing in the room that wasn’t trying to meter my output... and that could surprisingly be used by any AI model I threw it at.

The Constraint That Actually Mattered

I wasn’t trying to build a better editor.

I was trying to solve something much narrower and more stubborn.

I wanted a way to generate YouTube Shorts that was:

consistent
fast
scriptable
and completely free

That last part mattered more than expected.

Because once you start looking at “AI video tools,” you hit a wall almost immediately. Subscriptions. Credit systems. Watermarks. Quiet limits that only show up after you’ve already built a workflow around them.

You don’t own the pipeline. You’re renting it.

That didn’t sit right.

So the constraint became clear. No paid APIs. No locked platforms. No hidden ceilings. Just something local, controllable, and repeatable.

That’s what led me to FFmpeg.

The Moment Claude Changed the Equation

FFmpeg wasn’t new to me. It’s one of those tools you circle around for years. You know it’s powerful. You know it’s everywhere. But you only touch it when you have to, because the syntax feels like it was designed to resist you.

Then I realized something simple.

Claude could write FFmpeg commands.

Not vaguely. Not “kind of correct.” It could generate full, working pipelines if you described the outcome clearly enough.

That shifted the entire problem.

Because now the hardest part of FFmpeg, the part that keeps most people out, the precision of the syntax, was no longer a blocker. It was something you could offload.

At that point, the idea stopped being optional.

If you can describe a YouTube Short in plain language, and something can translate that into a working FFmpeg command, then the entire editing process becomes… unnecessary.

Not simplified. Removed.

So I built around that.

Check it out here--> https://github.com/numbpill3d/ffmpeg-ai

What ffmpeg-ai Actually Does

At a surface level, it looks straightforward.

You describe the Short you want. The system generates an FFmpeg command. Then it runs it.

But that description misses what’s actually happening.

What it really does is collapse the distance between idea and output.

No timeline. No editor. No manual resizing to 9:16. No guessing export settings. No opening five different tools just to glue together something that should have been trivial.

You move from:

“Let me edit this into a Short”

“Give me a Short that looks like this”

And that difference is not cosmetic. It changes how you approach the entire workflow.

Shortform Videos Are a Perfect Target for This

Long-form video is messy. It benefits from visual control. You need to see pacing, cuts, emotional beats.

Shorts are different.

They’re structured. Predictable. Almost formulaic in a way people don’t like to admit.

A typical Short/TikTok/Reel involves:

A source clip or segment
A vertical crop
Some form of captioning or overlay
Compression tuned for fast upload

It’s a pipeline. The same pipeline, over and over again, with minor variations.

That makes it ideal for automation.

The problem is that most people still run that pipeline manually through tools that were not designed for repetition at scale.

ffmpeg-ai leans into that structure instead of hiding it.

The Quiet Advantage of Free

There’s a psychological shift that happens when your entire pipeline is free.

Not “free until you hit a limit.”

Not “free with watermarks.”

Actually free.

You stop optimizing for cost. You stop rationing usage. You stop thinking in terms of credits.

You can generate ten versions of the same Short without hesitation. You can experiment aggressively. You can build systems that would be financially irrational on paid platforms.

That freedom matters more than people expect.

Because it changes behavior.

And behavior is what determines whether a system actually gets used.

This Isn’t About Replacing Editors

It’s worth being clear about this.

If you enjoy editing, if you rely on visual feedback, if your work depends on fine-grained control over timing and composition, this is not a replacement.

This is something else.

It’s for the moments where editing becomes mechanical. Where the creative decision has already been made, and what remains is execution.

That execution is where most time gets lost.

ffmpeg-ai exists to remove that layer.

Where It Starts to Compound

The first time you use something like this, it feels like a shortcut.

You generate a Short faster. You skip a few steps. It’s convenient.

The second time, you start to notice patterns.

You realize you’re asking for the same transformations repeatedly. The same crops, the same styles, the same output formats.

That’s when it shifts from a tool into a system.

Because now you can start standardizing those transformations. Reusing them. Chaining them. Embedding them into scripts that process multiple clips without intervention.

At that point, you’re not “making Shorts” anymore.

You’re running a pipeline.

The Part Most People Miss

The interesting part isn’t that AI can generate FFmpeg commands.

It’s that once you trust that translation, you stop thinking in terms of tools entirely.

You start thinking in terms of outcomes.

That sounds abstract, but it has practical consequences.

You no longer ask, “which software should I use for this?”

You ask, “what needs to happen to this media?”

And once you can answer that clearly, the system can handle the rest.

That separation is subtle, but it’s where most of the leverage comes from.

Limitations That Don’t Disappear

There’s a temptation to treat this like a clean solution. It isn’t.

If your description is sloppy, the output will be off. If you don’t understand what makes a Short effective, no amount of automation will fix that. You still need judgment.

And FFmpeg remains strict. It will fail loudly if something doesn’t make sense. It won’t hold your hand.

What this removes is not difficulty, but friction.

You still need to know what you want.

Why I Had to Build It

This wasn’t a “nice to have.”

It was one of those things that becomes obvious once you see it.

The pieces were already there. FFmpeg. Local execution. AI that can translate intent into commands.

The only thing missing was the glue.

And once you see that gap clearly enough, it’s hard to ignore. You either keep working around it, or you close it.

I closed it.

Try It Yourself

The project is here:

---> https://github.com/numbpill3d/ffmpeg-ai

Run it locally. Break it. Push it in directions I didn’t.

Don’t treat it like a finished product. It’s more like a pressure point. Something that exposes how much of your workflow is still manual out of habit.

Final Thought

Most people assume the future of content creation is better tools.

Cleaner interfaces. Faster editors. Smarter features.

I think it’s thinner than that.

The tools don’t get better. They get quieter.

They move out of the way until all that’s left is a description and a result.

And somewhere in that shift, editing stops being a task you perform and becomes something that just… happens.

Once you get used to that, going back feels unnecessarily slow.

DEV Community