DEV Community

Cover image for How We Built an AI Manga Studio with Google Gemini in a Week

How We Built an AI Manga Studio with Google Gemini in a Week

Olamiposi on March 13, 2026

We created this piece of content for the purposes of entering the Gemini Live Agent Challenge. #GeminiLiveAgentChallenge Enpitsu (鉛筆) — A...
Collapse
 
theycallmeswift profile image
Swift

This is awesome. Do you have examples of generated outputs? Would be nice to include in the post if you have time to grab some!

Collapse
 
system625 profile image
Olamiposi • Edited

Thanks so much! I've updated the post with a dedicated examples section, you can see the character model sheets Gemini generates, full storyboard pages with screentone and speech bubbles, and a sketch-to-manga before/after. The sketch conversion is honestly one of my favourite parts: you draw the rough composition and Gemini handles the inking, screentones, and line weight.

Collapse
 
theycallmeswift profile image
Swift

Wow! It's even better than I was expecting! Great work

Thread Thread
 
system625 profile image
Olamiposi

Thank you very much!🚀

Collapse
 
alfie_9536 profile image
Ronnie

"This is amazing work! I love how you handled character consistency with Gemini’s multimodal input — the sketch-to-manga feature is especially impressive. Really inspiring approach for anyone wanting to generate full manga with AI."

Collapse
 
system625 profile image
Olamiposi

Thanks so much! The multimodal input was honestly the key insight, passing character sheets as image references on every panel call is what made consistency actually work. Glad it's inspiring!

Collapse
 
wvanheemstra profile image
Willem van Heemstra

Hi, great stuff you are building! Your GitHub repository is not to be found anymore, unfortunately… Can you share, please?

Collapse
 
system625 profile image
Olamiposi

Hello, thank you!
Sorry about that, it was set to private initally. You can check it now

Collapse
 
mikewirth profile image
Michael Wirth

Very creative use of Gemini! Did you consider using a platform to make calling multiple AI models easier?

Collapse
 
system625 profile image
Olamiposi

Thanks! We did consider orchestration frameworks like LangChain early on, but decided against them for this project. Since we're going deep on a single provider (Gemini) rather than swapping models, the abstraction layer didn't buy us much — and the Google GenAI SDK's native async support and multimodal Part API were exactly what we needed for the character sheet consistency technique. Going direct also meant one less dependency and easier debugging when image generation behaved unexpectedly. For a multi-provider setup it'd be a different call though!

Collapse
 
finalsudoku profile image
Final

This really resonated with me.

Collapse
 
system625 profile image
Olamiposi

Really appreciate you saying that! It was a fun week of building, hope it sparks some ideas for your own projects.

Collapse
 
shane_coughlan profile image
Shane Coughlan

It was very interesting to read about some of the transformations and adjustments for consistency you used. I wanted to make one suggestion: adding a license to the GitHub code so that it is clear if and how others can use it. I didn’t see a license when I checked it out, but please do let me know if I missed it. MIT seems to be a popular default for many AI-related projects.