DEV Community

Cover image for captio
Adir Shohat
Adir Shohat

Posted on

captio

I built an AI tool that turns images into structured text (Captio)

I’ve been working on a small side project called Captio.

The idea is simple:
you upload an image, and it turns it into clean, structured text.

It works with:

  • product photos
  • screenshots / UI
  • documents
  • posters
  • portraits

And generates:

  • title
  • key points
  • description
  • a clean summary

Why I built this

I kept running into the same problem:

Writing text from visuals is annoying.

Whether it's:

  • describing a product
  • explaining a screenshot
  • summarizing a document

it always takes more time than it should.

So I built a tool to automate that.


How it works

  1. Upload an image
  2. Click generate
  3. Get structured output in seconds

That’s it.


What surprised me

I expected it to work mostly on product images.

But it actually handles:

  • UI screenshots
  • random designs
  • mixed content

much better than I thought.


Still early

It’s definitely not perfect.

Some outputs are great, others still need improvement.

That’s why I’m sharing it — I want to see:

  • where it breaks
  • what people expect from it

Try it

👉 http://captio-d62hqol4e-adir-shohats-projects.vercel.app/


Would love feedback

If you try it, I’d really appreciate:

  • what worked well
  • what felt off
  • what you’d want it to do better

Thanks 🙏

Top comments (0)