DEV Community

Cover image for Sota Image Captioning Model Kosmos-2 Added To Our Image Captioning Scripts Arsenal
Furkan Gözükara
Furkan Gözükara

Posted on

Sota Image Captioning Model Kosmos-2 Added To Our Image Captioning Scripts Arsenal

Sota Image Captioning Model Kosmos-2 Added To Our Image Captioning Scripts Arsenal

You can download them at here : https://www.patreon.com/posts/90744385

The batch image captioning models we have right now as follows:

  • CogVML with quantization 4-bit, 8-bit, 16-bit
  • LLaVA including 34b with quantization such as 4-bit, 8-bit, 16-bit
  • Blip2 Models
  • Clip Vision Models
  • Kosmos-2 Model

Kosmos-2 supports both single image captioning and also batch image captioning. I also did some research to find a good prompt.

1 click to install both on Windows, RunPod & Linux.

Generates its own venv so will never conflict with no any other app you have.

Here news about them : https://www.patreon.com/posts/sota-image-model-98499462

Top comments (0)