Sota Image Captioning Model Kosmos-2 Added To Our Image Captioning Scripts Arsenal
You can download them at here : https://www.patreon.com/posts/90744385
The batch image captioning models we have right now as follows:
- CogVML with quantization 4-bit, 8-bit, 16-bit
- LLaVA including 34b with quantization such as 4-bit, 8-bit, 16-bit
- Blip2 Models
- Clip Vision Models
- Kosmos-2 Model
Kosmos-2 supports both single image captioning and also batch image captioning. I also did some research to find a good prompt.
1 click to install both on Windows, RunPod & Linux.
Generates its own venv so will never conflict with no any other app you have.
Here news about them : https://www.patreon.com/posts/sota-image-model-98499462
Top comments (0)