DEV Community

Shi Ling
Shi Ling

Posted on

3

Featherless - running any llama model serverless

There's a lot of interesting AI models out there on hugging face, and sometimes I want to try them them. But I procrastinate, because honestly, it is a bit of a hassle to download and deploy the AI models.

Well, looks like some folks thought the same, and made a service that lets you run any model from huggingface serverless: Featherless.

There's like 492 LLAMA models right now, but looks like they plan to add more open source models every week.

I think it's really convenient that they let you chat with the model and preview how the AI would respond. It's hard to pick which ones you want when there's so many to pick, and I don't really want to download and deploy every single one of them to evaluate. Huggingface really should have had this feature.

Right, now to test some of these interesting roleplaying models, and form my DnD party. :D

Chatting with roleplay-llama-3

Image of Timescale

🚀 pgai Vectorizer: SQLAlchemy and LiteLLM Make Vector Search Simple

We built pgai Vectorizer to simplify embedding management for AI applications—without needing a separate database or complex infrastructure. Since launch, developers have created over 3,000 vectorizers on Timescale Cloud, with many more self-hosted.

Read more

Top comments (0)

Billboard image

The Next Generation Developer Platform

Coherence is the first Platform-as-a-Service you can control. Unlike "black-box" platforms that are opinionated about the infra you can deploy, Coherence is powered by CNC, the open-source IaC framework, which offers limitless customization.

Learn more