DEV Community

Julien Simon
Julien Simon

Posted on • Originally published at julsimon.Medium on

Deploying Arcee SuperNova on AWS

In this video, you will learn about Arcee-SuperNova, a new 70B model built by Arcee.ai. It excels in instruction following and human preference scores, outperforming Llama 70B-Instruct, as well as Llama 405B, Claude Sonnet 3.5, and GPT-4o in many general benchmarks.

I’ll show you how to subscribe to SuperNova on the AWS Marketplace. Then, I’ll show how to deploy the model to a SageMaker endpoint running in your AWS account, and how to run inference using the Open Messages API.

⭐️⭐️⭐️ Don’t forget to subscribe to be notified of future videos. Follow me on Medium at https://julsimon.medium.com or Substack at https://julsimon.substack.com. ⭐️⭐️⭐️

Image of Timescale

🚀 pgai Vectorizer: SQLAlchemy and LiteLLM Make Vector Search Simple

We built pgai Vectorizer to simplify embedding management for AI applications—without needing a separate database or complex infrastructure. Since launch, developers have created over 3,000 vectorizers on Timescale Cloud, with many more self-hosted.

Read more

Top comments (0)

Billboard image

The Next Generation Developer Platform

Coherence is the first Platform-as-a-Service you can control. Unlike "black-box" platforms that are opinionated about the infra you can deploy, Coherence is powered by CNC, the open-source IaC framework, which offers limitless customization.

Learn more