DEV Community

Julien Simon
Julien Simon

Posted on • Originally published at julsimon.Medium on

Two new models: Arcee-Spark and Arcee-Agent

Two new Arcee.ai model videos for you today ๐Ÿ˜€ As usual, notebooks are available. Have fun testing these models, and donโ€™t forget to follow Arcee.ai to stay on top of the latest Small Language Model action!

โšก Arcee-Spark

Arcee-Spark is an 8-billion parameter model created by Arcee.ai from Llama-3.1 8B. Spark outperforms its base model on many language and math benchmarks. In the video, I first run a quantized version locally. Then, I deploy the model on Amazon Web Services (AWS) with Amazon SageMaker. I run both synchronous and streaming inference. I also show you how to use the OpenAI Messages API, allowing you to invoke the model with the OpenAI prompting format and minimizing the migration cost. Finally, I compare ChatGPT and Arcee-Spark, and explain the benefits of open-source SLMs like Arcee-Spark compared to closed models.

โšก Arcee-Agent

Arcee-Agent is a 7-billion parameter model created by Arcee.ai from Qwen2โ€“7B. Arcee Agent is currently one of the top models for function calls and tool usage, outperforming several versions of GPT-3.5 and GPT-4o In the video, I build a financial agent able to invoke the Yahoo Finance API to answer questions on listed companies, such as โ€œwhatโ€™s the stock price?โ€, โ€œwhoโ€™s the CEOโ€, โ€œwhat is this company doing?โ€ and more.

Top comments (0)