DEV Community

Julien Simon
Julien Simon

Posted on • Originally published at julsimon.Medium on

Video: Llama 3 on Amazon SageMaker

In this video, I walk you through the simple process of deploying a Llama 3 8B model with Amazon SageMaker.

I use the latest version of the Text Generation Inference containers (TGI 2.0), and show you how to run synchronous inference and streaming inference.

Top comments (0)

The Most Contextual AI Development Assistant

Pieces.app image

Our centralized storage agent works on-device, unifying various developer tools to proactively capture and enrich useful materials, streamline collaboration, and solve complex problems through a contextual understanding of your unique workflow.

👥 Ideal for solo developers, teams, and cross-company projects

Learn more

👋 Kindness is contagious

Please leave a ❤️ or a friendly comment on this post if you found it helpful!

Okay