DEV Community

Julien Simon

Posted on • Originally published at julsimon.Medium on

Retrieval Augmented Chatbot, part 2!

LangChain, Hugging Face, Amazon SageMaker, and Amazon OpenSearch Serverless 😀

We start by deploying Mistral 7B, an open-source LLM, on a SageMaker endpoint. We then work with the Reuters dataset, a Hugging Face dataset of 20,000 news articles: we split the articles into smaller sections and embed them with bge-small, a compact open-source embedding model. Next, we index these sections in an Amazon OpenSearch Serverless vector index, which we query through LangChain. Beyond the RAG demonstration itself, we also cover some essential yet often overlooked authentication and security steps for OpenSearch Serverless.
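To make the flow concrete, here is a minimal, dependency-free sketch of the same retrieval steps: split articles into smaller pieces, embed them, index them, then query the index and build a prompt. It is not the code from the demo. The bag-of-words `toy_embed` function is only a stand-in for bge-small, the in-memory `ToyVectorIndex` is only a stand-in for an OpenSearch Serverless vector index, and every name, chunk size, and sample sentence below is a hypothetical chosen for illustration.

```python
import math
import re
from collections import Counter


def split_into_chunks(text, chunk_size=40, overlap=10):
    """Split text into chunks of `chunk_size` words; neighbours share `overlap` words."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last chunk already reaches the end of the text
    return chunks


def toy_embed(text):
    """Stand-in for a real embedding model (assumption): a sparse term-count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToyVectorIndex:
    """Stand-in for a vector index (assumption): a list scanned by brute force."""

    def __init__(self):
        self.entries = []  # list of (chunk_text, vector) pairs

    def add(self, chunks):
        for chunk in chunks:
            self.entries.append((chunk, toy_embed(chunk)))

    def search(self, query, k=2):
        query_vector = toy_embed(query)
        ranked = sorted(
            self.entries,
            key=lambda entry: cosine(query_vector, entry[1]),
            reverse=True,
        )
        return [chunk for chunk, _ in ranked[:k]]


def build_prompt(question, context_chunks):
    """Assemble the retrieved chunks and the question into a prompt for the LLM."""
    context = "\n".join(context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Hypothetical sample articles, not taken from the Reuters dataset.
articles = [
    "Oil prices rose sharply after the producers agreed to cut output.",
    "The central bank left interest rates unchanged this quarter.",
]
index = ToyVectorIndex()
for article in articles:
    index.add(split_into_chunks(article, chunk_size=8, overlap=2))

question = "What happened to oil prices?"
print(build_prompt(question, index.search(question, k=1)))
```

In a real setup, the embedding function would call the embedding model, the index would be a managed vector store, and the prompt would be sent to the LLM endpoint. The overlap between neighbouring chunks is a common design choice that keeps a sentence straddling a chunk boundary retrievable from at least one chunk.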

Part 1: https://youtu.be/7kDaMz3Xnkw
