Retrieval Augmented Chatbot, part 2! LangChain, Hugging Face, Amazon SageMaker, and Amazon OpenSearch Serverless 😀

Julien Simon
Nov 2, 2023

We start by deploying Mistral 7B, a cutting-edge open-source LLM, onto a SageMaker endpoint. Following this, we work with the Reuters dataset, a Hugging Face dataset comprising 20,000 news articles. We break down these articles into smaller sections and apply bge-small, a compact open-source embedding model, to them. Next, we proceed to index these sections into an Amazon OpenSearch Serverless vector index, which we then query through LangChain. Additionally, aside from the RAG demonstration, we delve into some vital yet often overlooked steps related to authentication and security for OpenSearch Serverless.

Part 1: https://youtu.be/7kDaMz3Xnkw

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Julien Simon
Julien Simon

No responses yet

What are your thoughts?