Mistral & vLLM

This guide demonstrates how to deploy a Retrieval-Augmented Generation (RAG) chatbot using Sentence-BERT (SBERT) and scikit-learn on to Defang.

Prerequisites

Set the Hugging Face Token

First, find a way to parse your information into sentences, for the correct format, please refer to rag_system.py for guidance.
Launch with Defang Compose

Run the following command to start the services:
```
defang compose up
```
The provided compose.yaml file includes the Mistral service. It's configured to run on an AWS instance with GPU support. The file also includes a UI service built with Next.js, utilizing Vercel's AI SDK.

Changing the content: The content for the bot is set in rag_system.py. You can edit the content there to change the behavior and information processed. Currently, the content is based off of the docs.md markdown file, which gives information about Defang.

Title: Scikit & SBERT & RAG

Short Description: A RAG chatbot using Scikit and SBERT trained on the defang documentation

Tags: RAG, Chatbot, SBERT, scikit-learn, Flask, AI, Docker, Python

Languages: python

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
templates		templates
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
app.py		app.py
compose.yaml		compose.yaml
docs.md		docs.md
rag_system.py		rag_system.py
requirements.txt		requirements.txt