Local RAG System with Weaviate, Ollama, and LlamaParse

This project demonstrates a complete local Retrieval-Augmented Generation (RAG) system. It leverages:

  • Weaviate (v4) as the vector database.
  • Ollama for the local embedding model (nomic-embed-text) and the generative model (llama3.2).
  • LlamaParse for intelligent document parsing.

This proof of concept is designed to process documents, chunk them while preserving page and structural information, store them in Weaviate, and then answer questions using the retrieved context.
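
As a rough illustration of the chunking step, a sketch along these lines keeps each chunk tied to its source page (the function name, field names, and sizes below are illustrative assumptions, not the exact code in main.py):

    def split_into_chunks(pages, chunk_size=1000, overlap=200):
        """Split (page_number, text) pairs into overlapping chunks that keep page metadata."""
        chunks = []
        for page_number, text in pages:
            start = 0
            while start < len(text):
                end = start + chunk_size
                chunks.append({"content": text[start:end], "page_number": page_number})
                start = end - overlap
        return chunks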


Prerequisites

Before you begin, ensure you have the following installed and running:

1. Docker

Docker or OrbStack is required to run the Weaviate vector database.

  • OrbStack is a lighter-weight alternative to Docker Desktop on macOS.
  • Make sure Docker (or OrbStack) is running before starting the Weaviate container.

2. Ollama

Ollama is used to run large language models locally for both embedding (vectorization) and generation.

  • Download and install Ollama for your machine.
  • Once installed, ensure Ollama is running.
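  • Optionally, pull the two models this project uses ahead of time so the first run does not wait on downloads:
    ollama pull nomic-embed-text
    ollama pull llama3.2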

3. LlamaParse API Key

LlamaParse is used for intelligent parsing of PDF documents, extracting text, tables, and other structured data while maintaining page integrity.

  • Obtain an API key from the Llama Cloud website.
  • Set your LlamaParse API key as an environment variable named LLAMAPARSE_API_KEY:
    export LLAMAPARSE_API_KEY="your_llamaparse_api_key_here"
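
For reference, a minimal LlamaParse call looks roughly like the sketch below; the result_type, file name, and per-page splitting shown here are illustrative and may differ from what main.py does:

    import os
    from llama_parse import LlamaParse

    # Parse a PDF with LlamaParse; the result is typically one document per page.
    parser = LlamaParse(
        api_key=os.environ["LLAMAPARSE_API_KEY"],
        result_type="markdown",  # "text" is also supported
    )
    documents = parser.load_data("reinsurance-agreement.pdf")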

Setup

  1. Clone the repository:

    git clone <repository_url>
    cd <repository_name>
  2. Create a Python virtual environment (recommended):

    python3 -m venv venv
    source venv/bin/activate  # On Windows: .\venv\Scripts\activate
  3. Install Python dependencies:

    pip install -r requirements.txt
  4. Start Weaviate using Docker Compose:

    docker compose up -d

    Wait a moment for the Weaviate container to fully start.
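
    You can confirm the instance is ready by polling Weaviate's readiness endpoint, which returns HTTP 200 once the node is up (this assumes the default port mapping of 8080; check docker-compose.yml for the actual ports):

    curl http://localhost:8080/v1/.well-known/ready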

Running the Application

  1. Ensure Ollama is running: The main.py script will attempt to pull the required models (nomic-embed-text and llama3.2) if they are not already downloaded.

  2. Run the main script:

    python main.py

    The script will:

    • Connect to Ollama and Weaviate.
    • Create the DocumentChunk collection in Weaviate (or use an existing one).
    • If no documents are found in Weaviate, it will process the example PDF (reinsurance-agreement.pdf) using LlamaParse, chunk it, and import it into Weaviate.
    • Start an interactive Q&A session.
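
For orientation, the connection, collection setup, and import in main.py will look broadly like the following sketch using the Weaviate v4 Python client and the ollama package; the property names and example chunk are assumptions for illustration:

    import ollama
    import weaviate
    import weaviate.classes.config as wc

    # Connect to the local Weaviate instance started by docker compose.
    client = weaviate.connect_to_local()

    # Create the DocumentChunk collection if it does not exist yet.
    if not client.collections.exists("DocumentChunk"):
        client.collections.create(
            "DocumentChunk",
            properties=[
                wc.Property(name="content", data_type=wc.DataType.TEXT),
                wc.Property(name="page_number", data_type=wc.DataType.INT),
            ],
        )

    # Embed a chunk with Ollama and store it together with its vector.
    chunks = client.collections.get("DocumentChunk")
    text = "Example chunk text"
    vector = ollama.embeddings(model="nomic-embed-text", prompt=text)["embedding"]
    chunks.data.insert(properties={"content": text, "page_number": 1}, vector=vector)

    client.close()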

Usage

Once the application is running, you can interact with it via the command line:

  • Ask a question at the prompt, for example:
    💬 Ask a question (or 'quit' to exit): What are the coverage limits?
    

To exit the interactive session, type quit, exit, or q.
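
Under the hood, each question is answered along these lines (a minimal sketch; the retrieval limit and prompt wording are assumptions, and main.py may differ):

    import ollama
    import weaviate

    client = weaviate.connect_to_local()
    chunks = client.collections.get("DocumentChunk")

    question = "What are the coverage limits?"

    # Embed the question and retrieve the most similar chunks.
    query_vector = ollama.embeddings(model="nomic-embed-text", prompt=question)["embedding"]
    results = chunks.query.near_vector(near_vector=query_vector, limit=5)
    context = "\n\n".join(obj.properties["content"] for obj in results.objects)

    # Ask llama3.2 to answer using only the retrieved context.
    answer = ollama.generate(
        model="llama3.2",
        prompt=f"Answer the question using only this context:\n{context}\n\nQuestion: {question}",
    )["response"]
    print(answer)

    client.close()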

Cleaning Up

To stop and remove the Weaviate container:

docker compose down

To remove the Ollama models (if desired):

ollama rm nomic-embed-text
ollama rm llama3.2
