RAGgae

RAGgae is my first RAG system implementation.

The architecture diagram for the system is given as follows:

Here's the system overview:

Ingestion layer:
- Texts from three PDFs are read and RAW_KNOWLEDGE_BASE is created
- The texts are then chunked using Recursive Character Text Splitter
- Chunks produced are stored as FAISS index (vector database)
Retrieval layer:
- When the user queries, it's first tokenized and embedded using the same embedder model used in creating vector database
- Similarity search is performed in order to find the most relevant k-chunks
- On those retrieved chunks, reranking is performed in order to further improve the quality of retrieval and relevant chunk usage
Generation layer:
- Reranked chunks are then passed on to the LLM along with the original question, as a final prompt in order to generate the final answer.

Tools used in the system are:

PDF ingestion & text conversion: PyPDFLoader
EMBEDDER model: thenlper/gte-small
Chunking: Recursive Character Text splitter from huggingface.
Reranker: cross-encoder/ms-marco-MiniLM-L-6-v2
Reader model (LLM): HuggingFaceH4/zephyr-7b-beta

Evaluation

In order to see how the system performs in general, evaluation & benchmarking is also done. Two specific techniques are used, namely LLM-as-a-judge and RAGAS evaluation framework.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
app		app
best_prompts		best_prompts
chunking		chunking
docs		docs
experiment_notebooks		experiment_notebooks
files		files
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
chat_models.txt		chat_models.txt
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAGgae

Here's the system overview:

Tools used in the system are:

Evaluation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAGgae

Here's the system overview:

Tools used in the system are:

Evaluation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages