Vyaguta AI Assistant

Vyaguta AI Assistant is an intelligent chatbot designed to help Leapfrog employees and new joiners quickly find information about Vyaguta’s modules, onboarding, policies, tools, and more. It leverages Retrieval-Augmented Generation (RAG), LangChain, and OpenAI’s LLMs to provide instant, context-aware answers from company documentation and knowledge bases.

53144_3aa9f665ec38d2b2c16cae4689619ddd5dd0d3b6d3f04f4d9795f087b25a6c6c

What is Vyaguta AI Assistant?

Vyaguta AI Assistant is your smart companion for all things Vyaguta and Leapfrog. It can:

Answer questions about Vyaguta modules (OKR, Pulse, Attendance, GAP, etc.)
Guide you through onboarding, policies, and company processes
Help you find team contacts, resources, and tools
Explain coding guidelines and best practices
Provide instant, reliable answers from internal docs and FAQs

How does it work? (Workflow Overview)

Vyaguta AI Assistant follows a Retrieval-Augmented Generation (RAG) workflow, combining company knowledge with advanced language models to deliver accurate, context-aware answers. Here’s how the system works:

1. Data Sources

Local Documents: Markdown files in the docs/ directory (policies, onboarding, guidelines, etc.)
Vyaguta API: Live employee and people data fetched from Vyaguta’s internal API
(Optional) Confluence: Company wiki pages (integration available, see guides)

2. Document Processing & Embeddings

Documents are loaded and split into chunks using markdown header-based splitting for fine-grained retrieval
Each chunk is embedded using OpenAI Embeddings (Ada-002)
All embeddings are stored in a FAISS vector database/ chromaDB for fast similarity search

3. Retrieval-Augmented Generation (RAG)

When a user asks a question, the system retrieves the most relevant document chunks using semantic search
A hybrid retriever with contextual compression ensures only the most relevant information is passed to the LLM

4. Large Language Model (LLM)

The retrieved context is sent to an OpenAI LLM (e.g., GPT-4.1-nano)
A custom prompt template ensures answers are tailored to Vyaguta and Leapfrog

5. Answer Delivery

The LLM generates a helpful, context-aware answer
The answer is displayed in a modern chat UI (Streamlit), with features like quick questions, reactions, and chat export

Visual Workflow

flowchart TD
    A[User Query] --> B[LangChain Orchestration]
    B --> C{RAG: Retrieve Relevant Docs}

    subgraph "Data Sources"
        D[Confluence Docs]
        E[Vyaguta APIs]
    end

    subgraph "Vector Database"
        H[Processed Document Chunks]
        I[Vector Embeddings]
        J[Metadata & Source Info]
    end

    D --> H
    E --> H
    H --> I

    C --> I
    I --> K[Context Retrieval]
    K --> L[LLM OpenAI/GPT]
    L --> M[Chatbot Response]
    B --> L

flowchart TD
    A[User Query] --> B[LangChain Orchestration]
    B --> C{RAG: Retrieve Relevant Docs}

    subgraph "Data Sources"
        D[Confluence Docs]
        E[Vyaguta APIs]
        F[Local Markdown Docs]
    end

    subgraph "Vector Databases"
        G[Processed Document Chunks]
        H[Vector Embeddings]
        I[FAISS Vector Store]
        J[ChromaDB Vector Store]
        K[Metadata & Source Info]
    end

    D --> G
    E --> G
    F --> G
    G --> H
    H --> I
    H --> J
    G --> K

    C --> I
    C --> J
    I --> L[Context Retrieval]
    J --> L
    L --> M[LLM OpenAI/GPT]
    M --> N[Chatbot Response]
    B --> M

For a detailed technical breakdown and architecture, see /guides/workflow-explanation.md.

Tech Stack

LLMs: OpenAI’s GPT models for natural language understanding and generation
RAG: Retrieval-Augmented Generation for context-aware answers
LangChain: For managing the workflow and integrating components
OpenAI: For semantic search and document retrieval
Data Storage: FAISS vector database/ chromaDB for fast similarity search
Frontend: Streamlit for modern, interactive UI
Backend: LangChain for orchestration, OpenAI for LLMs

For setup, usage, and advanced guides, see the /guides/ folder in this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
assets		assets
guides		guides
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
auth.py		auth.py
chatbot_gui.py		chatbot_gui.py
config.py		config.py
confluence_fetch.py		confluence_fetch.py
docs_consolidated.pkl		docs_consolidated.pkl
fetch_and_store_people_data.py		fetch_and_store_people_data.py
inspect_chromadb.py		inspect_chromadb.py
log_utils.py		log_utils.py
main.py		main.py
people.py		people.py
pyproject.toml		pyproject.toml
rag_pipeline.py		rag_pipeline.py
rebuild_rag_pipeline.py		rebuild_rag_pipeline.py
requirements.txt		requirements.txt
streamlit.css		streamlit.css
watch_main.py		watch_main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Vyaguta AI Assistant

What is Vyaguta AI Assistant?

How does it work? (Workflow Overview)

1. Data Sources

2. Document Processing & Embeddings

3. Retrieval-Augmented Generation (RAG)

4. Large Language Model (LLM)

5. Answer Delivery

Visual Workflow

Tech Stack

About

Uh oh!

Releases

Packages

Languages

purnasth/genai-chatbot

Folders and files

Latest commit

History

Repository files navigation

Vyaguta AI Assistant

What is Vyaguta AI Assistant?

How does it work? (Workflow Overview)

1. Data Sources

2. Document Processing & Embeddings

3. Retrieval-Augmented Generation (RAG)

4. Large Language Model (LLM)

5. Answer Delivery

Visual Workflow

Tech Stack

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages