Skip to content

An AI-powered onboarding assistant chatbot for Vyaguta, Leapfrog Technology’s Enterprise Resource Planning (ERP) system

Notifications You must be signed in to change notification settings

purnasth/genai-chatbot

Repository files navigation

Vyaguta AI Assistant

Vyaguta AI Assistant is an intelligent chatbot designed to help Leapfrog employees and new joiners quickly find information about Vyaguta’s modules, onboarding, policies, tools, and more. It leverages Retrieval-Augmented Generation (RAG), LangChain, and OpenAI’s LLMs to provide instant, context-aware answers from company documentation and knowledge bases.

53144_3aa9f665ec38d2b2c16cae4689619ddd5dd0d3b6d3f04f4d9795f087b25a6c6c

What is Vyaguta AI Assistant?

Vyaguta AI Assistant is your smart companion for all things Vyaguta and Leapfrog. It can:

  • Answer questions about Vyaguta modules (OKR, Pulse, Attendance, GAP, etc.)
  • Guide you through onboarding, policies, and company processes
  • Help you find team contacts, resources, and tools
  • Explain coding guidelines and best practices
  • Provide instant, reliable answers from internal docs and FAQs

How does it work? (Workflow Overview)

Vyaguta AI Assistant follows a Retrieval-Augmented Generation (RAG) workflow, combining company knowledge with advanced language models to deliver accurate, context-aware answers. Here’s how the system works:

1. Data Sources

  • Local Documents: Markdown files in the docs/ directory (policies, onboarding, guidelines, etc.)
  • Vyaguta API: Live employee and people data fetched from Vyaguta’s internal API
  • (Optional) Confluence: Company wiki pages (integration available, see guides)

2. Document Processing & Embeddings

  • Documents are loaded and split into chunks using markdown header-based splitting for fine-grained retrieval
  • Each chunk is embedded using OpenAI Embeddings (Ada-002)
  • All embeddings are stored in a FAISS vector database/ chromaDB for fast similarity search

3. Retrieval-Augmented Generation (RAG)

  • When a user asks a question, the system retrieves the most relevant document chunks using semantic search
  • A hybrid retriever with contextual compression ensures only the most relevant information is passed to the LLM

4. Large Language Model (LLM)

  • The retrieved context is sent to an OpenAI LLM (e.g., GPT-4.1-nano)
  • A custom prompt template ensures answers are tailored to Vyaguta and Leapfrog

5. Answer Delivery

  • The LLM generates a helpful, context-aware answer
  • The answer is displayed in a modern chat UI (Streamlit), with features like quick questions, reactions, and chat export

Visual Workflow

flowchart TD
    A[User Query] --> B[LangChain Orchestration]
    B --> C{RAG: Retrieve Relevant Docs}

    subgraph "Data Sources"
        D[Confluence Docs]
        E[Vyaguta APIs]
    end

    subgraph "Vector Database"
        H[Processed Document Chunks]
        I[Vector Embeddings]
        J[Metadata & Source Info]
    end

    D --> H
    E --> H
    H --> I

    C --> I
    I --> K[Context Retrieval]
    K --> L[LLM OpenAI/GPT]
    L --> M[Chatbot Response]
    B --> L
Loading
flowchart TD
    A[User Query] --> B[LangChain Orchestration]
    B --> C{RAG: Retrieve Relevant Docs}

    subgraph "Data Sources"
        D[Confluence Docs]
        E[Vyaguta APIs]
        F[Local Markdown Docs]
    end

    subgraph "Vector Databases"
        G[Processed Document Chunks]
        H[Vector Embeddings]
        I[FAISS Vector Store]
        J[ChromaDB Vector Store]
        K[Metadata & Source Info]
    end

    D --> G
    E --> G
    F --> G
    G --> H
    H --> I
    H --> J
    G --> K

    C --> I
    C --> J
    I --> L[Context Retrieval]
    J --> L
    L --> M[LLM OpenAI/GPT]
    M --> N[Chatbot Response]
    B --> M
Loading

For a detailed technical breakdown and architecture, see /guides/workflow-explanation.md.

Tech Stack

  • LLMs: OpenAI’s GPT models for natural language understanding and generation
  • RAG: Retrieval-Augmented Generation for context-aware answers
  • LangChain: For managing the workflow and integrating components
  • OpenAI: For semantic search and document retrieval
  • Data Storage: FAISS vector database/ chromaDB for fast similarity search
  • Frontend: Streamlit for modern, interactive UI
  • Backend: LangChain for orchestration, OpenAI for LLMs

For setup, usage, and advanced guides, see the /guides/ folder in this repository.

About

An AI-powered onboarding assistant chatbot for Vyaguta, Leapfrog Technology’s Enterprise Resource Planning (ERP) system

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published