Welcome to ARYA, the official chatbot for Arya Bhatt Hostel. This chatbot is designed to help students and hostel residents with information related to hostel rules, facilities, and other queries. It uses Pinecone for vector search and Hugging Face for language model inference.
- Provides accurate and concise answers about hostel facilities and rules.
- Friendly, professional responses.
- Real-time question processing.
- Simple and easy-to-use interface.
- Streamlit: For building the front-end of the chatbot.
- Pinecone: For vector-based document retrieval.
- Hugging Face: For language model and text generation.
- LangChain: To manage LLM chains and document search.
- Python: The core language used.
git clone https://github.com/zenitsu0509/ARYA_Chatbot.git
cd ARYA_Chatbot
Ensure that you have Python 3.8+ installed. You can install the required Python packages using:
pip install -r requirements.txt
You will need to configure environment variables to access the necessary APIs. Create a .env
file in the project directory and add the following keys:
PINECONE_API_KEY=<your-pinecone-api-key>
PINECONE_ENV=<your-pinecone-environment>
HUGGING_FACE_API=<your-huggingface-api-key>
Make sure you have a Pinecone index created and properly configured with the documents for Arya Bhatt Hostel information.
You can start the Streamlit app using the following command:
streamlit run app.py
The application will launch locally, and you can access the chatbot via your browser at http://localhost:8501
.
- Pinecone Vector Store: Retrieves relevant information from the hostel knowledge base using vector embeddings.
- Hugging Face LLM: Generates human-like responses based on the retrieved context and the user's question.
- Streamlit Interface: Provides an intuitive and responsive front-end for users to interact with the bot.
arya-hostel-chatbot/
│
├── app.py # Main application file
├── config.py # Configuration loader for API keys and settings
├── requirements.txt # Python dependencies
├── .env # Environment variables file
├── README.md # Project documentation
└── .gitignore # Files to be ignored in version control
- setup_pinecone: Initializes the Pinecone vector store for document search.
- setup_llm: Configures the Hugging Face language model used for generating responses.
- create_qa_chain: Combines the vector store with the LLM to create the question-answering logic.
- main: Manages the Streamlit interface and user interactions.
You can modify the following parameters to customize ARYA:
- Language Model: Change the Hugging Face model by updating the
repo_id
in thesetup_llm
function. - Index Name: Change the Pinecone index by modifying the
index_name
insetup_pinecone
.
- Admin Interface: Allow hostel admins to update or add new knowledge base entries.
- Multilingual Support: Support for other languages to assist international students.
If you'd like to contribute to the development of this project:
- Fork the repository.
- Create a new branch for your feature/bugfix.
- Submit a pull request with a detailed explanation of your changes.
This project is licensed under the MIT License. See the LICENSE file for details.
💻 Developed and maintained by Himanshu Gangwar
🔄 Last updated: October 2024