MCQ Generator

An AI-powered application that generates Multiple Choice Questions (MCQs) from uploaded documents. It uses fine-tuned T5 models and Sense2Vec to extract context and generate relevant questions, correct answers, and plausible distractors.

📋 Features

- Generate high-quality MCQs from various document formats (PDF, DOCX, TXT, images)
- Automatic text extraction with OCR fallback for scanned documents
- Special handling for book-like documents with chapter detection
- Mobile-friendly React Native frontend
- FastAPI backend with WebSocket support for real-time progress updates
- Customizable number of questions to generate
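
The OCR fallback listed above could work roughly as follows. This is a minimal sketch, not the code in fileprocessor.py: the function names, the 50-character threshold, and the 200 DPI render setting are assumptions made for illustration. It uses PyMuPDF and pytesseract, imported lazily so that the `needs_ocr` helper works on its own.

```python
import io


def needs_ocr(extracted_text, min_chars=50):
    """Return True when a page has too little embedded text to trust.

    The 50-character threshold is an illustrative assumption.
    """
    return len(extracted_text.strip()) < min_chars


def extract_pdf_text(path):
    """Extract text page by page, running OCR only on pages that look scanned."""
    # Imported here so that needs_ocr() works without these packages installed.
    import fitz  # PyMuPDF
    import pytesseract
    from PIL import Image

    pages = []
    with fitz.open(path) as doc:
        for page in doc:
            text = page.get_text()
            if needs_ocr(text):
                # Render the page to an image and let Tesseract read it.
                pixmap = page.get_pixmap(dpi=200)
                image = Image.open(io.BytesIO(pixmap.tobytes("png")))
                text = pytesseract.image_to_string(image)
            pages.append(text)
    return "\n".join(pages)
```

Checking each page separately means a mostly digital PDF with a few scanned pages only pays the OCR cost for those pages.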

🏗️ Project Structure

```
├── api.py                 # FastAPI server implementation
├── mcq_generator.py       # Core MCQ generation logic
├── fileprocessor.py       # Document processing utilities
├── test.py                # Test script for the MCQ generator
├── frontend1/             # React Native mobile app
├── qa/                    # Question generation model (T5-based)
├── distractor/            # Distractor generation model
└── s2v_old/               # Sense2Vec model for semantic processing
```

🚀 Installation

Prerequisites

- Python 3.8+
- Node.js and npm (for frontend)
- Expo CLI (for mobile app development)

Backend Setup

Clone the repository:

```
git clone https://github.com/yourusername/mcq_generator_final.git
cd mcq_generator_final
```

Create and activate a virtual environment:

```
python -m venv myvenv
source myvenv/bin/activate  # On Windows, use: myvenv\Scripts\activate
```

Install dependencies:
```
pip install -r requirements.txt
```
Download the models:
- Question Generation Model (1.2GB) - Extract to qa directory
- Distractor Generation Model (1.5GB) - Extract to distractor directory
- Sense2Vec Model (573MB) - Extract to s2v_old directory
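
Before starting the server, it can help to confirm that the three model directories were extracted to the right place. The script below is a small sketch for that check. The directory names come from the project structure, while the function name `missing_model_dirs` is only illustrative. It checks that each directory exists and is not empty, not that the files inside are valid.

```python
from pathlib import Path

# Directory names taken from the project structure above.
MODEL_DIRS = ("qa", "distractor", "s2v_old")


def missing_model_dirs(root="."):
    """Return the model directories that are absent or empty under root."""
    missing = []
    for name in MODEL_DIRS:
        directory = Path(root) / name
        if not directory.is_dir() or not any(directory.iterdir()):
            missing.append(name)
    return missing


if __name__ == "__main__":
    problems = missing_model_dirs()
    if problems:
        print("Missing or empty model directories:", ", ".join(problems))
    else:
        print("All model directories are in place.")
```

Run it from the repository root after extracting the archives.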

Frontend Setup

Navigate to the frontend directory:
```
cd frontend1
```
Install dependencies:
```
npm install
```
Update the server URL in frontend1/services/apiService.js to point to your backend server.

🔌 Running the Application

Starting the Backend

```
uvicorn api:app --host 0.0.0.0 --port 8000
```

Starting the Frontend (Development Mode)

```
cd frontend1
expo start
```

Connect using the Expo Go app on your mobile device or use an emulator.

📚 Usage

- Launch the application on your device
- Tap "Upload Document" to select a file (PDF, DOCX, TXT, or image)
- Choose whether the document is a regular document or a book
- For books, you can choose to process the entire book or select specific chapters
- Set the number of questions to generate
- Review and save the generated MCQs
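
Selecting specific chapters depends on finding chapter boundaries in the extracted text. One simple way to do that is a regular expression over heading lines, sketched below. This is an assumption about the general approach, not the detection logic the project uses, and the names `CHAPTER_HEADING` and `split_into_chapters` are hypothetical.

```python
import re

# Matches heading lines such as "Chapter 1", "CHAPTER 12: Title" or "Chapter IV".
CHAPTER_HEADING = re.compile(
    r"^[ \t]*chapter[ \t]+(\d+|[ivxlcdm]+)\b.*$", re.IGNORECASE | re.MULTILINE
)


def split_into_chapters(text):
    """Return a list of (heading, body) pairs, one per detected chapter.

    Text that appears before the first heading is not included.
    """
    matches = list(CHAPTER_HEADING.finditer(text))
    chapters = []
    for index, match in enumerate(matches):
        body_start = match.end()
        if index + 1 < len(matches):
            body_end = matches[index + 1].start()
        else:
            body_end = len(text)
        chapters.append((match.group(0).strip(), text[body_start:body_end].strip()))
    return chapters
```

Books that use other heading styles (for example "Part One" or bare numbers) would need a different pattern.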

Command Line Usage

You can also generate MCQs using the command line:

```
python test.py --input_file path/to/your/document.pdf --num_questions 10
```
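
To process a whole folder of documents, the same command can be driven from a short script. The sketch below only uses the two flags shown above. The helper names are hypothetical, and it assumes test.py exits with a non-zero status when generation fails, which has not been verified.

```python
import subprocess
import sys
from pathlib import Path


def build_command(input_file, num_questions=10):
    """Build the test.py command line shown above for one document."""
    return [
        sys.executable,
        "test.py",
        "--input_file",
        str(input_file),
        "--num_questions",
        str(num_questions),
    ]


def run_batch(folder, num_questions=10):
    """Run test.py once per PDF in folder and return the files that failed."""
    failed = []
    for pdf in sorted(Path(folder).glob("*.pdf")):
        result = subprocess.run(build_command(pdf, num_questions))
        if result.returncode != 0:
            failed.append(pdf.name)
    return failed
```

Call `run_batch("path/to/folder")` from the repository root so that test.py is found.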

🧠 Model Information

Question Generation Model

- Based on T5 transformer architecture
- Fine-tuned on SQuAD and RACE datasets
- Optimized for creating context-relevant questions

Distractor Generation Model

- Specialized T5-based model for generating plausible incorrect options
- Trained to create distractors that are semantically related but factually incorrect

Sense2Vec Model

- Used for semantic understanding and relationship processing
- Helps in creating better distractors by identifying semantic relationships
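
As an illustration of how Sense2Vec neighbours can be turned into distractor candidates, here is a minimal sketch. It is not the project's implementation: the function names, the `limit` parameter, and the over-fetch factor of 4 are assumptions. It uses the sense2vec package's `from_disk`, `get_best_sense`, and `most_similar` calls, and the import is deferred so that the cleaning helper runs without the model.

```python
def clean_candidates(answer, keys, limit=3):
    """Turn Sense2Vec keys such as "deep_learning|NOUN" into distractor strings."""
    seen = {answer.strip().lower()}
    distractors = []
    for key in keys:
        if len(distractors) >= limit:
            break
        # Drop the "|SENSE" suffix and restore spaces.
        phrase = key.rsplit("|", 1)[0].replace("_", " ").strip()
        if phrase.lower() in seen:
            continue  # skip the answer itself and case-insensitive duplicates
        seen.add(phrase.lower())
        distractors.append(phrase)
    return distractors


def sense2vec_distractors(answer, model_dir="s2v_old", limit=3):
    """Look up neighbours of answer in a Sense2Vec model loaded from disk."""
    from sense2vec import Sense2Vec  # imported lazily; needs the model files

    s2v = Sense2Vec().from_disk(model_dir)
    sense = s2v.get_best_sense(answer.strip().replace(" ", "_"))
    if sense is None:
        return []  # the phrase is not in the model's vocabulary
    # Fetch extra neighbours because some are dropped as duplicates.
    neighbours = s2v.most_similar(sense, n=limit * 4)
    return clean_candidates(answer, [key for key, _score in neighbours], limit)
```

Filtering out the answer and near-duplicate spellings matters because the nearest neighbours of a phrase are often case or plural variants of the phrase itself.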

💻 API Reference

The system exposes several API endpoints:

- POST /process-file - Process a document and extract text
- POST /generate-mcqs-from-file - Generate MCQs from a file
- POST /generate-mcqs - Generate MCQs from raw text
- WebSocket /api/ws/{client_id} - Real-time processing updates
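
A client can follow the progress updates by opening the WebSocket endpoint listed above. The sketch below uses the third-party `websockets` package. The host default, the randomly generated `client_id`, and the function names are assumptions, and the message format is not documented here, so messages are printed as received.

```python
import uuid


def progress_url(host, client_id):
    """Build the WebSocket URL for the /api/ws/{client_id} endpoint."""
    return f"ws://{host}/api/ws/{client_id}"


async def watch_progress(host="localhost:8000", client_id=None):
    """Print every message the server sends until it closes the connection."""
    import websockets  # third-party: pip install websockets

    client_id = client_id or uuid.uuid4().hex
    async with websockets.connect(progress_url(host, client_id)) as socket:
        async for message in socket:
            print(message)  # printed as-is; the payload format may differ
```

To try it against a running backend, call `asyncio.run(watch_progress())` after importing `asyncio`.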

Example API Usage

```
import requests

# Generate MCQs from text
response = requests.post(
    "http://localhost:8000/generate-mcqs",
    json={
        "text": "Your document text here...",
        "num_questions": 5
    }
)

mcqs = response.json()
print(mcqs)
```
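
The file-based endpoint can be called in a similar way. The sketch below is hedged: the multipart field name `file`, the form field `num_questions`, and the 300-second timeout are assumptions, so check api.py for the names the server actually expects. The `post` parameter exists only so the function can be tried without a running server.

```python
from pathlib import Path


def generate_from_file(path, num_questions=5, base_url="http://localhost:8000", post=None):
    """Upload a document to /generate-mcqs-from-file and return the parsed JSON.

    The field names "file" and "num_questions" are assumptions.
    """
    if post is None:
        import requests  # same library as the example above

        post = requests.post

    with open(path, "rb") as handle:
        response = post(
            f"{base_url}/generate-mcqs-from-file",
            files={"file": (Path(path).name, handle)},
            data={"num_questions": num_questions},
            timeout=300,  # generation can be slow; the value is a guess
        )
    response.raise_for_status()  # surface HTTP errors instead of parsing them
    return response.json()
```

For example, `generate_from_file("path/to/your/document.pdf", num_questions=10)` would upload the file and return whatever JSON the server responds with.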

📱 Mobile App Features

- Document upload from device storage
- Camera capture for scanning documents
- Real-time progress tracking during MCQ generation
- Save and share generated MCQs
- Offline mode for viewing previously generated MCQs

🔧 Troubleshooting

- Model loading issues: Ensure all model files are correctly placed in their respective directories
- Memory errors: For large documents, try processing in smaller chunks
- OCR problems: For poor quality scans, try improving the image quality before upload
- Backend connection issues: Verify the API URL in the frontend configuration
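
For the memory errors mentioned above, one way to process a large document in smaller chunks is to split the extracted text by word count before sending it for generation. This is a minimal sketch: the function name and the default sizes are illustrative and have not been tuned for the models in this project.

```python
def chunk_text(text, max_words=400, overlap=40):
    """Split text into word-based chunks that overlap slightly.

    The overlap keeps sentences that straddle a boundary available to both
    neighbouring chunks. Both sizes are illustrative defaults.
    """
    if max_words <= 0 or not 0 <= overlap < max_words:
        raise ValueError("need max_words > 0 and 0 <= overlap < max_words")
    words = text.split()
    step = max_words - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break  # the last chunk already reaches the end of the text
    return chunks
```

Each chunk could then be sent to POST /generate-mcqs with a small num_questions value and the results merged afterwards.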

🛠️ Technologies Used

- Backend: Python, FastAPI, WebSockets, PyTorch, Transformers, spaCy
- Frontend: React Native, Expo, JavaScript
- Models: T5 (fine-tuned), Sense2Vec
- Document Processing: PyMuPDF, PyPDF2, Tesseract OCR

🔜 Future Enhancements

- Subject-specific models for domains like medicine, law, etc.
- Difficulty level classification for questions
- Export options to various LMS formats
- Support for more document formats
- Enhanced UI for desktop environments

📝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

- Fork the repository
- Create your feature branch (git checkout -b feature/amazing-feature)
- Commit your changes (git commit -m 'Add some amazing feature')
- Push to the branch (git push origin feature/amazing-feature)
- Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgements

- The SQuAD dataset for question generation training
- The RACE dataset for MCQ format training
- Hugging Face for model hosting and libraries
- The open-source community for various tools and libraries used
