ngram-recommender

The script takes two corpora (output_1.csv and output_2.csv for the student model and training.txt for the teacher model, output_1.csv and output_2.csv are automatically combined) of Java methods as input and automatically identifies the best-performing model based on a specific N-value. It then evaluates the selected model on the test set extracted from output_*.csv. Since the training corpus differs from both the instructor-provided dataset and our own dataset, we store the results in a file named results_[student/teacher]_model.json to distinguish them accordingly.

Installation:

Install python 3.9+ locally
Clone the repository to your workspace:

~ $ git clone https://github.com/WhittJS/ngram-recommender.git

Navigate into the repository:

~ $ cd ngram-recommender
~/ngram-recommender $

Set up a virtual environment and activate it:

~/ngram-recommender $ python -m venv ./venv/

For macOS/Linux:

~/ngram-recommender $ source venv/bin/activate
(venv) ~/ngram-recommender $

For Windows:

~\ngram-recommender $ .\venv\Scripts\activate.bat
(venv) ~\ngram-recommender $

To install the required packages:

(venv) ~/ngram-recommender $ pip install -r requirements.txt

Running the Program

Generate new JSON files based on student_model.pkl and teacher_model.pkl:

python ngram_recommender.py

To retrain either model, delete the file of the one you want to train and rerun the above command.
- Edit the min_ngram and max_ngram values in the train_test_model function to train on ngrams within specified parameters.

Report

The assignment report is available in the file Assignment_Report.pdf.

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
__pycache__		__pycache__
.gitattributes		.gitattributes
.gitignore		.gitignore
Assignment_Report.pdf		Assignment_Report.pdf
README.md		README.md
git_repos.csv		git_repos.csv
ngram_recommender.py		ngram_recommender.py
output_1.csv		output_1.csv
output_2.csv		output_2.csv
pydriller_windows.py		pydriller_windows.py
requirements.txt		requirements.txt
results_student_model.json		results_student_model.json
results_teacher_model.json		results_teacher_model.json
student_model.pkl		student_model.pkl
teacher_model.pkl		teacher_model.pkl
training.txt		training.txt
writeup.md		writeup.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ngram-recommender

Installation:

Running the Program

Report

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ngram-recommender

Installation:

Running the Program

Report

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages