This repository contains the material used for Machine Learning sessions conducted at Helpshift. Most of the content will be in the form of Jupyter notebooks & we will use Python 3.
Beginner level proficiency is enough to understand the concepts & material used in this repository/class. In case you haven't used Python recently, please refer to following chapters from A Byte of Python to understand the Python concepts which will be useful in understanding & exploring the code in this class/repository.
Also try to read the full book if possible.
- Exploratory Data Analysis using Covid19 Dataset
- Text classification using Naive Bayes
- Text classification using Logistic Regression
- Tuning Logistic Regression model using Regularization techniques
- Text classification using Ensemble Model
- Mini Project: Sentiment Analysis of IMDB Movie Reviews
-
Install Docker by following the instructions given for different platforms on this page.
-
You can either pull the existing docker image or build your own image
2.1 Using Docker
make pull
2.2 Build image
make build
-
Verify that docker image exists
docker images | grep ml_classroom
-
Start the Docker container
make run
-
Verify by visiting URL in browser.
-
Install Python3 using the instructions given on this page.
-
Create Virtual Env
virtualenv venv
-
Activate the Virtual Env
source venv/bin/activate
-
Install required dependencies
pip install -r requirements.txt
-
Launch Jupyter notebook
jupyter notebook --port 4545
-
Verify by visiting URL in browser.