This repository contains an end-to-end machine learning workflow using logistic regression to predict student placement outcomes based on CGPA and IQ. The project demonstrates the steps of data preprocessing, exploratory data analysis (EDA), feature selection, model training, evaluation, and deployment.
The dataset (placement.csv) contains the following features:
cgpa: Student's CGPA (float)iq: Student's IQ score (float)placement: Placement outcome (binary; 1 for placed, 0 for not placed)
The workflow in the Jupyter notebook (end_to_end_ml.ipynb) includes:
-
Data Loading & Inspection
- Load dataset using pandas
- Inspect data with
.head(),.info(),.shape()
-
Preprocessing
- Remove unnecessary columns
- Visualize data using matplotlib
-
Feature Selection
- Select relevant features (
cgpa,iq) as input (X) - Target variable:
placement(y)
- Select relevant features (
-
Train-Test Split
- Split data into training (90%) and testing (10%) sets
-
Feature Scaling
- Standardize features using
StandardScaler
- Standardize features using
-
Model Training
- Train a logistic regression model (scikit-learn)
-
Evaluation
- Assess model performance on the test set
-
Deployment
- Prepare the model for deployment (additional steps can be added)
- Python 3.x
- Jupyter Notebook
- pandas
- numpy
- matplotlib
- scikit-learn
-
Clone the repository:
git clone https://github.com/campusx-official/placement-project-logistic-regression.git cd placement-project-logistic-regression -
Open the notebook:
jupyter notebook end_to_end_ml.ipynb
-
Follow the steps in the notebook to run the workflow on the dataset.
├── end_to_end_ml.ipynb
├── placement.csv
└── README.md