This project demonstrates how to build a binary classifier using Logistic Regression in Python. The model predicts outcomes for a binary classification problem using features from a dataset.
- Python
- Scikit-learn
- Pandas
- Matplotlib
You can use any binary classification dataset for this task.
Example: Breast Cancer Wisconsin Dataset.
- Choose a binary classification dataset.
- Train/test split the dataset and standardize features.
- Fit a Logistic Regression model to the training data.
- Evaluate the model using:
- Confusion matrix
- Precision
- Recall
- ROC-AUC
- Tune the classification threshold and explain the sigmoid function.
- Binary classification
- Evaluation metrics (precision, recall, ROC-AUC)
- Sigmoid function and threshold tuning
- How does logistic regression differ from linear regression?
- What is the sigmoid function?
- What is precision vs recall?
- What is the ROC-AUC curve?
- What is the confusion matrix?
- What happens if classes are imbalanced?
- How do you choose the threshold?
- Can logistic regression be used for multi-class problems?
- Clone the repository:
git clone <repository-url>