# COMPAS Recidivism Analysis

This repository contains code for analyzing the COMPAS recidivism dataset using various machine learning models. The analysis includes decision trees, XGBoost, and neural networks, along with model interpretability using LIME and SHAP.
## Installation

- Clone the repository:

```bash
git clone https://github.com/yourusername/compas-analysis.git
cd compas-analysis
```

- Create and activate a virtual environment:

```bash
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
```

- Install dependencies:

```bash
pip install -r requirements.txt
```

- Initialize DVC:

```bash
dvc init
dvc add data/  # If you have data files to track
```

## Project Structure

- `OAIP_Skeleton.ipynb`: Main notebook containing model training and evaluation
- `LIME_and_SHAP.ipynb`: Model interpretability analysis using LIME and SHAP
- `models/`: Directory containing saved models (tracked by DVC)
- `requirements.txt`: Project dependencies
- `.dvc/`: DVC configuration and cache
## Usage

- Run the main analysis notebook:

```bash
jupyter notebook OAIP_Skeleton.ipynb
```

- Run the interpretability analysis:

```bash
jupyter notebook LIME_and_SHAP.ipynb
```

The models will be automatically saved to your local `models/` directory and tracked by DVC.
## Models

Models are saved locally in the `models/` directory and tracked using DVC. Each model is saved with its metadata in JSON format. The following models are generated:

- Decision Tree: `models/decision_tree_model.joblib`
- XGBoost: `models/xgboost_model.json`
- Neural Network: `models/neural_network_model.keras`
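As an illustration, saving a model alongside its JSON metadata might look like the following. This is a minimal sketch using scikit-learn, joblib, and toy data; the metadata fields shown here are assumptions for illustration, since the actual schema is defined in the notebook.

```python
import json
from pathlib import Path

import joblib
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

# Toy stand-in for the COMPAS features (the notebook uses the real dataset).
X, y = make_classification(n_samples=200, n_features=5, random_state=42)

model = DecisionTreeClassifier(max_depth=4, random_state=42).fit(X, y)

models_dir = Path("models")
models_dir.mkdir(exist_ok=True)

# Save the fitted model under the filename from the list above.
joblib.dump(model, models_dir / "decision_tree_model.joblib")

# Hypothetical metadata schema -- the actual fields come from the notebook.
metadata = {
    "model_type": "DecisionTreeClassifier",
    "max_depth": 4,
    "train_accuracy": round(model.score(X, y), 4),
}
(models_dir / "decision_tree_metadata.json").write_text(json.dumps(metadata, indent=2))
```

Reloading later is symmetric: `joblib.load("models/decision_tree_model.joblib")` restores the fitted estimator, and the JSON file records how it was trained.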
## Model Interpretability

The `LIME_and_SHAP.ipynb` notebook provides detailed analysis of model predictions using:

- LIME (Local Interpretable Model-agnostic Explanations)
- SHAP (SHapley Additive exPlanations)
## Contributing

- Fork the repository
- Create a feature branch
- Commit your changes
- Push to the branch
- Create a Pull Request
## License

This project is licensed under the MIT License - see the LICENSE file for details.