model_selection

A set of Python 3 tools to compare the performance and metrics of different ML models

Features

Preprocessing functions to read in CSV data and manipulate it
Functions that return a number of essential metrics on model performance
Generates visualizations to better understand the model

Setup

To use these model selection tools, you'll need to:

Clone this repository:

$ git clone https://github.com/mmoderwell/model_selection.git
$ cd model_selection

Copy analysis.py to your project directory, install packages:

$ cp analysis.py ../path/to/project
$ pip3 install numpy matplotlib matplotlib_venn seaborn

## Using the functions

1) performance

    import analysis
    analysis.performance(estimated, actual, visualize=True, verbose=True):

Arguments: estimated: array of estimated output probabilities, actual: array of actual output classifications Optional: visualize (Bool), verbose (Bool)

Returns: returns accuracy, optionally prints other metrics and a performance visualization

2) distribution_metric

    import analysis
    analysis.distribution_metric(estimated, actual, precision=2, visualize=True, verbose=True):

Arguments: estimated: array of estimated output probabilities, actual: array of actual output classifications Optional: precision (int), visualize (Bool), verbose (Bool)

Returns: prints the calculated percentage of predictions outside of 1 standard deviation from the mean number of predictions at each probability, optionally draws a visualization

Example Notebook

Within /notebooks, you can try out these functions by running the analysis notebook. However, you should first install the extra pacakges.

  $ pip install -r requirements.txt

Authors

Matt Moderwell - Initial work - mmoderwell.com

Also see the list of contributors who participated in this project.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
datasets		datasets
img		img
notebooks		notebooks
README.md		README.md
analysis.py		analysis.py
model_utils.py		model_utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

model_selection

A set of Python 3 tools to compare the performance and metrics of different ML models

Features

Setup

1) performance

2) distribution_metric

Example Notebook

Authors

About

Releases

Packages

Languages

mmoderwell/model_selection

Folders and files

Latest commit

History

Repository files navigation

model_selection

A set of Python 3 tools to compare the performance and metrics of different ML models

Features

Setup

1) performance

2) distribution_metric

Example Notebook

Authors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages