PDPbox

python partial dependence plot toolbox

This repository is inspired by ICEbox. The goal is to visualize the impact of certain features towards model prediction for any supervised learning algorithm. (now support all scikit-learn algorithms)

The common problem

When using black box machine learning algorithms like random forest and boosting, it is hard to understand the relations between predictors and model outcome. For example, in terms of random forest, all we get is the feature importance. Although we can know which feature is significantly influencing the outcome based on the importance calculation, it really sucks that we don’t know in which direction it is influencing. And in most of the real cases, the effect is non-monotonic. We need some powerful tools to help understanding the complex relations between predictors and model prediction.
PDPbox aims to wrap up and enrich some useful functions mentioned in ICEbox in Python.

Highlight

Support one-hot encoding features.
For numeric features, create grids with percentile points.
Directly handle multiclass classifier.
Support two variable interaction plot.
Support actual prediction plot. (new)

Documentation

For details about the ideas, please refer to Introducing PDPbox.
For description about the functions and parameters, please refer to PDPbox functions and parameters.
For test and demo, please refer to https://github.com/SauceCat/PDPbox/tree/master/test.

Install PDPbox

git clone https://github.com/SauceCat/PDPbox.git
cd PDPbox
python setup.py install

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
images		images
pdpbox		pdpbox
test		test
README.md		README.md
parameter.md		parameter.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDPbox

The common problem

Highlight

Documentation

Install PDPbox

Examples

Binary feature: single variable plot with original points and individual lines

Binary feature: single variable plot with clustered individual lines

Binary feature: actual predictions plot for a single variable

Numeric feature: single variable plot with x_quantile=True, original points and individual lines

Numeric feature: single variable plot with percentile_range=(5, 95)

Numeric feature: single variable plot with customized grid points

Numeric feature: actual predictions plot for a single variable

Onehot encoding feature: single variable plot with individual lines and original points

Onehot encoding feature: single variable plot without centering the lines

Onehot encoding feature: actual predictions plot for a single variable

Multiclass: single variable plot with individual lines and original points

Interaction between two variables: the complete plot

Interaction between two variables: multiclass with only contour plots

About

Uh oh!

Releases

Packages

Languages

Gavin2318/PDPbox

Folders and files

Latest commit

History

Repository files navigation

PDPbox

The common problem

Highlight

Documentation

Install PDPbox

Examples

Binary feature: single variable plot with original points and individual lines

Binary feature: single variable plot with clustered individual lines

Binary feature: actual predictions plot for a single variable

Numeric feature: single variable plot with x_quantile=True, original points and individual lines

Numeric feature: single variable plot with percentile_range=(5, 95)

Numeric feature: single variable plot with customized grid points

Numeric feature: actual predictions plot for a single variable

Onehot encoding feature: single variable plot with individual lines and original points

Onehot encoding feature: single variable plot without centering the lines

Onehot encoding feature: actual predictions plot for a single variable

Multiclass: single variable plot with individual lines and original points

Interaction between two variables: the complete plot

Interaction between two variables: multiclass with only contour plots

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages