Explainability-Dermatology

For further explanation of the methods and more examples of the results, see our Confluence page.

Black box machine learning models that cannot be understood by people, such as deep neural networks and large ensembles, are achieving impressive accuracy on various tasks. However, as machine learning is increasingly used to inform high-stakes decisions, explainability and interpretability of the models are becoming essential. The lack of understanding of how neural networks make predictions enables unpredictable or biased models, causing real harm to society and a loss of trust in AI-assisted systems. There are many ways to explain: data vs. model, directly interpretable vs. post hoc explanation, local vs. global, static vs. interactive; the appropriate choice depends on the persona of the consumer of the explanation.

For our particular project, our aim is twofold:

  • Increase trustworthiness by delivering better explanations to both physicians and patients.
  • Study biases to build more accurate and robust models.

This repository contains code for testing different local and global explanation methods using various PyTorch libraries and the ISIC 2020 dataset.

Methods

Global vs Local Explanations

Global explanations describe an entire model, whereas local explanations describe the prediction for a single sample.

Global directly interpretable models are important for personas that need to understand the entire decision-making process and ensure its safety, reliability, or compliance. Such personas include regulators and data scientists responsible for the deployment of systems. Global post hoc explanations are useful for decision-maker personas that are being supported by the machine learning model. Physicians, judges, and loan officers develop an overall understanding of how the model works, but there is necessarily a gap between the black box model and the explanation. A global post hoc explanation may therefore hide some safety issues, while allowing the underlying black box model to keep its favorable accuracy. Local explanations are the most useful for affected user personas such as patients, defendants, and applicants, who need to understand the decision on a single sample (their own).

LOCAL XAI

Attribution Methods

Attribution methods link a deep neural network's prediction to the input features that most influence that prediction. If the model makes a misprediction, we might want to know which features contributed to the misclassification. For image models, the attribution scores can be visualized as a grayscale image with the same dimensions as the original image, where brightness corresponds to the importance of each pixel.

  • Primary Attribution: Evaluates the contribution of each input feature to the output of a model.
  • Layer Attribution: Evaluates the contribution of each neuron in a given layer to the output of the model.
  • Neuron Attribution: Evaluates the contribution of each input feature to the activation of a particular hidden neuron.
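
As a minimal illustration of primary attribution, the sketch below computes a vanilla-gradient attribution map with plain PyTorch autograd and rescales it to [0, 1] so it can be displayed as a grayscale image. `model`, `image`, and `target_class` are placeholders, not code from this repository.

```python
# Minimal sketch: primary attribution via vanilla gradients (placeholder model/input).
import torch

def vanilla_gradient_attribution(model, image, target_class):
    """Return an (H, W) grayscale attribution map for a (1, 3, H, W) input."""
    model.eval()
    image = image.clone().requires_grad_(True)
    score = model(image)[0, target_class]        # logit of the class of interest
    score.backward()                             # d(score) / d(input pixels)
    attr = image.grad.abs().sum(dim=1)           # aggregate over colour channels
    attr = (attr - attr.min()) / (attr.max() - attr.min() + 1e-8)
    return attr[0]                               # brightness ~ pixel importance
```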

Libraries used:

PAIRML Saliency Methods

pip install saliency

The saliency library implements, among others, the techniques tested in PAIR_saliency.ipynb below: Vanilla Gradient, SmoothGrad, XRAI, and Guided Integrated Gradients.

PyTorch Grad-CAM - Class Activation Map methods implemented in PyTorch

pip install grad-cam

  • GradCAM (paper) - Weight the 2D activations by the average gradient
  • GradCAM++ (paper) - Like GradCAM but uses second order gradients
  • XGradCAM (paper) - Like GradCAM but scale the gradients by the normalized activations
  • AblationCAM (paper) - Zero out activations and measure how the output drops
  • ScoreCAM (paper) - Perturb the image by the scaled activations and measure how the output drops
  • EigenCAM (paper) - Takes the first principal component of the 2D activations (no class discrimination, but seems to give great results)
  • EigenGradCAM - Like EigenCAM but with class discrimination: first principal component of Activations*Grad. Looks like GradCAM, but cleaner
  • LayerCAM (paper) - Spatially weight the activations by positive gradients. Works especially well in lower layers
  • FullGrad (paper) - Computes the gradients of the biases from all over the network, and then sums them
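
A hedged sketch of how these CAM variants can be applied with the pytorch-grad-cam library follows; the ResNet-50 backbone, the target layer, and the class index are stand-ins, not the models used in this repository. Swapping `GradCAM` for `GradCAMPlusPlus`, `ScoreCAM`, `EigenCAM`, etc. only changes the imported class.

```python
# Sketch: Grad-CAM heatmap with pytorch-grad-cam (pip install grad-cam).
import numpy as np
import torch
from torchvision.models import resnet50
from pytorch_grad_cam import GradCAM
from pytorch_grad_cam.utils.model_targets import ClassifierOutputTarget
from pytorch_grad_cam.utils.image import show_cam_on_image

model = resnet50(pretrained=True).eval()            # stand-in backbone
target_layers = [model.layer4[-1]]                  # last conv block; adjust per model

rgb_img = np.random.rand(224, 224, 3).astype(np.float32)   # stand-in lesion image in [0, 1]
input_tensor = torch.from_numpy(rgb_img).permute(2, 0, 1).unsqueeze(0)

cam = GradCAM(model=model, target_layers=target_layers)
grayscale_cam = cam(input_tensor=input_tensor,
                    targets=[ClassifierOutputTarget(1)])    # assumed "melanoma" index
overlay = show_cam_on_image(rgb_img, grayscale_cam[0], use_rgb=True)
```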

Captum

pip install captum

  • Multimodality: Supports interpretability of models across modalities including vision, text, and more.

  • Supports most types of PyTorch models and can be used with minimal modification to the original neural network.

  • Easy to use and extensible: Open source, generic library for interpretability research. Easily implement and benchmark new algorithms.

  • Captum Insights: an interpretability visualization widget built on top of Captum
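
The sketch below ties Captum back to the three attribution levels described earlier, using a stand-in ResNet-18 rather than this repository's classifier; the class index and the neuron selector are arbitrary assumptions.

```python
# Sketch: the three attribution levels in Captum on a stand-in classifier.
import torch
from torchvision.models import resnet18
from captum.attr import IntegratedGradients, LayerGradCam, NeuronGradient

model = resnet18(pretrained=True).eval()
x = torch.rand(1, 3, 224, 224)          # stand-in input image
target = 1                              # assumed "melanoma" class index

# Primary attribution: contribution of each input pixel to the prediction.
ig_attr = IntegratedGradients(model).attribute(x, target=target)

# Layer attribution: contribution of each activation in a chosen layer.
gc_attr = LayerGradCam(model, model.layer4).attribute(x, target=target)

# Neuron attribution: contribution of each input pixel to one hidden neuron
# (here an arbitrary spatial unit in the output of layer4).
ng_attr = NeuronGradient(model, model.layer4).attribute(x, neuron_selector=(0, 3, 3))
```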

CODE

local_xai_attr.py: Uses the Captum and pytorch-grad-cam libraries to implement different attribution methods.

PAIR_saliency.ipynb: Uses PAIRML saliency library to test the following methods:

  • Vanilla Gradient
  • SmoothGrad
  • XRAI
  • Guided Integrated Gradients

XRAI Examples: Local methods are used to explore individual instances that are important for some reason, such as edge cases. The following images come from a different dataset that the model did not see during training. In this particular case, the XRAI method was used, as it was considered clearer and easier to interpret.

The last column shows the most salient n% (15% or 5%) of the image. It appears that asymmetry plays an important role in discerning between melanomas and non-melanomas. Rulers also seem to be somewhat important to the model, thereby introducing a bias.

[Example images: ex1, ex2, ex3]
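
The "most salient n%" view can be reproduced by thresholding the attribution map at the corresponding percentile. A small sketch, assuming an `xrai_attributions` array like the one returned above:

```python
# Keep only the top `percent`% most salient pixels; black out the rest.
import numpy as np

def top_percent_view(image, attributions, percent=15):
    """image: (H, W, 3) array; attributions: (H, W) array, e.g. from XRAI."""
    threshold = np.percentile(attributions, 100 - percent)
    mask = attributions >= threshold                  # boolean mask of salient pixels
    return image * mask[..., np.newaxis]

# e.g. salient_15 = top_percent_view(image, xrai_attributions, percent=15)
```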

The following two images showcase the strong bias of the model towards black frames. However, white frames, which are very rare in the dataset, are not a source of bias.

[Example images: ex4, ex5]

GLOBAL XAI

TCAV: Testing with Concept Activation Vectors

TCAV.ipynb - The notebook is self-contained.
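
For reference, the core of TCAV can be summarized in a few lines: fit a linear classifier between layer activations of concept images (e.g. "ruler") and random images, take its normal as the Concept Activation Vector, and report the fraction of class images whose logit gradient points along that direction. The sketch below uses random stand-in activations and gradients, not data from the notebook.

```python
# Sketch of the TCAV computation with random stand-in activations and gradients.
import numpy as np
from sklearn.linear_model import LogisticRegression

# 1) Layer activations for concept images (e.g. "ruler") vs. random images.
concept_acts = np.random.randn(50, 512)
random_acts = np.random.randn(50, 512)

# 2) The CAV is the normal of a linear classifier separating the two sets.
X = np.vstack([concept_acts, random_acts])
y = np.array([1] * len(concept_acts) + [0] * len(random_acts))
cav = LogisticRegression(max_iter=1000).fit(X, y).coef_[0]
cav /= np.linalg.norm(cav)

# 3) Directional derivative of the class logit w.r.t. the layer activations,
#    taken along the CAV, for every image of the class of interest.
grads_wrt_layer = np.random.randn(200, 512)   # stand-in: d(logit_melanoma)/d(activations)
sensitivities = grads_wrt_layer @ cav

# 4) TCAV score: fraction of class images with positive sensitivity.
tcav_score = float((sensitivities > 0).mean())
```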

Projection of XRAI

XRAI_global_projection.py

  1. Apply XRAI to 6k images from ISIC and save the masked images
  2. Apply EffNetB2 as a feature extractor → 500D vectors
  3. Project with UMAP
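
A hedged sketch of steps 2 and 3: the timm model name, the random stand-in images, and the UMAP settings are assumptions, not the repository's exact configuration.

```python
# Sketch: features from a headless EfficientNet-B2, projected to 2D with UMAP.
import torch
import timm
import umap

# num_classes=0 removes the classification head so the model returns pooled features.
model = timm.create_model("efficientnet_b2", pretrained=True, num_classes=0).eval()

masked_images = torch.rand(16, 3, 260, 260)      # stand-in for the XRAI-masked images
with torch.no_grad():
    features = model(masked_images).numpy()      # (N, feature_dim) vectors

embedding = umap.UMAP(n_components=2, random_state=42).fit_transform(features)
# `embedding` is an (N, 2) array that can be scatter-plotted and inspected for clusters.
```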

We observe two clear clusters, corresponding to frames and rulers, as well as one region with thick hair (see the UMAP visualization).

Acknowledgements

The project was developed during the first rotation of the Eye for AI Program at the AI Competence Center of Sahlgrenska University Hospital. The Eye for AI initiative is a global program focused on bringing more international talent into the Swedish AI landscape.
