Representational Difference Explanations (RDX)

Updates

Paper was accepted to NeurIPS 2025!
NLMCD, TopK-SAE, and USAE added as baseline options
- tested on mnist modification and cub experiments
Updated RDX algorithm (RDX v2) with classifier guidance, symmetrized distances, and filter thresholding. The original RDX algorithm is still available as an option (rdx_paper.py).
- Guidance - allows you to use a classifier to guide RDX clusters so they stay within prediction-coherent groups (e.g. true positives vs false positives).
- Symmetrized distances - Instead of symmetrizing the affinity matrix, we symmetrize the neighborhood distances. Reduces sensitivity to outlier points.
- Filter thresholding - Rather than filtering out the first cluster by default, we set a threshold and filter any clusters with a mean affinity below the threshold.
Added configs for classifier guided RDX on CLIP vs CLIP-iNat (Exp. 4a).
Added configs for RDXv2 on CLIP vs CLIP-iNat (Exp. 4a).
Added an interactive visualizer for easier exploration of RDX results. The visualizer can be generated for any new experiments.
Added experiments on BiomedCLIP and the RSNA Pneumonia dataset. There is a blog post with findings and an interactive visualizer available at: https://nkondapa.github.io/rdx-page/blog/rsna-biomedclip/.

Setup

conda create -n "RDX" python=3.10.15
conda activate RDX
bash setup.sh

Downloading Checkpoints

Download checkpoints for the MNIST experiments here: bash download_checkpoints.sh

Downloading Datasets

MNIST: You can download the MNIST dataset with bash download_mnist.sh.
INaturalist (Subset): You can download the iNaturalist subset with bash download_inaturalist.sh.
CUB: You can download CUB and supplementary files for CUB CBMs with bash download_cub.sh.
ImageNet: You can download imagenet with bash download_imagenet.sh.

If you have already downloaded some of these datasets, you can symlink them to the data/ directory. See symlinks.txt for examples.

Experiments

To reproduce the MNIST experiments, download the checkpoints and the mnist dataset.
- bash mnist_835_experiment.sh for the MNIST subset experiment with only 3s, 5s, and 8s.
- bash mnist_modification_experiment_k=3.sh for the MNIST training modification experiments, with k=3.
To reproduce the CUB PCBM experiments, download the CUB dataset and run:
- bash cub_pcbm_v_cub_masked_pcbm.sh
To reproduce the ImageNet experiments, download the ImageNet dataset and run:
- bash dino_vs_dinov2_imagenet_ar.sh (aligned)
- bash dino_vs_dinov2_imagenet.sh (unaligned)
To reproduce the iNaturalist experiments, download the iNaturalist subset and run:
- bash clip_vs_clipinat_ar.sh (aligned)
- bash clip_vs_clipinat.sh (unaligned)

Minimal Example

The smallest dataset is the INat Subset. The fastest way to run a minimal example is to download the iNaturalist subset and run Experiment 4.

Visualizations

To visualize the results of the experiments, you can run: python analyze_explanations.py. By default this will analyze the inat subset experiment (aligned) (Exp. 4a). There are several commented functions in the script that you can uncomment to visualize the results of other experiments.

Additional Experiments/Visualizations

5) BiomedCLIP layer-wise RDX on RSNA Pneumonia: Applies RDX to compare pre-block layer representations and post-block layer representations of BiomedCLIP to reveal how representations of chest X-rays evolve through the network. Requires the RSNA Pneumonia Detection Challenge dataset. A live interactive visualizer is available at: https://nkondapa.github.io/rdx-page/blog/rsna-biomedclip/

python -m rsna_experiments.analyze_biomedclip --data_root /path/to/RSNA --output_dir outputs/rsna_biomedclip/

5) iNaturalist with classifier guidance (CLIP vs CLIP-iNat, aligned): Uses a TP/FP classifier to guide RDX clusters so they stay within prediction-coherent groups (e.g. true positives vs false positives). Requires the iNaturalist subset.

bash clip_vs_clipinat_inat_ar_guided.sh

6) iNaturalist with RDX v2 (CLIP vs CLIP-iNat, aligned): Uses the updated RDX v2 algorithm (symmetrized difference map, filter threshold) on the same CLIP vs CLIP-iNat comparison. Requires the iNaturalist subset.

bash clip_vs_clipinat_inat_ar_v2.sh

Interactive visualizer for general cross-model comparisons: For experiments 5–7 (and any custom cross-model comparison), an interactive HTML visualizer can be generated with interactive_cluster_viz_general.py. The shell scripts above already invoke it automatically after running the comparisons.

Citation

@article{kondapaneni2025repdiffexp,
  title={Representational Difference Explanations},
  author={Kondapaneni, Neehar and Mac Aodha, Oisin and Perona, Pietro},
  journal={arXiv preprint arXiv:2505.23917},
  year={2025}
}

Note on Reproducibility

Our original code seeded once at the start of comparisons for all methods, however, we realized it is likely to cause inconsistencies due to arbitrary choices made in the order of running the different comparisons. The new code re-seeds at the beginning of each comparison for all methods. This may lead to slightly different results than those reported in the paper, but we have checked that the trends remain the same. We apologize for any inconvenience this may cause.

Note on Concept Selection

We stayed as close as possible to the original concept selection strategies for each baseline method. However, TopK-SAEs, NLMCD, and USAE concept selection strategies were modified for fair comparison on our comparison tasks.

Let k = the number of concepts shown to the user.

TopK-SAE uses 50 latents during training with the top k remaining active. TopK-SAE are trained on each representation independently. After training, we select the k concepts per model with the largest mean activations to show to the user.
NLMCD uses HDBSCAN clustering and generates an arbitrary number of concepts for each representation. We measure concept similarity across models and select the top k most dissimilar concepts for our comparisons.
USAE learns an internal representation of 8 * (representation dimension), much larger than k. To select k concepts we measure firing entropy for each concept and select the k concepts per model with the lowest firing entropy. Firing entropy is defined in the USAE paper and measures how evenly a concept activates across the different models. Low entropy indicates that a concept is more specific to certain models, and is thus more likely to be useful for distinguishing them.

While we feel these choices are reasonable, it is possible that different concept selection strategies may improve baseline performance. Feel free to experiment with different strategies!

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
comparison_configs		comparison_configs
datasets		datasets
models		models
pymf		pymf
rsna_experiments		rsna_experiments
src		src
.gitignore		.gitignore
README.md		README.md
analyze_explanations.py		analyze_explanations.py
clip_vs_clipinat_inat.sh		clip_vs_clipinat_inat.sh
clip_vs_clipinat_inat_ar.sh		clip_vs_clipinat_inat_ar.sh
clip_vs_clipinat_inat_ar_guided.sh		clip_vs_clipinat_inat_ar_guided.sh
clip_vs_clipinat_inat_ar_v2.sh		clip_vs_clipinat_inat_ar_v2.sh
cub_pcbm_v_cub_masked_pcbm.sh		cub_pcbm_v_cub_masked_pcbm.sh
dino_vs_dinov2_imagenet.sh		dino_vs_dinov2_imagenet.sh
dino_vs_dinov2_imagenet_ar.sh		dino_vs_dinov2_imagenet_ar.sh
download_checkpoints.sh		download_checkpoints.sh
download_cub.sh		download_cub.sh
download_imagenet.sh		download_imagenet.sh
download_inat.sh		download_inat.sh
download_mnist.sh		download_mnist.sh
eval_model.py		eval_model.py
generate_comparison_explanations.py		generate_comparison_explanations.py
interactive_cluster_viz_core.py		interactive_cluster_viz_core.py
interactive_cluster_viz_general.py		interactive_cluster_viz_general.py
mnist_835_experiment.sh		mnist_835_experiment.sh
mnist_modification_experiments_k=3.sh		mnist_modification_experiments_k=3.sh
requirements.txt		requirements.txt
setup.py		setup.py
setup.sh		setup.sh
sym_links.txt		sym_links.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Representational Difference Explanations (RDX)

Updates

Setup

Downloading Checkpoints

Downloading Datasets

Experiments

Minimal Example

Visualizations

Additional Experiments/Visualizations

Citation

Note on Reproducibility

Note on Concept Selection

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Representational Difference Explanations (RDX)

Updates

Setup

Downloading Checkpoints

Downloading Datasets

Experiments

Minimal Example

Visualizations

Additional Experiments/Visualizations

Citation

Note on Reproducibility

Note on Concept Selection

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages