Amortized-Interpretability

Codebase for the ACL 2023 paper "Efficient Shapley Values Estimation by Amortization for Text Classification"

Author Team:

Chenghao Yang (University of Chicago)
Fan Yin (University of California, Los Angeles)

Supervisor Team:

He He (New York University)
Kai-Wei Chang (University of California, Los Angeles)

Industrial Support:

Xiaofei Ma, Bing Xiang (AWS AI Labs)

Reference

If you use this code as part of any published research, please acknowledge the following paper (it encourages researchers who publish their code!):

@inproceedings{yang-2023-amortized,
    title = "Efficient Shapley Values Estimation by Amortization for Text Classification",
    author = "Yang, Chenghao and Yin, Fan and He, He and Chang, Kai-Wei and Ma, Xiaofei and Xiang, Bing",
    booktitle = "Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics",
    year = "2023",
    publisher = "Association for Computational Linguistics",
}

Project Structure

thermostat: Configuration files we use to run the Ranking Stability experiments. This is our modified copy of the thermostat repo; to get the most up-to-date implementation, update it from the original thermostat repo.
InterpCalib: Calibration via Interpretation (ACL'22). We updated the implementation contributed by Xi Ye to 1) allow different random seeds when computing Shapley Values and 2) change the I/O interface so that calibration can use our own outputs. This is our modified copy of the InterpCalib repo.
The remaining files are our own code. Chenghao will package it better later to improve readability. Most importantly, amortized_model.py contains the main Amortized Model code, and you can run it via run.py.

Dependency Installation

For the main repo:

git clone https://github.com/yangalan123/Amortized-Interpretability.git
cd Amortized-Interpretability
git submodule update --init --recursive
conda create -p ./env python=3.9 # Python 3.8 should also work, but we use 3.9 here for better compatibility. This is subject to future updates.
conda activate ./env # the environment location is up to you; ./env is only an example
cd thermostat # we now have the local thermostat copy, so let's build it first
pip install -e . # please ignore the requirements.txt in thermostat/, as it is older and may result in unexpected errors
cd ..
pip install -r requirements.txt # build the main repo dependencies

For the dependencies of thermostat and InterpCalib, please check their respective READMEs.

Running Instructions (Ranking Stability for Shapley Values)

First, we need to compute Shapley Values with several different random seeds. We use thermostat to do this; please check thermostat/README.md for more details. We provide a script, thermostat/run.sh, to help. For example, you can run:

   bash run.sh task=yelp_polarity model=bert explainer=svs-3600 seed=1 batch_size=1 device=0
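
Since the stability experiments need the same command repeated with different seeds, a small helper can generate one command per seed. The following is only a sketch: the function name `build_seed_commands` and the seed range are hypothetical, and the arguments are simply copied from the example command above. Nothing is executed; the commands are only printed.

```python
import shlex


def build_seed_commands(seeds, task="yelp_polarity", model="bert",
                        explainer="svs-3600", batch_size=1, device=0):
    """Return one `bash run.sh ...` argument list per seed (nothing is run)."""
    commands = []
    for seed in seeds:
        commands.append([
            "bash", "run.sh",
            f"task={task}", f"model={model}", f"explainer={explainer}",
            f"seed={seed}", f"batch_size={batch_size}", f"device={device}",
        ])
    return commands


if __name__ == "__main__":
    # Seeds 1..3 are an arbitrary illustrative choice, not the paper's setting.
    for argv in build_seed_commands(range(1, 4)):
        print(shlex.join(argv))
```

To actually launch the runs, each argument list could be passed to `subprocess.run(argv, check=True)` from inside the thermostat directory.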

We understand that running all the seeds can be computationally expensive, so we provide pre-computed Shapley Values here. Download and unzip them under the thermostat directory. The resulting directory structure should be thermostat/experiments/....

Then you can compute Spearman's Rank Correlation Coefficient (SRCC) between the Shapley Values from different runs. See internal_correlation.py for more details. You need to update the Shapley Values directory in the code. This script automatically creates Tables 1-2 and Figure 2 in the paper.
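
To illustrate what SRCC measures here, the sketch below computes Spearman's correlation between token-level attribution vectors obtained with different seeds, using only the standard library. It is not the code in internal_correlation.py: the function names are hypothetical, the toy attribution values are made up, and averaging over all seed pairs is an assumed way of aggregating. Constant vectors (zero rank variance) are not handled.

```python
from itertools import combinations
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho: the Pearson correlation of the two rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


def mean_pairwise_srcc(attributions_by_seed):
    """Average SRCC over all seed pairs for one input's attributions."""
    return mean(spearman(a, b) for a, b in combinations(attributions_by_seed, 2))


if __name__ == "__main__":
    # Toy attributions for one 4-token input under three seeds (made up).
    seed_1 = [0.9, 0.1, -0.3, 0.4]
    seed_2 = [0.8, 0.2, -0.1, 0.5]   # same token ranking as seed_1
    seed_3 = [0.1, 0.9, -0.3, 0.4]   # top two tokens swapped
    print(mean_pairwise_srcc([seed_1, seed_2, seed_3]))
```

A value close to 1 means the runs rank the tokens almost identically; lower values indicate that the ranking changes with the seed.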

Running Instructions (Amortized Model Training)

We mainly use pre-computed thermostat Shapley Values to train our Amortized Model, so first follow the instructions in thermostat/README.md to compute them.
Next, create the dataset for training and evaluation. See create_dataset.py for more details; you need to update the file path there.
Then run run.py to train the Amortized Model. See run.sh for more details. An example command:

  CUDA_VISIBLE_DEVICES=${device} python run.py --seed ${seed}  --lr ${lr} -e ${epoch} --train_bsz ${train_bsz} --explainer ${explainer} --topk 10 --task ${task} -tm ${target_model} --storage_root ${output_dir}

Note that run.py automatically computes the performance numbers in Table 3; you can find them in the output directory. To save computational resources, the code first checks whether the model has already been trained and saved. If not, it trains the model.
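
The check-then-train behavior described above follows a common caching pattern, sketched below. This is not the logic in run.py itself: `load_or_train` and its callback parameters are hypothetical names, and the checkpoint is represented by a plain text file purely for illustration.

```python
import tempfile
from pathlib import Path


def load_or_train(checkpoint_path, train_fn, save_fn, load_fn):
    """Load a saved model if one exists; otherwise train it and save it.

    Returns (model, trained_now) so the caller can tell which path ran.
    """
    path = Path(checkpoint_path)
    if path.is_file():
        return load_fn(path), False          # checkpoint found: skip training
    model = train_fn()
    path.parent.mkdir(parents=True, exist_ok=True)
    save_fn(model, path)
    return model, True


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        ckpt = Path(tmp) / "output" / "model.txt"
        train = lambda: "weights"                   # stand-in for real training
        save = lambda model, p: p.write_text(model)
        load = lambda p: p.read_text()
        print(load_or_train(ckpt, train, save, load))  # first call trains
        print(load_or_train(ckpt, train, save, load))  # second call loads
```

With this pattern, rerunning the same command against the same output directory only repeats the evaluation, not the training.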

Feel free to change the settings in config.py for better training performance.

You can use compute_amortized_model_consistency.py to compute the consistency between the Amortized Model and Shapley Values. You need to update the file path there. This script automatically creates Table 5 in the paper.
