lc2st: local classifier two-sample test

This repo contains the official code for L-C2ST: Local Diagnostics for Posterior Approximations in Simulation-Based Inference,
by Julia Linhart, Alexandre Gramfort and Pedro L.C. Rodrigues. Published in NeurIPS 2023.

A more practical (and maintained) implementation of lc2st can be found within the sbi python package, including an introductory tutorial on how to use and interpret these validation diagnostics.

How to run the code

Dependencies

Create a python 3.10 conda environment: conda create -n lc2st python=3.10
Clone this repo and run the following to install the lc2st package and a subset of its dependencies (lampe, sbi, sbibm, seaborn, tueplots, zuko) as specified in setup.py:

pip install -e .
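After installing, it can help to confirm that the listed dependencies are actually visible to the active environment. The following is a minimal sketch, not part of the repo: the helper name check_dependencies is hypothetical, and it assumes each package's import name matches the name given in the list above.

```python
from importlib.util import find_spec

# Dependencies named in the install step above.
# Assumption: the import name of each package equals its listed name.
DEPENDENCIES = ["lampe", "sbi", "sbibm", "seaborn", "tueplots", "zuko"]


def check_dependencies(names=DEPENDENCIES):
    """Map each top-level package name to True if it can be located."""
    return {name: find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, found in check_dependencies().items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

Run it inside the lc2st conda environment; any package reported as MISSING was not installed by pip install -e . and needs to be installed separately.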

Special requirements

The following dependencies are only required to run experiments for the Bernoulli GLM (Raw) task from sbibm.

pypolyagamma: clone the repo and make required changes in the deps folder
sbibm: clone the repo and change the task.py file as follows:
- add observation: Optional[torch.Tensor] = None, as an input variable to the _sample_reference_posterior method
- change dtype=np.int to dtype=int on line 245

After making these changes, run pip install -e . in both repository directories.
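The dtype change is needed because the np.int alias was deprecated in NumPy 1.20 and removed in NumPy 1.24; it was only ever an alias for the builtin int. A minimal illustration of the replacement, where the array counts is a hypothetical stand-in for whatever line 245 of task.py builds:

```python
import numpy as np

# Before (raises AttributeError on NumPy >= 1.24, where the alias is gone):
#     counts = np.zeros(5, dtype=np.int)

# After: pass the builtin `int`, which is what the alias pointed to.
counts = np.zeros(5, dtype=int)

print(counts.dtype.kind)  # "i" marks a signed integer dtype
```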

Getting Started

A detailed tutorial on lc2st will be uploaded soon. Until then, see the code to reproduce results and figures from the paper.

Reproduce Results and Figures

Experiment 1: Single class evaluation

To generate Figure 1, run:

python figure1_lc2st_2023.py --plot

Precomputed results are available in saved_experiments/exp_1. They were obtained by running the following commands in the terminal:

python figure1_lc2st_2023.py --opt_bayes --t_shift
python figure1_lc2st_2023.py --power_shift

Experiment 2: Statistical method comparison on `sbibm` tasks

To generate the figure for each task (i.e. each row of Figure 2 in the main paper and of Figure 5 in Appendix F.1), run:

python figure2_lc2st_2023.py --plot --task <task_name>

Here, <task_name> is one of two_moons, slcp, gaussian_mixture, gaussian_linear_uniform, bernoulli_glm, bernoulli_glm_raw.
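To produce every row of the figure in one go, the plot command can be repeated for each task. The following is a rough sketch, not part of the repo: it only reuses the script name, flags and task names given above, and the helper plot_command is hypothetical. The subprocess call is left commented out so the snippet prints the commands without running them.

```python
import shlex

# Task names as listed above.
TASKS = [
    "two_moons",
    "slcp",
    "gaussian_mixture",
    "gaussian_linear_uniform",
    "bernoulli_glm",
    "bernoulli_glm_raw",
]


def plot_command(task):
    """Build the Figure 2 plot command for one task as an argument list."""
    return ["python", "figure2_lc2st_2023.py", "--plot", "--task", task]


commands = [plot_command(task) for task in TASKS]

for cmd in commands:
    print(shlex.join(cmd))
    # To actually run each command from the repo root, uncomment:
    # import subprocess; subprocess.run(cmd, check=True)
```

Note that bernoulli_glm_raw additionally needs the patched pypolyagamma and sbibm installs described under Special requirements.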

Precomputed results are available in saved_experiments/exp_2. They were obtained by running the following commands in the terminal:

Varying N_train (Columns 1 and 2):

python figure2_lc2st_2023.py --task <task_name> --t_res_ntrain --n_train 100 1000 10000 100000
python figure2_lc2st_2023.py --task <task_name> --t_res_ntrain --n_train 100 1000 10000 100000 --power_ntrain

Varying N_cal (Columns 3 and 4):

python figure2_lc2st_2023.py --task <task_name> --power_ncal --n_cal 100 500 1000 2000 5000 10000

Runtimes (Appendix A.5):

python figure2_lc2st_2023.py --task slcp --runtime -nt 0 --n_cal 5000 10000

Experiment 3: Global / Local Coverage Test and graphical diagnostics on the JRNMM

To generate Figure 3, run:

python figure3_lc2st_2023.py --plot

Add --lc2st_interpretability for Figure 4 and Appendix D.

Precomputed results are available in saved_experiments/exp_2. They were obtained by running the following commands in the terminal:

Global Coverage results (left panel of Figure 3):

python figure3_lc2st_2023.py --global_ct

Local Coverage results for varying gain (right panel of Figure 3, Figure 4 and Appendix D):

python figure3_lc2st_2023.py --local_ct_gain

Appendix F.2: Accuracy of L-C2ST w.r.t. the true C2ST

To generate Figure 6 (correlation scatter plots) and results for Table 3 (corresponding p-values), run:

python figure6_lc2st_2023.py --observations <task/empirical> --method <lc2st/lc2st_nf> --task <task_name>

Here, <task_name> is one of two_moons, slcp, gaussian_mixture, gaussian_linear_uniform, bernoulli_glm, bernoulli_glm_raw.

Specifying empirical for the --observations argument generates results and plots for 100 different reference observations (instead of just 10).
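Since the command takes three independent arguments, the full set of runs is the Cartesian product of their values. A small sketch of how that grid could be enumerated, assuming the placeholders <task/empirical> and <lc2st/lc2st_nf> mean the literal values task, empirical, lc2st and lc2st_nf; the helper figure6_command is hypothetical and nothing is executed:

```python
import shlex
from itertools import product

# Assumed literal values for each argument, read off the command above.
OBSERVATIONS = ["task", "empirical"]
METHODS = ["lc2st", "lc2st_nf"]
TASKS = [
    "two_moons",
    "slcp",
    "gaussian_mixture",
    "gaussian_linear_uniform",
    "bernoulli_glm",
    "bernoulli_glm_raw",
]


def figure6_command(observations, method, task):
    """Build one Figure 6 / Table 3 command as an argument list."""
    return [
        "python", "figure6_lc2st_2023.py",
        "--observations", observations,
        "--method", method,
        "--task", task,
    ]


# One command per (observations, method, task) combination.
grid = [figure6_command(o, m, t) for o, m, t in product(OBSERVATIONS, METHODS, TASKS)]

for cmd in grid:
    print(shlex.join(cmd))
```

Printing the commands first makes it easy to pick a subset, for example only the empirical runs, before launching anything.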

Precomputed results are available in saved_experiments/exp_2/<task_name>/scatter_plots. They were obtained by running the previous command in the terminal. Add --sbibm_obs to generate the 100 new observations via the sbibm platform. This can be problematic for some tasks, which is why, by default, they are simply generated with the prior and simulator directly.
