Self-Counterfactual Learning with Large Language Models

This repository contains code for generating causal graphs from natural-language text and performing counterfactual reasoning with large language models.

Install

Run the following command to install the required packages:

pip install -r requirements.txt

You can install the packages in a virtual environment by running the following commands:

python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Data

For evaluation, we use data from the CLadder dataset. The data is downloaded automatically from the Hugging Face Hub.

We also use real-world data from news articles via EventRegistry. The dataset is not publicly available as of now, but the raw data can be downloaded from here.

Usage

You can build a causal graph from text data by running the following command. Use the provided template to create your own configuration file.

python build_graph.py config/build_graph.yaml

You can perform counterfactual reasoning by running the following command. Load a generated causal graph and use the provided template to create your own configuration file.

python compute_counterfactuals.py config/counterfactual.yaml
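
As a rough illustration of what a counterfactual query asks, the sketch below walks through the usual three steps (abduction, action, prediction) on a toy two-cause model. It is not the repository's implementation, which relies on a large language model and a generated causal graph; the variables `rain`, `sprinkler` and the functions `wet_grass` and `counterfactual_wet` are hypothetical names chosen for the example.

```python
# Hypothetical toy model for illustration; none of these names come from the repository.
def wet_grass(rain: bool, sprinkler: bool) -> bool:
    """Structural equation: the grass is wet if it rained or the sprinkler ran."""
    return rain or sprinkler


def counterfactual_wet(observed: dict, intervention: dict) -> bool:
    """Answer "would the grass be wet had we intervened?" for the observed world."""
    # 1. Abduction: the root variables have no parents, so their observed
    #    values fully describe the background conditions of this world.
    world = {"rain": observed["rain"], "sprinkler": observed["sprinkler"]}
    # 2. Action: overwrite the intervened variables, ignoring their usual causes.
    world.update(intervention)
    # 3. Prediction: recompute the downstream variable in the modified world.
    return wet_grass(world["rain"], world["sprinkler"])


factual = {"rain": True, "sprinkler": False}
print(counterfactual_wet(factual, {"rain": False}))  # False
```

In the factual world the grass is wet because it rained; under the intervention "no rain", with the sprinkler kept off as observed, the grass would have been dry.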

⚠️ Warning: Inference does not handle colliders yet. The generated computation graph shows the structure with colliders, but the inference result does not integrate information from them. Colliders are v-structures X -> Z <- Y with Z observed, which creates a dependency between X and Y. As of now, Z is ignored in the inference computations. In practice, modifying observations can lead to nonsensical causal graphs. Favour using intervention nodes and the counterfactual setting, as this is how this repository is intended to be used.
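
To check whether a graph is affected by this limitation, one option is to list its observed colliders before running inference. The sketch below assumes the graph is available as a plain list of (cause, effect) pairs plus a set of observed node names; `find_observed_colliders` is a hypothetical helper, not a function from this repository.

```python
from itertools import combinations


def find_observed_colliders(edges, observed):
    """Return (x, z, y) triples such that x -> z <- y and z is observed.

    Every pair of parents is reported, including pairs that are themselves
    connected by an edge (shielded colliders).
    """
    parents = {}
    for cause, effect in edges:
        parents.setdefault(effect, set()).add(cause)

    colliders = []
    for child in sorted(parents):
        if child not in observed:
            continue
        # Each unordered pair of parents forms one collider at this child.
        for x, y in combinations(sorted(parents[child]), 2):
            colliders.append((x, child, y))
    return colliders


edges = [("X", "Z"), ("Y", "Z"), ("Z", "W")]
print(find_observed_colliders(edges, observed={"Z"}))  # [('X', 'Z', 'Y')]
```

An empty result suggests that the warning does not apply to the graph at hand.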

You can perform end-to-end inference by running the following command. Use the provided template to create your own configuration file.

python end_to_end.py config/end_to_end.yaml

Evaluate the model on the CLadder dataset by running the following command:

python evaluate.py config/cladder_evaluate.yaml

Causal Graph Attributes

The built causal graphs have the following attributes:

Nodes:

description: [str] A description of the causal variable
type: [str] The type of the causal variable
values: [str] The possible values of the causal variable
current_value: [str] The current value of the causal variable
context: [str] The context in which the causal variable is defined
observed: [bool] Whether the causal variable is observed or not
layer: [int] (Optional) The layer of the causal variable in the topological causal graph (used for visualisation)
updated_value: [str] (Optional) The updated value of the causal variable after inference
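
The README does not say how layer is computed. One common convention, shown here purely as an assumption, is to give root nodes layer 0 and every other node one more than the deepest of its parents. The helper `assign_layers` below is hypothetical and uses only the Python standard library (`graphlib`, Python 3.9+).

```python
from graphlib import TopologicalSorter


def assign_layers(edges, nodes=()):
    """Map each node to its layer: 0 for roots, else 1 + max layer of its parents."""
    parents = {node: set() for node in nodes}
    for cause, effect in edges:
        parents.setdefault(cause, set())
        parents.setdefault(effect, set()).add(cause)

    layers = {}
    # static_order() yields every node after all of its parents and raises
    # graphlib.CycleError if the graph is not acyclic.
    for node in TopologicalSorter(parents).static_order():
        layers[node] = 1 + max((layers[p] for p in parents[node]), default=-1)
    return layers


print(assign_layers([("A", "B"), ("B", "C"), ("A", "C")]))
```

With this convention, nodes sharing a layer have no directed path between them, so they can be drawn in the same column or row.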

Edges:

description: [str] A description of the causal relationship
details: [str] Additional details about the causal relationship
observed: [bool] Whether the causal relationship is observed or not
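
The attribute lists above can be mirrored as typed records, which may help when validating or constructing graphs by hand. This is only a sketch: the class names `CausalNode` and `CausalEdge` and the example values are hypothetical, and the repository may store these attributes differently (for instance, directly on graph objects).

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class CausalNode:
    description: str
    type: str
    values: str
    current_value: str
    context: str
    observed: bool
    layer: Optional[int] = None           # optional, used for visualisation
    updated_value: Optional[str] = None   # optional, filled in after inference


@dataclass
class CausalEdge:
    description: str
    details: str
    observed: bool


# Example values are made up for illustration.
node = CausalNode(
    description="Whether it rained overnight",
    type="boolean",
    values="true, false",
    current_value="true",
    context="Weather report for the city",
    observed=True,
)
edge = CausalEdge(
    description="Rain makes the grass wet",
    details="Stated explicitly in the text",
    observed=True,
)
print(asdict(node))
print(asdict(edge))
```

`asdict` turns each record into a plain dictionary whose keys match the attribute names listed above.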

Strong-AI-Lab/counterfactual-llm-inference

Folders and files

Latest commit

History

Repository files navigation

Self-Counterfactual Learning with Large Language Models

Install

Data

Usage

Causal Graph Attributes

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages