Add RetailHero
and MovieLens25
bipartite datasets with causal parameters
#9471
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
I would like to add two new bipartite networks in PyG.
One is based on the RetailHero (https://ods.ai/competitions/x5-retailhero-uplift-modeling/data) (users-buy-products) with causal outcomes and interventions from a marketing campaign, and the other is based on MovieLens25 (https://grouplens.org/datasets/movielens/25m/) (movies-ratedby-users) with observational causal parameters .
I use them to test a GNN for causal outcome prediction in https://hal.science/hal-04601553/document ( ECMLPKDD 2024 ) and I believe they re useful for this line of research.