
Conversation


@liuvince liuvince commented Dec 8, 2024

Add normalize parameter to GATConv and GATv2Conv.

Part of #4 (TODO update this), our final project for the Stanford CS224W course, this allows "GAT with Symmetric Normalized Adjacency Matrix" as described in "Bag of Tricks for Node Classification with Graph Neural Networks".

Details

  • Implementation of gat_norm, inspired by gcn_norm, handling edge_index given as a SparseTensor, a torch sparse tensor (is_torch_sparse_tensor), or a dense torch Tensor.
  • gat_norm is called after computing the alpha coefficients and returns updated values of edge_index and alpha, which are then passed as inputs to self.propagate (see the sketch after this list).
  • Update the docstrings of GATConv and GATv2Conv.
  • Add unit test cases.
  • Override the add_self_loops parameter. We remove self-loops from the initial graph before calling gat_norm and add self-loops with normalization inside gat_norm, as described in the paper. We tried to use the tools already provided in the library, such as torch_sparse.fill_diag, to_edge_index, add_remaining_self_loops, add_self_loops and to_torch_csr_tensor.
  • One concern is that self-loops receive no learned weight regardless of add_self_loops, because we explicitly remove self-loops before the edge update. This is consistent with the paper's description and with gcn_norm, but differs from the paper's implementation. Also, it seems that they use both the out-degree and the in-degree. We would appreciate your feedback on the preferred approach.
  • When is_torch_sparse_tensor(edge_index) == True, we have an issue converting the edge_index indices and the corresponding att_mat values back into the appropriate format. Our workaround is to sort the att_mat values lexicographically so they match the indices of edge_index for the subsequent propagate and update steps.
  • Only non-bipartite graph message passing is supported.
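
For illustration, here is a minimal sketch of the idea for the plain edge_index Tensor path. The helper name and signature below are illustrative, not the actual gat_norm, and the real implementation also handles the sparse formats:

```python
# Minimal sketch of the dense edge_index (Tensor) path only; signature is
# illustrative. Assumes self-loops were already removed from edge_index.
import torch
from torch_geometric.utils import add_self_loops, degree

def gat_norm_sketch(edge_index, alpha, num_nodes):
    # alpha: [num_edges, heads] attention coefficients; self-loops are added
    # back here with value 1 per head, as in the paper.
    edge_index, alpha = add_self_loops(edge_index, alpha, fill_value=1.,
                                       num_nodes=num_nodes)
    row, col = edge_index[0], edge_index[1]
    # Symmetric normalization: scale alpha_ij by deg(i)^-1/2 * deg(j)^-1/2.
    deg = degree(col, num_nodes, dtype=alpha.dtype)
    deg_inv_sqrt = deg.pow(-0.5)
    deg_inv_sqrt[deg_inv_sqrt == float('inf')] = 0
    norm = deg_inv_sqrt[row] * deg_inv_sqrt[col]
    return edge_index, alpha * norm.view(-1, 1)
```

For the torch sparse tensor path, the values additionally have to be re-sorted lexicographically to match the indices of edge_index, as noted above.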

Benchmarks

I obtained the following metrics on a single T4 GPU: normalization improves accuracy on the CiteSeer and PubMed datasets, at some computation-time cost.

| Dataset  | Test Accuracy | Test Accuracy (with GAT Norm) | Duration | Duration (with GAT Norm) |
| -------- | ------------- | ----------------------------- | -------- | ------------------------ |
| Cora     | 0.831 ± 0.004 | 0.825 ± 0.005                 | 4.296s   | 5.172s                   |
| CiteSeer | 0.707 ± 0.005 | 0.715 ± 0.005                 | 4.767s   | 5.592s                   |
| PubMed   | 0.789 ± 0.003 | 0.796 ± 0.004                 | 6.603s   | 7.204s                   |

with the following run commands:

python gat.py --dataset=Cora
python gat.py --dataset=Cora --normalize

python gat.py --dataset=CiteSeer
python gat.py --dataset=CiteSeer --normalize  

python gat.py --dataset=PubMed --lr=0.01 --output_heads=8 --weight_decay=0.001
python gat.py --dataset=PubMed --lr=0.01 --output_heads=8 --weight_decay=0.001 --normalize 

@mattjhayes3
Owner

Just a few nits on the description:

  • I'd say "...this allows 'GAT with Symmetric Normalized Adjacency Matrix' as described..." to make it a bit clearer which part of the paper it's implementing.
  • "I would appreciate your feedback on whether this is the correct approach." I'd change to something like "This is consistent with the paper's description and gcn_norm, but different from the paper's implementation. We would appreciate your feedback on the preferred approach"
  • Do we want to mention that they use both out-degree and in-degree?

def gat_norm(  # noqa: F811
    edge_index: Adj,
    edge_weight: Tensor,
    num_nodes: Optional[int] = None,
Owner

If we never pass this, I think we should remove it from the function signature. But would it make sense to use size.size(1) in case the user passes it? (only exists on GATConv but not GATv2Conv)

Owner

Sorry, on second look, size is already factored in when computing alpha, right? So can't we just use alpha's shape? Then we could get rid of the num_nodes parameter.

Collaborator Author

Hmm, I don't think that is necessarily true, especially when the input is sparse.


return to_torch_csr_tensor(edge_index), att_mat

assert flow in ['source_to_target', 'target_to_source']
Owner

Maybe move this to the top, as it's relevant to all the tensor-type cases?

Actually, we don't currently use flow in the SparseTensor case; should we use it when computing the degree, as in the other tensor-type cases?

Owner

Should we use flow when computing deg in the SparseTensor case too or would that be wrong?

Collaborator Author

Based on gcn_norm, I don't think we need it.

Owner

@mattjhayes3 mattjhayes3 Dec 13, 2024

Could this be a bug in gcn_norm?
Any idea why we would need it in the other cases but not here?
I thought switching flow is supposed to effectively swap the direction of the edges.
This way, won't we compute the same degrees regardless of flow, which would be incorrect in the directed case?
Maybe worth asking in the PR.
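
For illustration, a small hypothetical helper (not the library code) showing how flow would change which endpoint's degrees get accumulated for a plain edge_index Tensor; on a directed graph the two results differ:

```python
import torch
from torch_geometric.utils import degree

def degrees_by_flow(edge_index, num_nodes, flow='source_to_target'):
    assert flow in ['source_to_target', 'target_to_source']
    row, col = edge_index[0], edge_index[1]
    # With 'source_to_target', messages are aggregated at `col`, so degrees are
    # taken over the target index; swapping flow effectively reverses the edges.
    idx = col if flow == 'source_to_target' else row
    return degree(idx, num_nodes, dtype=torch.float)

edge_index = torch.tensor([[0, 0, 1], [1, 2, 2]])
print(degrees_by_flow(edge_index, 3))                      # tensor([0., 1., 2.])
print(degrees_by_flow(edge_index, 3, 'target_to_source'))  # tensor([2., 1., 0.])
```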

@liuvince liuvince requested a review from mattjhayes3 December 9, 2024 20:27
@mattjhayes3
Owner

Typo "paper's. Also implementation" -> "paper's implementation"

"The usage of 'normalize' is not supported "
"for bipartite message passing.")

if self.normalize:
Owner

@mattjhayes3 mattjhayes3 Dec 10, 2024

We could combine the if statements, but actually, is there much advantage to raising the error here instead of where you already have the assert statements?
