
Why does XR-Transformer require so much RAM on Amazon-670k (and other large datasets)? #223

Open
celsofranssa opened this issue May 19, 2023 · 3 comments
Labels: bug (Something isn't working)

Comments

@celsofranssa

Hello,

Reading the XR-Transformer paper, I would like to know the time complexity of the multi-resolution learning step in terms of the number of text instances (N) and the number of labels (L).
Is the multi-resolution learning step what causes the large amount of RAM required to apply XR-Transformer to Amazon-670k?

celsofranssa added the bug label on May 19, 2023
@jiong-zhang
Contributor

The XR-Transformer model consists of two parts: a text encoder and an XMC ranker. The XMC ranker's space complexity is linear in the number of output labels and in the dimensionality/sparsity of the input features. Therefore, generally speaking, when the output label space is large and the input features (TF-IDF + dense embeddings) are dense, the memory cost is higher.

To reduce the memory cost, you can adjust the threshold used to sparsify the XMC ranker (link); weights whose magnitude falls below that value are set to 0.
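
As a rough back-of-the-envelope illustration of why the ranker dominates memory, the sketch below estimates the size of a sparse labels-by-features weight matrix at different densities. The label count is roughly that of Amazon-670k, but the feature dimension and density values are placeholders rather than measurements, and the sketch ignores the hierarchical structure of the ranker and the memory used by the text encoder itself.

```python
# Back-of-the-envelope memory estimate for the XMC ranker's weight matrix.
# All numeric values are illustrative placeholders, not measured quantities.

def ranker_memory_gb(num_labels, feature_dim, density,
                     value_bytes=4, index_bytes=4):
    """Approximate memory (GB) of a labels-by-features weight matrix stored
    in a CSR/CSC-like sparse format: one value plus one index per nonzero."""
    nnz = num_labels * feature_dim * density
    return nnz * (value_bytes + index_bytes) / 1e9

# ~670k labels for Amazon-670k; feature_dim stands in for the concatenated
# TF-IDF + dense-embedding dimension (placeholder value).
for density in (1.0, 0.1, 0.01):
    gb = ranker_memory_gb(num_labels=670_091, feature_dim=140_000, density=density)
    print(f"density={density:5}: ~{gb:,.0f} GB")
```

Driving the density down (i.e., raising the sparsification threshold) shrinks this footprint roughly linearly, which is why the threshold is the main knob for fitting the ranker into a fixed RAM budget.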

@celsofranssa
Author

Thank you.

@celsofranssa
Author

celsofranssa commented Dec 16, 2023

> The XR-Transformer model consists of two parts: a text encoder and an XMC ranker. The XMC ranker's space complexity is linear in the number of output labels and in the dimensionality/sparsity of the input features. Therefore, generally speaking, when the output label space is large and the input features (TF-IDF + dense embeddings) are dense, the memory cost is higher.
>
> To reduce the memory cost, you can adjust the threshold used to sparsify the XMC ranker (link); weights whose magnitude falls below that value are set to 0.

Could you provide a threshold value that allows training XR-Transformer on the Amazon-670k dataset in an environment with 128 GB of RAM?
