
How to skip the batch when OOM happens? #652

Open
Xinheng-He opened this issue Sep 6, 2024 · 1 comment

Comments

@Xinheng-He

Hi developers:

Hydra-lightning is a really cool tool and I like it! However, my batches contain graphs of very different sizes, which sometimes causes OOM (GPU out-of-memory) errors. Previously I would skip such a batch manually, but in hydra-lightning this seems hard to do. Could this be added in a future version, or how can I skip a batch when it triggers an OOM?

Xinheng

@Xinheng-He
Author

[screenshot of the modified training_step]
I made it work by adding a guard in training_step like this. However, when I run the code on multiple GPUs, training stalls when the step returns None (whether or not I clear the cache first), so I think this trick only works for single-GPU training.
Hope it helps others.
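
For reference, here is a minimal sketch of this kind of OOM guard. The screenshot above is not reproduced here, so the class name and the `compute_loss` helper are illustrative, not the author's exact code; it also assumes a recent PyTorch that exposes `torch.cuda.OutOfMemoryError`:

```python
import torch
from lightning import LightningModule  # or pytorch_lightning, depending on the template version


class GraphLitModule(LightningModule):
    """Hypothetical module illustrating how to skip a batch that runs out of GPU memory."""

    def training_step(self, batch, batch_idx):
        try:
            loss = self.compute_loss(batch)  # hypothetical forward-pass + loss helper
        except torch.cuda.OutOfMemoryError:
            # Free the cached blocks left by the failed forward pass,
            # then ask Lightning to skip this batch by returning None.
            torch.cuda.empty_cache()
            return None
        self.log("train/loss", loss)
        return loss
```

Returning None from training_step makes Lightning skip the optimizer step for that batch, which works on a single GPU. Under DDP, however, the ranks that did not hit the OOM still wait for the gradient all-reduce, so training can hang when only one rank returns None, which matches the behaviour reported above.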
