PT: add uniform likelihood bucket batching #1661

NeoLegends · 2024-12-05T13:04:45Z

This is a first attempt at automatically optimizing the bucket limits during training. After every subepoch the limits are adjusted to make every bucket catch a roughly equal number of segments.

I am debating on two more things:

Keeping only a fixed number of seq lens that have passed through the batching (e.g. 10k + upper + lower value) to have an upper bound on the memory consumption. Whenever the list is extended by another batch and crosses said limit, we re-sample randomly from that list to keep it at a fixed size. As long as the number of seq lens we keep is large enough this shouldn't have a big influence on the accuracy of the results.
Keeping seq len statistics even across subepochs to make the statistics better.

albertz · 2024-12-05T13:42:41Z

When I mention that I would try to optimize this, I was more thinking about writing a dedicated script just to do that.

But this here could also be interesting.

I'm not sure thought that I would put this into RETURNN yet, while you are still experimenting with it. You can rather put this somewhere into your i6_experiments.users. and then just use it from there. (I think we might need to extend the class resolution slightly, if "." in cls_name: ...; see similar code in rf.build_from_dict or get_optimizer_class or other such functions.)

NeoLegends self-assigned this Dec 5, 2024

NeoLegends force-pushed the moritz-optim-bucket-limits branch 4 times, most recently from 9b53a49 to c80d589 Compare December 5, 2024 13:10

NeoLegends marked this pull request as ready for review December 5, 2024 13:13

NeoLegends requested review from albertz and a team as code owners December 5, 2024 13:13

NeoLegends force-pushed the moritz-optim-bucket-limits branch 2 times, most recently from a2d513a to 1392e99 Compare December 5, 2024 13:17

NeoLegends changed the title ~~PT: add optimizing bucket batching~~ PT: add uniform likelihood bucket batching Dec 5, 2024

PT: add uniform likelihood bucket batching

0a561e4

NeoLegends force-pushed the moritz-optim-bucket-limits branch from 1392e99 to 0a561e4 Compare December 5, 2024 13:17

add docs

2e59d16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PT: add uniform likelihood bucket batching #1661

PT: add uniform likelihood bucket batching #1661

NeoLegends commented Dec 5, 2024 •

edited

Loading

albertz commented Dec 5, 2024

PT: add uniform likelihood bucket batching #1661

Are you sure you want to change the base?

PT: add uniform likelihood bucket batching #1661

Conversation

NeoLegends commented Dec 5, 2024 • edited Loading

albertz commented Dec 5, 2024

NeoLegends commented Dec 5, 2024 •

edited

Loading