Hi!
How is represented state of data iteration in Trinity?
In Verl StatefulDataLoader is used (and in general classic DataLoader components): https://github.com/volcengine/verl/blob/313366fd85e95ad43d567a808dd647089723a255/verl/trainer/ppo/ray_trainer.py#L990-L997
How is iteration index / data permutation is represented in Trinity?
Is this state saved to checkpoint? or some metadata near the checkpoint?
Is this state being loaded/recovered when resuming from a checkpoint?
In general, we're lacking understanding of data prep infrastructure of Trinity (and how it relates to Verl's - if any of the Verl's data loading components are reused in Trinity)
Thanks!
Hi!
How is represented state of data iteration in Trinity?
In Verl StatefulDataLoader is used (and in general classic DataLoader components): https://github.com/volcengine/verl/blob/313366fd85e95ad43d567a808dd647089723a255/verl/trainer/ppo/ray_trainer.py#L990-L997
How is iteration index / data permutation is represented in Trinity?
Is this state saved to checkpoint? or some metadata near the checkpoint?
Is this state being loaded/recovered when resuming from a checkpoint?
In general, we're lacking understanding of data prep infrastructure of Trinity (and how it relates to Verl's - if any of the Verl's data loading components are reused in Trinity)
Thanks!