Added md5sums utilities functions, for a faster data loading #8

g-antonello · 2024-08-20T11:17:52Z

Description of the usage is in the file added. In brief my idea is that every time you generate a new data version, you also run a couple of extra functions on the same parameters used to generate the file version. This should be enough to generate a md5sum object to then compare when loading.

A data loading example is also shown in the same script.

Overall, these functions could be implemented in a fancier way in a data generation pipeline, but even so they should speed data loading by 5x at least.

Added md5sums utilities functions, for a faster data loading

0b476b1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added md5sums utilities functions, for a faster data loading #8

Added md5sums utilities functions, for a faster data loading #8

g-antonello commented Aug 20, 2024

Added md5sums utilities functions, for a faster data loading #8

Are you sure you want to change the base?

Added md5sums utilities functions, for a faster data loading #8

Conversation

g-antonello commented Aug 20, 2024