Support finding multiple data layers during dataset exploration for simpler dataset formats without a metadata file #7924

Open
MichaelBuessemeyer opened this issue Jul 17, 2024 · 0 comments

Detailed Description

The upcoming changes in #7912 allow the exploration to work recursively and to look for a dataset within subpaths of the path given by the user. This works well for dataset formats that have a metadata file describing the whole dataset and the location of all its layers (e.g. a datasource-properties.json file). However, the exploration of simpler formats without such a metadata file usually finds only a single mag of a single layer.

Suppose a dataset looks like this:

  • dataset
    • layer 1
      • mag 1
        • shard 1
          • actual data files
        • shard 2
          • actual data files
        • ...
      • mag 2
        • shard 1
          • actual data files
        • shard 2
          • actual data files
        • ...
    • layer 2
      • mag 1
        • shard 1
          • actual data files
        • shard 2
          • actual data files
        • ...
      • mag 2
        • shard 1
          • actual data files
        • shard 2
          • actual data files
        • ...

The current exploration code would only find mag 1 of layer 1 and report it as a layer. For a simpler dataset format, the other mags and layers would not be found.

Thus, the exploration could be extended to look for additional layers and mags and group them correctly.
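
As a rough illustration of the grouping step, here is a minimal Python sketch. It is not the existing exploration code: the function name `group_mags_by_layer` and the input format (a flat list of paths to mag directories that a recursive exploration has already found) are assumptions made for this example. It treats the parent directory of each mag as its layer.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def group_mags_by_layer(mag_paths):
    """Group mag directories by their parent directory, which is taken to be the layer.

    Hypothetical helper: assumes every path points at a mag directory and
    that all mags of one layer share the same parent directory.
    """
    layers = defaultdict(list)
    for raw_path in mag_paths:
        path = PurePosixPath(raw_path)
        layers[str(path.parent)].append(path.name)
    # Sort for a deterministic result regardless of discovery order.
    return {layer: sorted(mags) for layer, mags in sorted(layers.items())}


# Mag directories from the example layout above, in arbitrary discovery order.
found_mags = [
    "dataset/layer 2/mag 1",
    "dataset/layer 1/mag 2",
    "dataset/layer 1/mag 1",
    "dataset/layer 2/mag 2",
]
grouped = group_mags_by_layer(found_mags)
for layer, mags in grouped.items():
    print(layer, mags)
```

With this grouping, each layer would be reported once with all of its mags instead of only the first mag that was found.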

For reference:
