Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Trac: Advanced support for reading external data files #191

Open
MichaelClerx opened this issue May 5, 2020 · 0 comments
Open

Trac: Advanced support for reading external data files #191

MichaelClerx opened this issue May 5, 2020 · 0 comments

Comments

@MichaelClerx
Copy link
Contributor

https://chaste.cs.ox.ac.uk/trac/ticket/2528

While you can write code to pass data as a protocol input, it would be nice if data files could be referenced directly in a protocol. Both Python and C++ can easily support reading CSV as 1 or 2-d arrays; the Python implementation could also support HDF5. For COMBINE friendliness, we might consider ​NuML too. What would we need to specify beyond the file name? For something like HDF5 that can store multiple n-d arrays, we might need to specify a path within the file. Would we need to allow describing the dimensions with units too? Does there need to be a separate data set section of the protocol, or just reference inline (e.g. my_data = data("file_path.h5", "/group/dataset"))? If the former, where do these fit in the name resolution graph?

Possible examples:

data sets {
csv_data = "file.csv"
hdf5_data = "file.h5" "/group/dataset"
}

In writing this, I'm leaning towards the inline format with a new reserved function data, like we have for map, but welcome feedback.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant