-
Notifications
You must be signed in to change notification settings - Fork 88
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chunked reading of files #34
Comments
I will look into this some more. When orthorectifying, the geometry lookup table (GLT) included in the location group is used to reshape the data array to fit the GLT dimensions. I think to do that on chunks would require modification of the GLT. My understanding is that chunking along the bands dimension would likely slow down operations focused on imaging spectroscopy since that would require more reads. Maybe @pgbrodrick has some insight? At minimum we can at least add notes about memory requirements. |
Yes, I tried fiddling with the orthorectification but could not get it to work with I think this is worthwhile because the recommended workflow is to perform data manipulations and biophysical modelling on the unorthorectified data, and later orthorectify the variables produced downstream. This works for me because the output variables will typically have far fewer variables than the image cube has bands (I go from 250ish bands to 4 endmembers). It then becomes much more feasible to load the data into memory before orthorectifying. My guess is that this workflow is common enough to warrant accommodating it by specifying a chucking option if |
Currently reading files using
emit_xarray
from emit_tools.py reads into and.array
backedxr.dataset
. An option to read into a chunkeddask.array
backedxr.dataset
would help prevent out-of-memory errors when reading on machines with limited memory (loading failed on an 8GB SMCE machine) and potentially speed up operations on downstream operations using dask.Adding
chunks='auto'
toEMIT-Data-Resources/python/modules/emit_tools.py
Line 56 in 6958963
ortho=False
but not forortho=True
The text was updated successfully, but these errors were encountered: