-
Notifications
You must be signed in to change notification settings - Fork 61
Use dask array native reduction #864
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Yes, we were being a bit extra around being scale-aware here. I figured if I wanted tree-reduce across arrays, I'd need to write out the graph, so I just did it for the arrays themselves too. I am surprised this just works, are these Session objects being automatically cast to object arrays of the right shape under the hood in |
|
There is no and there is no final concat step because the final reduction is also a Sorry for the typing mess and thanks for cleaning up |
|
sorry for the linting problems. I didn't setup pre-commit because I struggled with the uv setup, etc. I think it should be working |
|
ok, rust ci is now failing with... is this related? I'll try to reproduce. Main doesn't seem to eb affected |
|
@fjetter the PR is fine, this is an unrelated error. |
dcherian
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @fjetter!

This came up in dask/dask#11844 (comment)
I don't see a reason why you shouldn't be able to use the dask native
da.reduction. This way, dask owns the tree reduction logic and it will work as intended once we move to an expression backend.The only caveat with this is that the very final reduction / Session merge would happen locally. This should typically only be a very small number of sessions (one per dask.array). The only reason why this would be bad is if that merge would be expensive or would require IO but I believe this is just merging metadata, isn't it?
cc @dcherian