You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
At the time of three-way partition implementation, it was faster to perform two look-backs of I32 counter to figure out the number of items selected by op_1, and op_2 compared to performing one I64 look-back of a counter pair. Since the look-back was recently improved, we should revisit this decision.
Describe the solution you'd like
We should try combining two look-backs into a single one, tune it and compare resulting performance of three-way partition.
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered:
Is this a duplicate?
Area
CUB
Is your feature request related to a problem? Please describe.
At the time of three-way partition implementation, it was faster to perform two look-backs of I32 counter to figure out the number of items selected by
op_1
, andop_2
compared to performing one I64 look-back of a counter pair. Since the look-back was recently improved, we should revisit this decision.Describe the solution you'd like
We should try combining two look-backs into a single one, tune it and compare resulting performance of three-way partition.
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: