Is your feature request related to a problem? Please describe.
Much of the processing in the GPU aggregate operations is not as close to Spark's as it could be. It would be nice to make it more similar, both to reduce the possibility of bugs and to increase the possibility of code reuse.