Add prefetching kernel as new fallback for cub::DeviceTransform
#2396
+224
−35