[Backport to 3.1] Use cp_async_bulk(space_shared,...) when available (#6362)
#7102
+20
−9
cp_async_bulk(space_shared,...) when available (#6362)
#7102