CCCL 3.2 (and 3.1) benchmark on B200 #6728
Unanswered
bernhardmgruber
asked this question in
Show and tell
Replies: 1 comment
-
|
A few things I noticed that could be improved:
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I figured we could use an up-to-date benchmark of CUB and Thrust, so I ran a few batch jobs on a B200. The commit c5a7406 is just before we branched 3.2, so if there are no perf related backports, this will correspond to CCCL 3.2:
CUB:

Thrust:

Benchmark SQLite database (on CCCL team drive):
https://drive.google.com/drive/folders/1eaRnrsz7gEmX386rgI0L_boRJIejNGDk
The benchmark was run using
benchmarks/scripts/submit_benchmark_job.sh, which runsrun.py. The plot is made withsol.py(ran twice with a regex to split CUB and Thrust).CUB 3.1 (blue) vs CUB 3.2 (orange):

Beta Was this translation helpful? Give feedback.
All reactions