GPU benchmarks for exllamav2 #744
Unanswered
volodymyr-barannik
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello.
There is a GPU benchmark ran on llama.cpp.
My question is whether we can safely translate these results to exllamav2? Or does exllamav2 have some features which make some GPUs perform better or worse than in that llama.cpp benchmark?
I am most interested in 2x3090/4090 vs A100.
Thank you!
P.S. if you benchmarked exllamav2 on several GPUs it would be really nice if you could share the results..
Beta Was this translation helpful? Give feedback.
All reactions