-
The 1070 has faster FP32 (single-precision) floating-point performance than FP16 (half-precision) performance. By using the [...] I'm in the same boat with my 1080 Ti, so I understand what you are experiencing. More modern cards have been designed with greater FP16 performance. This wiki table also shows the legacy performance of the 10 Series cards: Hard to believe it's only been five years. 😃
-
Hi, this fork is great for small VRAM: it uses only 2.4 GB, or 3.4 GB with turbo. But it's still slow compared with unoptimized forks such as hlky's: 49 sec here vs. 37 sec on the hlky fork, which takes 6.4 GB. So it would be cool to have a config option that enables the unoptimized path for people who have 8 GB or more of VRAM.
I used my GTX 1070 8GB here for testing.
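To put the trade-off in numbers, here is a rough sketch using only the figures quoted above. The variable names are made up for illustration, and I'm assuming the 49 sec and 37 sec timings are for the same prompt and settings, which the numbers alone don't confirm.

```python
# Figures quoted in this post (GTX 1070 8GB). Names are illustrative only.
this_fork_seconds = 49   # this fork (mode not specified in the timing)
hlky_seconds = 37        # hlky fork, unoptimized
card_vram_gb = 8.0

vram_gb = {
    "this fork": 2.4,
    "this fork (turbo)": 3.4,
    "hlky (unoptimized)": 6.4,
}

# How much longer this fork takes relative to the hlky fork.
slowdown = this_fork_seconds / hlky_seconds
extra_time_pct = (slowdown - 1) * 100
print(f"this fork takes {slowdown:.2f}x as long ({extra_time_pct:.0f}% more time)")

# Share of an 8 GB card each configuration occupies.
vram_share = {name: gb / card_vram_gb for name, gb in vram_gb.items()}
for name, share in vram_share.items():
    print(f"{name}: {vram_gb[name]} GB = {share:.0%} of {card_vram_gb:.0f} GB")
```

So on an 8 GB card the unoptimized path still fits, and that's the case where an opt-in switch would help.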
Thanks!