Do not preserve the sign in argument reduction if computing a Cos #4161
Merged
Comparison on Zen 3.

Before:
```
RAW TSC:                 min      1‰      1%      5%     10%     25%     50%
identity                4.94   +0.00   +0.00   +0.00   +0.38   +0.38   +0.38
sqrtps_xmm0_xmm0       16.72   +0.38   +0.38   +0.38   +0.38   +0.38   +0.38
mulsd_xmm0_xmm0         7.60   +0.00   +0.00   +0.00   +0.00   +0.00   +0.00
mulsd_xmm0_xmm0_4x     15.20   +0.00   +0.00   +0.00   +0.00   +0.00   +0.00
Slope: 1.186186 cycle/TSC
Correlation coefficient: 0.999887
Cycles:                 expected     min      1‰      1%      5%     10%     25%     50%
R identity                     0   -0.07   +0.00   +0.00   +0.00   +0.45   +0.45   +0.45
R mulsd_xmm0_xmm0              3    3.08   +0.00   +0.00   +0.00   +0.00   +0.00   +0.00
R mulsd_xmm0_xmm0_4x          12   12.10   +0.00   +0.00   +0.00   +0.00   +0.00   +0.00
  principia_cos                    65.73   +0.00   +0.00   +0.45   +0.45   +0.90   +0.90
  principia_sin                    64.83   +1.35   +1.80   +1.80   +1.80   +2.25   +2.25
R sqrtps_xmm0_xmm0            14   13.90   +0.45   +0.45   +0.45   +0.45   +0.45   +0.45
  std_cos                          55.37   +0.45   +0.90   +0.90   +0.90   +0.90   +0.90
  std_sin                          63.03   +0.00   +0.00   +0.00   +0.00   +0.00   +0.45
```

After:
```
RAW TSC:                 min      1‰      1%      5%     10%     25%     50%
identity                4.94   +0.00   +0.00   +0.00   +0.38   +0.38   +0.38
sqrtps_xmm0_xmm0       16.72   +0.38   +0.38   +0.38   +0.38   +0.38   +0.38
mulsd_xmm0_xmm0         7.60   +0.00   +0.00   +0.00   +0.00   +0.00   +0.00
mulsd_xmm0_xmm0_4x     15.20   +0.00   +0.00   +0.00   +0.00   +0.00   +0.00
Slope: 1.186186 cycle/TSC
Correlation coefficient: 0.999887
Cycles:                 expected     min      1‰      1%      5%     10%     25%     50%
R identity                     0   -0.07   +0.00   +0.00   +0.00   +0.45   +0.45   +0.45
R mulsd_xmm0_xmm0              3    3.08   +0.00   +0.00   +0.00   +0.00   +0.00   +0.00
R mulsd_xmm0_xmm0_4x          12   12.10   +0.00   +0.00   +0.00   +0.00   +0.00   +0.00
  principia_cos                    63.48   +3.61   +3.61   +4.06   +4.06   +4.06   +4.06
  principia_sin                    69.34   +0.00   +0.45   +0.45   +0.45   +0.45   +0.90
R sqrtps_xmm0_xmm0            14   13.90   +0.45   +0.45   +0.45   +0.45   +0.45   +0.45
  std_cos                          54.47   +0.45   +0.45   +0.45   +0.45   +0.90   +0.90
  std_sin                          63.03   +0.00   +0.00   +0.00   +0.00   +0.00   +0.45
```

Comparison on Golden Cove.

Before:
```
RAW TSC:                 min      1‰      1%      5%     10%     25%     50%
identity                6.74   +0.22   +0.24   +0.26   +0.28   +0.30   +0.32
sqrtps_xmm0_xmm0       26.94   +0.32   +0.34   +0.36   +0.38   +0.42   +0.46
mulsd_xmm0_xmm0        14.04   +0.14   +0.58   +0.62   +0.64   +0.66   +0.70
mulsd_xmm0_xmm0_4x     32.36   +0.10   +1.22   +1.28   +1.30   +1.34   +1.40
Slope: 0.622294 cycle/TSC
Correlation coefficient: 0.998924
Cycles:                 expected     min      1‰      1%      5%     10%     25%     50%
R identity                     0   -0.23   +0.11   +0.11   +0.12   +0.14   +0.15   +0.16
R mulsd_xmm0_xmm0              4    4.30   +0.06   +0.09   +0.36   +0.37   +0.39   +0.41
R mulsd_xmm0_xmm0_4x          16   15.64   +0.10   +0.19   +0.83   +0.85   +0.87   +0.91
  principia_cos                    69.40   +0.35   +0.54   +0.71   +0.80   +0.97   +1.17
  principia_sin                    70.49   +0.41   +0.58   +0.73   +0.82   +0.97   +1.15
R sqrtps_xmm0_xmm0            12   12.44   +0.06   +0.09   +0.10   +0.11   +0.12   +0.15
  std_cos                          55.23   +0.42   +0.52   +0.62   +0.68   +0.77   +0.88
  std_sin                          64.13   +0.12   +0.20   +0.26   +0.30   +0.37   +0.46
```

After:
```
RAW TSC:                 min      1‰      1%      5%     10%     25%     50%
identity                6.84   +0.14   +0.14   +0.16   +0.18   +0.20   +0.22
sqrtps_xmm0_xmm0       27.14   +0.12   +0.14   +0.16   +0.18   +0.22   +0.26
mulsd_xmm0_xmm0        14.16   +0.44   +0.48   +0.50   +0.52   +0.54   +0.56
mulsd_xmm0_xmm0_4x     32.42   +1.14   +1.18   +1.22   +1.24   +1.28   +1.34
Slope: 0.622041 cycle/TSC
Correlation coefficient: 0.998720
Cycles:                 expected     min      1‰      1%      5%     10%     25%     50%
R identity                     0   -0.26   +0.07   +0.07   +0.09   +0.10   +0.11   +0.12
R mulsd_xmm0_xmm0              4    4.28   +0.05   +0.30   +0.31   +0.32   +0.34   +0.35
R mulsd_xmm0_xmm0_4x          16   15.60   +0.76   +0.78   +0.80   +0.81   +0.83   +0.87
  principia_cos                    66.31   +0.47   +0.71   +0.95   +1.09   +1.34   +1.64
  principia_sin                    70.76   +0.37   +0.55   +0.71   +0.81   +0.96   +1.16
R sqrtps_xmm0_xmm0            12   12.37   +0.07   +0.09   +0.12   +0.15   +4.85   +4.89
  std_cos                          55.29   +0.31   +0.45   +0.57   +0.66   +0.88  +22.36
  std_sin                          64.01   +0.16   +0.24   +0.31   +0.36   +0.46   +1.02
```
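For readers who want to check how the `Cycles` tables relate to the `RAW TSC` tables: the reported slope and correlation coefficient appear consistent with an ordinary least-squares fit of the expected cycle counts of the `R` (reference) rows against their minimum TSC readings. That reading is an assumption inferred from the numbers above; the benchmark tool's source is not shown here. The sketch below uses the Zen 3 reference rows, and the function name `fit_tsc_to_cycles` is hypothetical.

```python
import math


def fit_tsc_to_cycles(points):
    """Least-squares fit of expected cycles (y) against minimum TSC (x).

    Returns (slope, intercept, correlation coefficient).
    """
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    syy = sum((y - mean_y) ** 2 for _, y in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r


# (minimum TSC, expected cycles) for the Zen 3 reference rows:
# identity, mulsd_xmm0_xmm0, mulsd_xmm0_xmm0_4x, sqrtps_xmm0_xmm0.
ZEN3_REFERENCE = [(4.94, 0), (7.60, 3), (15.20, 12), (16.72, 14)]

slope, intercept, r = fit_tsc_to_cycles(ZEN3_REFERENCE)
print(f"Slope: {slope:.6f} cycle/TSC")
print(f"Correlation coefficient: {r:.6f}")
# Convert one raw TSC reading to cycles with the fitted line.
print(f"mulsd_xmm0_xmm0: {slope * 7.60 + intercept:.2f} cycles")
```

Under this assumption, a row's `min` value in the `Cycles` table is `slope * TSC + intercept`, which is why the `identity` reference can come out slightly negative.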
eggrobin approved these changes on Jan 27, 2025