Add a fast_matrix_mul_4x4 function with SIMD optimization for the LoongArch64. #19959

KatyushaScarlet · 2025-02-08T12:20:48Z

Add the function fast_matrix_mul_4x4_lsx for LoongArch64.
Add the CFLAGS -mlsx and -mlasx for LoongArch64, which enable gcc to build with LSX/LASX (128/256bit SIMD extension for loongson CPU) instructions.

Here is the Unofficial LoongArch Intrinsics Guide: https://jia.je/unofficial-loongarch-intrinsics-guide/migrating_sse/

hrydgard

Ideally CrossSIMD.h needs a Loongarch impl too, in the future some more of the SIMD code of the app will be migrated to use that.

hrydgard · 2025-02-10T17:14:30Z

Common/Math/fast/fast_matrix.c

+
+static __m128 __lsx_vreplfr2vr_s(float val)
+{
+    FloatInt tmpval = {.f = val};


Nowadays it's more standard to use memcpy for this, compilers optimize it down properly. This usage of unions is UB. However, in practice it does work, so I'll allow it.

Add fast_matrix_mul_4x4_lsx function for LoongArch64

66f5ac9

KatyushaScarlet force-pushed the dev-loongarch64 branch from a9c7fe3 to 66f5ac9 Compare February 10, 2025 08:21

hrydgard approved these changes Feb 10, 2025

View reviewed changes

hrydgard merged commit 6ec74f5 into hrydgard:master Feb 10, 2025
19 checks passed

hrydgard added this to the v1.19.0 milestone Feb 10, 2025

hrydgard added the LoongArch64 label Feb 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a fast_matrix_mul_4x4 function with SIMD optimization for the LoongArch64. #19959

Add a fast_matrix_mul_4x4 function with SIMD optimization for the LoongArch64. #19959

KatyushaScarlet commented Feb 8, 2025 •

edited

Loading

hrydgard left a comment

hrydgard Feb 10, 2025

Add a fast_matrix_mul_4x4 function with SIMD optimization for the LoongArch64. #19959

Add a fast_matrix_mul_4x4 function with SIMD optimization for the LoongArch64. #19959

Conversation

KatyushaScarlet commented Feb 8, 2025 • edited Loading

hrydgard left a comment

Choose a reason for hiding this comment

hrydgard Feb 10, 2025

Choose a reason for hiding this comment

KatyushaScarlet commented Feb 8, 2025 •

edited

Loading