[Feature] Support AVX2 on linux #192

lshAlgorithm · 2025-05-13T17:55:33Z

Add AVX2 support on Linux.
Main change: In rwkv_operators_wkv_v7.inc, the change is straightforward: accumulate the partial sums in a vector inside the loop, then reduce the vector to a scalar sum outside the loop using horizontal_sum.
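For reviewers unfamiliar with the pattern, here is a minimal sketch of "accumulate in a vector, reduce once after the loop". It is not the code in this PR: `dot_sketch` and `horizontal_sum_sketch` are hypothetical names, and the dot product is only a stand-in for the real inner loop. The `__AVX2__` guard is an assumption so that the snippet still compiles (as plain scalar code) when AVX2 is not enabled.

```cpp
#include <cstddef>
#if defined(__AVX2__)
#include <immintrin.h>

// Hypothetical stand-in for horizontal_sum: adds the eight float lanes of v.
static inline float horizontal_sum_sketch(__m256 v) {
    __m128 lo  = _mm256_castps256_ps128(v);            // lanes 0..3
    __m128 hi  = _mm256_extractf128_ps(v, 1);          // lanes 4..7
    __m128 sum = _mm_add_ps(lo, hi);                   // 4 partial sums
    sum = _mm_add_ps(sum, _mm_movehl_ps(sum, sum));    // 2 partial sums in lanes 0 and 1
    sum = _mm_add_ss(sum, _mm_shuffle_ps(sum, sum, 0x55)); // lane 0 += lane 1
    return _mm_cvtss_f32(sum);
}
#endif

// Dot product of a and b (n floats each), illustrative only.
static float dot_sketch(const float * a, const float * b, std::size_t n) {
    std::size_t i = 0;
    float sum = 0.0f;
#if defined(__AVX2__)
    __m256 acc = _mm256_setzero_ps();
    for (; i + 8 <= n; i += 8) {
        __m256 va = _mm256_loadu_ps(a + i);            // unaligned loads
        __m256 vb = _mm256_loadu_ps(b + i);
        acc = _mm256_add_ps(acc, _mm256_mul_ps(va, vb)); // stay in the vector
    }
    sum = horizontal_sum_sketch(acc);                  // single reduction after the loop
#endif
    for (; i < n; ++i) {                               // scalar tail (or whole loop without AVX2)
        sum += a[i] * b[i];
    }
    return sum;
}
```

The point of the design is that the comparatively expensive cross-lane reduction runs once per output value rather than once per loop iteration. Note that the vector version adds the terms in a different order than a scalar loop, so results can differ in the last bits for non-integer data.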
Reproduce: I've added the compile option, so just compile with ./script.sh and run generate_completion.py following the README.
Remaining issues: Because the reduction of an AVX vector is intrinsics-dependent, and my PC doesn't support AVX-512, let alone ARM, the code supports AVX2 ONLY. It also seems that ggml doesn't fully support AVX2 and doesn't include <immintrin.h>. (Maybe I've got this wrong; I just added the header in the code.)
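To illustrate why the reduction is intrinsics-dependent, the sketch below shows how the same "sum a float array" step might look per instruction set. It is not part of this PR. `sum_floats_sketch` is a hypothetical name, the AVX-512 and NEON branches are assumptions I could not test on my machine, and the preprocessor guards are just one possible way to pick a path at compile time.

```cpp
#include <cstddef>
#if defined(__AVX512F__) || defined(__AVX2__)
#include <immintrin.h>
#elif defined(__ARM_NEON) && defined(__aarch64__)
#include <arm_neon.h>
#endif

// Sums n floats; each branch needs its own accumulate and reduce intrinsics.
static float sum_floats_sketch(const float * x, std::size_t n) {
    std::size_t i = 0;
    float total = 0.0f;
#if defined(__AVX512F__)
    // Untested assumption: 16 lanes, compiler-provided reduction helper.
    __m512 acc = _mm512_setzero_ps();
    for (; i + 16 <= n; i += 16) acc = _mm512_add_ps(acc, _mm512_loadu_ps(x + i));
    total = _mm512_reduce_add_ps(acc);
#elif defined(__AVX2__)
    // 8 lanes: fold the upper half onto the lower half, then two horizontal adds.
    __m256 acc = _mm256_setzero_ps();
    for (; i + 8 <= n; i += 8) acc = _mm256_add_ps(acc, _mm256_loadu_ps(x + i));
    __m128 s = _mm_add_ps(_mm256_castps256_ps128(acc), _mm256_extractf128_ps(acc, 1));
    s = _mm_hadd_ps(s, s);
    s = _mm_hadd_ps(s, s);
    total = _mm_cvtss_f32(s);
#elif defined(__ARM_NEON) && defined(__aarch64__)
    // Untested assumption: 4 lanes, AArch64 across-vector add.
    float32x4_t acc = vdupq_n_f32(0.0f);
    for (; i + 4 <= n; i += 4) acc = vaddq_f32(acc, vld1q_f32(x + i));
    total = vaddvq_f32(acc);
#endif
    for (; i < n; ++i) total += x[i];   // scalar tail, or the whole loop on other targets
    return total;
}
```

Only the AVX2 branch corresponds to what this PR targets; the other branches are there to show what would have to change, not to claim they work.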
Outcome: Tested on Fedora 40 with an Intel(R) Core(TM) Ultra 9 185H (no GPU). There is a slight speedup with rwkv-7-1.5B, fp16, and likewise with rwkv-7-2.9B, fp16. (To be honest, it is only faster on the order of milliseconds, and my PC is not that stable, so I cannot show an accurate speedup...)
Statement: Sorry for the stupid commit messages; they were meant for private use days before. I don't have much experience with PRs, but I really want to contribute to the community. Suggestions are welcome.

Signed-off-by: lshAlgorithm <[email protected]>

lshAlgorithm added 3 commits May 14, 2025 02:01

FINISHED!

6e89c96

Signed-off-by: lshAlgorithm <[email protected]>

change format

493682d

Signed-off-by: lshAlgorithm <[email protected]>

vectorization on sum of sa

1c50bb0

Signed-off-by: lshAlgorithm <[email protected]>

lshAlgorithm force-pushed the avx2 branch from e64ce83 to 1c50bb0 Compare May 13, 2025 18:01

add comments

97a51fc

Signed-off-by: lshAlgorithm <[email protected]>
