add support for Apple M1 and other arm64+Neon architectures #1

cskiraly · 2022-05-17T11:09:27Z

There was already some NEON support, through a separate code
path. This version relies on the sse2neon library to add
NEON support directly without a separate code path.

Signed-off-by: Csaba Kiraly [email protected]

There was already some NEON support, through a separate code path. This version relies on the sse2neon library to add Neon support Signed-off-by: Csaba Kiraly <[email protected]>

cskiraly · 2022-05-17T13:27:01Z

Just linking original issue upstream to help those looking at it.
catid#18

cskiraly · 2022-05-18T12:09:04Z

Tested on:

Apple M1 MacOS:

Parameters: [original count=1000] [recovery count=100] [buffer bytes=64000] [loss count=100] [random seed=2]
Leopard Encoder(64 MB in 1000 pieces, 100 losses): Input=3136.18 MB/s, Output=313.618 MB/s
Leopard Decoder(64 MB in 1000 pieces, 100 losses): Input=708.302 MB/s, Output=70.8302 MB/s

Apple M1 under Parallels Linux arm64 VM (4 cores, openMP):

Parameters: [original count=1000] [recovery count=100] [buffer bytes=64000] [loss count=100] [random seed=2]
Leopard Encoder(64 MB in 1000 pieces, 100 losses): Input=5281.4 MB/s, Output=528.14 MB/s
Leopard Decoder(64 MB in 1000 pieces, 100 losses): Input=901.332 MB/s, Output=90.1332 MB/s

Raspberry Pi 3:

Parameters: [original count=1000] [recovery count=100] [buffer bytes=64000] [loss count=100] [random seed=2]
Leopard Encoder(64 MB in 1000 pieces, 100 losses): Input=136.62 MB/s, Output=13.662 MB/s
Leopard Decoder(64 MB in 1000 pieces, 100 losses): Input=27.0177 MB/s, Output=2.70177 MB/s

RK3328 (Rock64 SBC, 4 x Cortex-A53):

Parameters: [original count=1000] [recovery count=100] [buffer bytes=64000] [loss count=100] [random seed=2]
Leopard Encoder(64 MB in 1000 pieces, 100 losses): Input=202.632 MB/s, Output=20.2632 MB/s
Leopard Decoder(64 MB in 1000 pieces, 100 losses): Input=34.2045 MB/s, Output=3.42045 MB/s

LeopardCommon.h

liamsi · 2022-06-02T21:57:21Z

@cskiraly when you tested this on an Apple M1 under MacOS, did you just run cmake and then make using the generated Makefile? When I do this on my machine (m1 max with Monterey), I run into Undefined symbols for architecture arm64 🤔

cskiraly · 2022-06-04T12:40:47Z

@liamsi I was compiling it through our wrapper in https://github.com/status-im/nim-leopard
which disables openMP. It actually does
cmake .. -DCMAKE_BUILD_TYPE=Release -DENABLE_OPENMP=off

-DENABLE_OPENMP=off should work for you.

The undefined symbols I see when enabling openMP are omp symbols. Either there is something wrong with the linker args, or the omp installed by brew is not arm64, I have yet to check. We do have some related discussion at https://github.com/status-im/nim-leopard#openmp

liamsi · 2022-06-07T12:59:12Z

Thanks, -DENABLE_OPENMP=off does the trick. I made sure to use llvm/clang installed via brew but I still do see errors (this time related to std::). I'll continue playing around with different flags and see if I can get it compile with all optimizations. Will let you know.

As per comment status-im/leopard#1 (comment)

add support for Mac M1, maybe other arm64+neon as well

a16be41

There was already some NEON support, through a separate code path. This version relies on the sse2neon library to add Neon support Signed-off-by: Csaba Kiraly <[email protected]>

cskiraly changed the title ~~add support for Mac M1, maybe other arm64+neon as well~~ add support for Apple M1 and other arm64+Neon architectures May 18, 2022

cskiraly requested a review from a team May 18, 2022 12:53

michaelsbradleyjr approved these changes May 18, 2022

View reviewed changes

LeopardCommon.h Show resolved Hide resolved

cskiraly mentioned this pull request May 18, 2022

enable build on Mac M1 and other arm64+Neon architectures codex-storage/nim-codex#100

Merged

dryajov approved these changes May 18, 2022

View reviewed changes

cskiraly mentioned this pull request May 18, 2022

Add Apple M1 and generic ARM64+Neon support codex-storage/nim-leopard#5

Merged

cskiraly merged commit 20eb7c8 into status-im:master May 19, 2022

kirbyquerby mentioned this pull request Jun 1, 2022

add support for Mac M1, maybe other arm64+neon as well celestiaorg/leopard#4

Merged

liamsi mentioned this pull request Jun 15, 2022

add support for Mac M1, maybe other arm64+neon as well catid/leopard#19

Merged

emizzle added a commit to codex-storage/nim-leopard that referenced this pull request Jun 12, 2024

change version to 1.6.20 and stable, and disable openmp on macos

957d976

As per comment status-im/leopard#1 (comment)

emizzle mentioned this pull request Jun 13, 2024

Use nim v1.6+ codex-storage/nim-leopard#12

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add support for Apple M1 and other arm64+Neon architectures #1

add support for Apple M1 and other arm64+Neon architectures #1

cskiraly commented May 17, 2022 •

edited

Loading

cskiraly commented May 17, 2022

cskiraly commented May 18, 2022

liamsi commented Jun 2, 2022

cskiraly commented Jun 4, 2022

liamsi commented Jun 7, 2022

add support for Apple M1 and other arm64+Neon architectures #1

add support for Apple M1 and other arm64+Neon architectures #1

Conversation

cskiraly commented May 17, 2022 • edited Loading

cskiraly commented May 17, 2022

cskiraly commented May 18, 2022

liamsi commented Jun 2, 2022

cskiraly commented Jun 4, 2022

liamsi commented Jun 7, 2022

cskiraly commented May 17, 2022 •

edited

Loading