How do I evaluate the performance of a .pte file after Executorch quantization?
#14988
Unanswered
cupid20103
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello everyone!
I'm currently working on a
react-nativeproject and want to integrate the Hugging Face model (awav2vec2model) into my app.As you know, to use the large model on the mobile environment, the model needs to be quantized.
So, I used
executorchto quantize, and it generated a.ptefile.For now, I need to evaluate the performance (PER, FER, RTF, Peak RAM usage) of this file.
How to solve this problem?
Beta Was this translation helpful? Give feedback.
All reactions