-
Notifications
You must be signed in to change notification settings - Fork 3.7k
[MLAS] Bf16 implementations of NCHWc Conv kernels #26980
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
e1c28f3
4a814a2
e2197cd
57c1c73
998267d
a560ac8
eb59c07
9a55be9
4113c91
06e05a7
f94a66a
7cb2b44
98af370
3b7dd2f
741df51
8f0e39c
5b82b17
c5a769d
a0a5bd5
7ea4926
abe4e21
a6b2e10
2e9bed9
8d76b80
6cbbd25
8eb4f5a
b63aac1
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change | ||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
@@ -567,9 +567,13 @@ Return Value: | |||||||||||||||||||||||||||||
|
|
||||||||||||||||||||||||||||||
| #if defined(MLAS_USE_ARM_NEON_NCHWC) | ||||||||||||||||||||||||||||||
| this->ConvNchwFloatKernel = MlasConvNchwFloatKernelNeon; | ||||||||||||||||||||||||||||||
| this->ConvNchwFloatKernel = MlasConvNchwBf16KernelNeon; | ||||||||||||||||||||||||||||||
| this->ConvNchwcFloatKernel = MlasConvNchwcFloatKernelNeon; | ||||||||||||||||||||||||||||||
| this->ConvNchwcFloatKernel = MlasConvNchwcBf16KernelNeon; | ||||||||||||||||||||||||||||||
| this->ConvDepthwiseFloatKernel = MlasConvDepthwiseFloatKernelNeon; | ||||||||||||||||||||||||||||||
| // this->ConvDepthwiseFloatKernel = MlasConvDepthwiseBf16KernelNeon; | ||||||||||||||||||||||||||||||
| this->ConvPointwiseFloatKernel = MlasConvPointwiseFloatKernelNeon; | ||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||
| this->ConvPointwiseFloatKernel = MlasConvPointwiseFloatKernelNeon; |
Copilot
AI
Jan 16, 2026
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Duplicate assignment to ConvNchwcFloatKernel. The second assignment overwrites the first. If the intent is to replace the float kernel with the BF16 kernel, remove the first assignment. Otherwise, clarify the initialization logic.
| this->ConvNchwFloatKernel = MlasConvNchwBf16KernelNeon; | |
| this->ConvNchwcFloatKernel = MlasConvNchwcFloatKernelNeon; | |
| this->ConvNchwcFloatKernel = MlasConvNchwcBf16KernelNeon; | |
| this->ConvDepthwiseFloatKernel = MlasConvDepthwiseFloatKernelNeon; | |
| // this->ConvDepthwiseFloatKernel = MlasConvDepthwiseBf16KernelNeon; | |
| this->ConvPointwiseFloatKernel = MlasConvPointwiseFloatKernelNeon; | |
| this->ConvPointwiseFloatKernel = MlasConvPointwiseBf16KernelNeon; | |
| // this->ConvNchwFloatKernel = MlasConvNchwBf16KernelNeon; | |
| this->ConvNchwcFloatKernel = MlasConvNchwcFloatKernelNeon; | |
| // this->ConvNchwcFloatKernel = MlasConvNchwcBf16KernelNeon; | |
| this->ConvDepthwiseFloatKernel = MlasConvDepthwiseFloatKernelNeon; | |
| // this->ConvDepthwiseFloatKernel = MlasConvDepthwiseBf16KernelNeon; | |
| this->ConvPointwiseFloatKernel = MlasConvPointwiseFloatKernelNeon; | |
| // this->ConvPointwiseFloatKernel = MlasConvPointwiseBf16KernelNeon; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Duplicate assignment to
ConvNchwFloatKernel. The second assignment overwrites the first. If the intent is to replace the float kernel with the BF16 kernel, remove the first assignment. Otherwise, clarify the initialization logic.