[MLAS/CPU EP]: Introduce a backend kernel selector config in MLAS #27136
Open: hariharans29 wants to merge 64 commits into main from hari/kleidiai_opt_out
+1,095
−531
Commits
2a2ce5e
Trial
hariharans29 e7d9b70
Trial 2
hariharans29 b117de5
Trial 3
hariharans29 1fb9ceb
Trial 4
hariharans29 2519272
Trial 4 fix
hariharans29 29e3740
Trial 5
hariharans29 bc6528a
Update onnxruntime/contrib_ops/cpu/attnlstm/bahdanau_attention.cc
hariharans29 b9b9773
Update onnxruntime/contrib_ops/cpu/attnlstm/deep_cpu_attn_lstm.h
hariharans29 007f3be
Update onnxruntime/contrib_ops/cpu/bert/attention.cc
hariharans29 c13c293
Update onnxruntime/contrib_ops/cpu/bert/attention_base.h
hariharans29 6ebb592
Update onnxruntime/contrib_ops/cpu/bert/gqa_attention_base.h
hariharans29 615b31e
Update onnxruntime/contrib_ops/cpu/word_conv_embedding.h
hariharans29 f17ef4d
Update onnxruntime/contrib_ops/cpu/transformers/generation_device_hel…
hariharans29 8c15dac
Update onnxruntime/contrib_ops/cpu/sparse/sparse_attention_base.h
hariharans29 e0240c2
Update onnxruntime/contrib_ops/cpu/transformers/sampling_cpu_helper.h
hariharans29 990f185
Update onnxruntime/contrib_ops/cpu/word_conv_embedding.h
hariharans29 d03c491
Update onnxruntime/contrib_ops/cpu/quantization/matmul_bnb4.cc
hariharans29 a475a66
Copilot comments
hariharans29 c2d0d2a
Update onnxruntime/contrib_ops/cpu/bert/attention.cc
hariharans29 14994d9
Silence some build failures
hariharans29 93bd45a
Build failures
hariharans29 840766d
Fix builds
hariharans29 e0affbb
Fix builds
hariharans29 9329b0d
Fix builds
hariharans29 bb4e57b
Update onnxruntime/contrib_ops/cpu/cdist.h
hariharans29 f32c0a9
Update onnxruntime/contrib_ops/cpu/cdist.h
hariharans29 fda3cb0
Fix builds
hariharans29 9579426
Merge branch 'hari/kleidiai_opt_out' of https://github.com/microsoft/…
hariharans29 e04d660
Fix builds
hariharans29 ba87f69
Fix builds
hariharans29 fca4503
Fix CUDA builds
hariharans29 7bd8b6f
Fix cuda builds
hariharans29 d37b146
Fix builds
hariharans29 b68029c
Fix builds
hariharans29 e398270
Fix builds
hariharans29 1a8b523
Fix builds
hariharans29 ed1dc8a
Fix builds
hariharans29 03a3eb2
Update onnxruntime/contrib_ops/cpu/nchwc_ops.h
hariharans29 b325dc9
Update onnxruntime/contrib_ops/cpu/transformers/generation_device_hel…
hariharans29 57d20a0
Update onnxruntime/contrib_ops/cpu/transformers/sampling_cpu_helper.h
hariharans29 0a98bb4
Update onnxruntime/contrib_ops/cpu/transformers/generation_device_hel…
hariharans29 1071268
Fix builds
hariharans29 961c6bb
Merge branch 'hari/kleidiai_opt_out' of https://github.com/microsoft/…
hariharans29 243bb02
Update onnxruntime/contrib_ops/cpu/transformers/generation_device_hel…
hariharans29 d9e6e6d
Update onnxruntime/contrib_ops/cpu/transformers/generation_device_hel…
hariharans29 d3a862e
Fix builds
hariharans29 8002f01
Merge branch 'hari/kleidiai_opt_out' of https://github.com/microsoft/…
hariharans29 14277c1
Copilot comments
hariharans29 db7d4a8
Fix builds
hariharans29 5c9f593
Fix some TODOs
hariharans29 1dc6525
Merge remote-tracking branch 'origin' into hari/kleidiai_opt_out
hariharans29 3bcfaa1
Format changes
hariharans29 c6cc01b
More fixes
hariharans29 e4438e0
More fixes
hariharans29 69c0931
More fixes
hariharans29 1c0606e
Fix training builds
hariharans29 3b0b3ea
More fixes
hariharans29 485ce6e
Plumb through kernel selector logic to MLAS from QNBitGemm + Copilot …
hariharans29 fa823e8
More QNBitGemm fixes
hariharans29 3d2dc08
More fixes
hariharans29 f75d9f6
More fixes
hariharans29 66cbce8
Test fixes
hariharans29 035bca2
Fix
hariharans29 bc43948
Fix
hariharans29