Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WebGPU EP] Support GroupQueryAttention #22658
base: main
Are you sure you want to change the base?
[WebGPU EP] Support GroupQueryAttention #22658
Changes from all commits
0a5d212
5bfa070
e6615e9
449afb4
8d10472
4ea58d1
4bcf257
5c5c934
e716546
aba59e5
067ecd1
53f1c78
f4dc9fc
3d1af1c
2eaeebc
9c828cc
26caa06
64b093f
a8bd38b
d613df4
0fedb9f
993140b
7502493
5f1fdae
6d2bd68
82a005d
fd9409f
15c96b3
72601d1
65495b6
63f20ed
0102206
71ed10c
9c08c82
eb5d7b4
7a2d3b6
664022f
a48d782
4334b39
d53d7ef
5dc95c8
e448b1a
60af2f5
47e6f52
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
Large diffs are not rendered by default.
Check warning on line 16 in onnxruntime/contrib_ops/webgpu/bert/attention.h
GitHub Actions / Optional Lint C++
Check warning on line 92 in onnxruntime/contrib_ops/webgpu/bert/attention.h
GitHub Actions / Optional Lint C++
Check warning on line 18 in onnxruntime/contrib_ops/webgpu/bert/attention_common.h
GitHub Actions / Optional Lint C++
Check warning on line 42 in onnxruntime/contrib_ops/webgpu/bert/attention_common.h
GitHub Actions / Optional Lint C++