Skip to content

Commit

Permalink
Bug fix
Browse files Browse the repository at this point in the history
  • Loading branch information
satyajandhyala committed Nov 23, 2024
1 parent 60af2f5 commit 47e6f52
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion onnxruntime/contrib_ops/webgpu/bert/attention.cc
Original file line number Diff line number Diff line change
Expand Up @@ -462,7 +462,7 @@ Status ComputeVxAttentionScore(onnxruntime::webgpu::ComputeContext& context, int
{static_cast<uint32_t>(parameters.v_head_size_)},
{static_cast<uint32_t>(parameters.num_heads_)},
{static_cast<uint32_t>(parameters.head_size_)},
{static_cast<uint32_t>(parameters.v_hidden_size_)},
{static_cast<uint32_t>(parameters.v_hidden_size_ * parameters.n_reps)},
{static_cast<uint32_t>(past_sequence_length)},
{static_cast<uint32_t>(parameters.kv_sequence_length_)},
{static_cast<uint32_t>(seqlen_k == nullptr ? total_sequence_length : parameters.seqlen_present_kv_cache_)},
Expand Down

0 comments on commit 47e6f52

Please sign in to comment.