How does RadixAttention implement multi-head/multi-query/grouped-query attention? #652

Closed Answered by merrymercy
Griffintaur asked this question in Q&A

All of them are supported without any specific modification.
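
For readers wondering why one code path can cover all three variants: multi-head, grouped-query, and multi-query attention differ only in how many key/value heads are shared across the query heads. The sketch below is a plain NumPy illustration of that idea, not SGLang's actual kernels; the function name `grouped_attention` and the tensor layout are assumptions made for this example.

```python
import numpy as np

def grouped_attention(q, k, v):
    """Causal attention for MHA, GQA, and MQA (illustrative only).

    q:    (num_q_heads,  seq_len, head_dim)
    k, v: (num_kv_heads, seq_len, head_dim)
    num_kv_heads == num_q_heads -> multi-head attention
    1 < num_kv_heads < num_q_heads -> grouped-query attention
    num_kv_heads == 1 -> multi-query attention
    """
    num_q_heads, seq_len, head_dim = q.shape
    num_kv_heads = k.shape[0]
    assert num_q_heads % num_kv_heads == 0, "query heads must split evenly into groups"
    group_size = num_q_heads // num_kv_heads

    # Share each KV head across its group of consecutive query heads.
    k = np.repeat(k, group_size, axis=0)
    v = np.repeat(v, group_size, axis=0)

    # Scaled dot-product scores with a causal mask.
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(head_dim)
    causal = np.tril(np.ones((seq_len, seq_len), dtype=bool))
    scores = np.where(causal, scores, -np.inf)

    # Numerically stable softmax over the key axis.
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v
```

Calling it with 8 query heads and 8, 2, or 1 KV heads exercises MHA, GQA, and MQA respectively; the output shape is `(8, seq_len, head_dim)` in every case, so the caller does not need to know which variant the model uses.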

Answer selected by merrymercy

This discussion was converted from issue #402 on July 18, 2024 16:29.