I am thinking about how to implement PD (prefill/decode) separation, but I am a bit confused about how the KV cache is transmitted between the two stages.
Answered by
zhyncs
Oct 18, 2024
You can refer to the implementation described in the Mooncake paper, which uses a distributed KV cache pool and RDMA for inter-node transmission.
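To make the data flow concrete, here is a minimal single-process sketch of the idea. It is not Mooncake's API: `KVCachePool`, `prefill`, and `decode` are hypothetical names, a plain dict stands in for the distributed pool, and the RDMA transfer is reduced to a `put`/`get` pair. The "model" is a toy that only illustrates which side produces the KV cache and which side consumes it.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

# One toy (key, value) pair per token; real KV caches are per-layer tensors.
KVEntry = Tuple[int, int]


@dataclass
class KVCachePool:
    """Hypothetical stand-in for a distributed KV cache pool.

    In a real deployment the blocks would live in memory spread across
    nodes and be moved with RDMA; here a dict plays that role.
    """
    blocks: Dict[str, List[KVEntry]] = field(default_factory=dict)

    def put(self, request_id: str, kv: List[KVEntry]) -> None:
        # Assumption: stands in for the prefill node writing KV blocks out.
        self.blocks[request_id] = kv

    def get(self, request_id: str) -> List[KVEntry]:
        # Assumption: stands in for the decode node pulling KV blocks in.
        return self.blocks.pop(request_id)


def prefill(pool: KVCachePool, request_id: str, prompt_tokens: List[int]) -> None:
    """Prefill side: process the whole prompt once and publish its KV cache."""
    kv = [(tok, tok * 2) for tok in prompt_tokens]  # toy KV computation
    pool.put(request_id, kv)


def decode(pool: KVCachePool, request_id: str, max_new_tokens: int) -> List[int]:
    """Decode side: fetch the KV cache, then generate token by token."""
    kv = pool.get(request_id)
    generated: List[int] = []
    for _ in range(max_new_tokens):
        next_token = sum(v for _, v in kv) % 100  # toy next-token rule
        generated.append(next_token)
        kv.append((next_token, next_token * 2))   # extend the cache locally
    return generated


if __name__ == "__main__":
    pool = KVCachePool()
    prefill(pool, "req-1", [1, 2, 3])   # would run on a prefill node
    print(decode(pool, "req-1", 2))     # would run on a decode node
```

The point of the split is that the decode side never recomputes the prompt: it only needs the request ID to locate the cache that prefill produced, so the transfer layer (RDMA in Mooncake's case) is the only coupling between the two node types.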
Answer selected by
zhyncs