[Speculative Decoding] Support draft model on different tensor-parallel size than target model #11405

Annotations

2 errors

This job was cancelled