Skip to content

[Speculative Decoding] Support draft model on different tensor-parallel size than target model #6104

[Speculative Decoding] Support draft model on different tensor-parallel size than target model

[Speculative Decoding] Support draft model on different tensor-parallel size than target model #6104

Annotations

2 warnings

The logs for this run have expired and are no longer available.