[Feature]: Support Eagle Draft Model with different number of KV heads #22432

### 🚀 The feature, motivation and pitch

The current EAGLE implementation raises `NotImplementedError` (https://github.com/vllm-project/vllm/blob/8e8e0b6af189d262bcfdaef6c0cfb94772e86b0b/vllm/v1/core/kv_cache_utils.py#L1098) when the draft model has a different `num_kv_heads` than the target model, because the two models then end up with different `page_size_bytes` values (https://github.com/vllm-project/vllm/blob/main/vllm/v1/kv_cache_interface.py#L68).
In practice, draft models can be trained with a different `num_kv_heads` than the target model. Would it be possible to support this case, or would it require significant architectural changes?
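
To make the mismatch concrete, here is a minimal sketch of the arithmetic. It assumes the page size is computed as `2 * block_size * num_kv_heads * head_size * dtype_size` (K and V for every token in a block), which is my reading of the linked property. `ToyAttentionSpec` and `check_uniform_page_size` are hypothetical names used only for illustration, not vLLM APIs, and the head counts are made-up example values.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyAttentionSpec:
    """Hypothetical stand-in for a per-layer KV cache spec (not vLLM's class)."""

    block_size: int    # tokens per KV cache block
    num_kv_heads: int  # number of key/value heads in the layer
    head_size: int     # dimension of each head
    dtype_bytes: int   # bytes per element, e.g. 2 for fp16/bf16

    @property
    def page_size_bytes(self) -> int:
        # Assumed formula: factor 2 covers both K and V for each token.
        return (2 * self.block_size * self.num_kv_heads
                * self.head_size * self.dtype_bytes)


def check_uniform_page_size(specs: dict) -> int:
    """Return the shared page size, or raise if layers disagree."""
    sizes = {name: spec.page_size_bytes for name, spec in specs.items()}
    if len(set(sizes.values())) > 1:
        raise NotImplementedError(f"layers have different page sizes: {sizes}")
    return next(iter(sizes.values()))


# Example values only: target with 8 KV heads, draft with 4 KV heads.
target = ToyAttentionSpec(block_size=16, num_kv_heads=8, head_size=128, dtype_bytes=2)
draft = ToyAttentionSpec(block_size=16, num_kv_heads=4, head_size=128, dtype_bytes=2)

try:
    check_uniform_page_size({"target.layer0": target, "draft.layer0": draft})
except NotImplementedError as exc:
    print(f"NotImplementedError: {exc}")
```

With these example values the target layer needs 65536 bytes per page and the draft layer 32768, so a check that expects one uniform page size across all layers rejects the combination.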

### Alternatives

_No response_

### Additional context

_No response_

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.
