Skip to content

Conversation

@richardhuo-nv
Copy link
Contributor

@richardhuo-nv richardhuo-nv commented Aug 22, 2025

Overview:

The "two models" eagle solution became very unstable in Trtllm 1.0.0rc6 and TRTLLM team seems is not going to maintain it in a longer term.
Cherry-pick for #2661

Details:

Remove the ‘two-models config’ and set the ‘one-model’ solution as the default, since whether one or two models are used is purely an implementation detail, they are both just Eagle speculative decoding.

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

  • closes GitHub issue: #xxx

@copy-pr-bot
Copy link

copy-pr-bot bot commented Aug 22, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@github-actions github-actions bot added the fix label Aug 22, 2025
@richardhuo-nv richardhuo-nv marked this pull request as ready for review August 22, 2025 20:44
@richardhuo-nv richardhuo-nv force-pushed the rihuo/fix_eagle_config branch 3 times, most recently from 151b5b6 to b1cd2ae Compare August 22, 2025 20:48
@richardhuo-nv richardhuo-nv requested review from KrishnanPrash, rmccorm4 and tanmayv25 and removed request for KrishnanPrash and tanmayv25 August 22, 2025 20:48
@richardhuo-nv richardhuo-nv force-pushed the rihuo/fix_eagle_config branch 2 times, most recently from 0d168c5 to 91285c7 Compare August 22, 2025 21:03
fix name

Signed-off-by: richardhuo-nv <rihuo@nvidia.com>

sign
@richardhuo-nv richardhuo-nv force-pushed the rihuo/fix_eagle_config branch from 91285c7 to 6851193 Compare August 22, 2025 21:05
@dmitry-tokarev-nv dmitry-tokarev-nv merged commit 1a5f302 into release/0.4.1 Aug 22, 2025
13 of 14 checks passed
@dmitry-tokarev-nv dmitry-tokarev-nv deleted the rihuo/fix_eagle_config branch August 22, 2025 22:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants