Adding dynamic swap number and defragmenter warmup #183

ksmusz · 2025-09-16T15:12:25Z

Introducing dynamic swap buckets to the defragmenter, together with a defragmenter warmup.

Currently, at most 32 blocks can be swapped in one iteration of the defragmenter. This change introduces a bucketing system that selects the smallest swap bucket able to cover the number of blocks that actually need to be swapped in the current defragmenter iteration. Bucket sizes range from 8 swaps up to 512 swaps in a single defragmenter run.
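A minimal sketch of the bucket selection idea, not the code from this PR. Only the 8 and 512 endpoints come from the description; the intermediate power-of-two sizes, the names `SWAP_BUCKETS`, `pick_swap_bucket` and `pad_swaps`, and the use of `(0, 0)` as a no-op padding pair are assumptions made for illustration.

```python
import bisect

# Hypothetical bucket sizes: only the endpoints 8 and 512 are stated in the PR
# description; the power-of-two steps in between are an assumption.
SWAP_BUCKETS = [8, 16, 32, 64, 128, 256, 512]


def pick_swap_bucket(num_swaps, buckets=SWAP_BUCKETS):
    """Return the smallest bucket that covers num_swaps.

    Requests larger than the biggest bucket are capped at the biggest bucket;
    a non-positive request returns 0 (nothing to swap).
    """
    if num_swaps <= 0:
        return 0
    idx = bisect.bisect_left(buckets, num_swaps)
    return buckets[min(idx, len(buckets) - 1)]


def pad_swaps(swaps, bucket, pad=(0, 0)):
    """Truncate the swap list to the bucket size and pad it to a fixed length.

    The padding pair (0, 0) is assumed to be a harmless self-swap.
    """
    swaps = list(swaps)[:bucket]
    return swaps + [pad] * (bucket - len(swaps))


needed = [(3, 40), (7, 41), (9, 42)]
bucket = pick_swap_bucket(len(needed))
padded = pad_swaps(needed, bucket)
```

Under these assumptions, three pending swaps fall into the 8-swap bucket, and the padded list always has one of a small, fixed set of lengths.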

Because the defragmenter now has several possible swap bucket sizes instead of a single one, a defragmenter warmup has been added. With the warmup, no additional graph compilations connected to the defragmenter were observed during inference.
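The warmup idea can be sketched as follows, assuming each distinct swap-list length triggers one compilation on first use. `run_swap_kernel`, `warmup_defragmenter`, and the bucket list are hypothetical stand-ins, not the functions added by this PR; the stand-in only records which input lengths it has seen.

```python
# Tracks which input lengths have already been "compiled" by the stand-in kernel.
compiled_shapes = set()


def run_swap_kernel(src, dst):
    """Stand-in for the compiled swap operation.

    Returns True if this input length had not been seen before, which in this
    sketch models a fresh graph compilation.
    """
    assert len(src) == len(dst)
    is_new = len(src) not in compiled_shapes
    compiled_shapes.add(len(src))
    return is_new


def warmup_defragmenter(buckets):
    """Run the swap once per bucket size with dummy indices before serving."""
    for size in buckets:
        run_swap_kernel([0] * size, [0] * size)


# Hypothetical bucket sizes; only 8 and 512 are stated in the description.
warmup_defragmenter([8, 16, 32, 64, 128, 256, 512])

# After warmup, a request that lands in any bucket reuses an existing shape.
recompiled = run_swap_kernel([0] * 32, [0] * 32)
```

In this model `recompiled` is `False` for every bucket size after warmup, which mirrors the observation that no defragmenter-related compilations appeared during inference.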

Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>

ksmusz · 2025-09-18T10:17:13Z

/run-gaudi-tests

ksmusz · 2025-09-18T15:46:27Z

/run-gaudi-tests

ksmusz · 2025-09-19T08:22:33Z

/run-gaudi-tests

Introducing dynamic swap buckets to the defragmenter, together with a defragmenter warmup.

Currently, at most 32 blocks can be swapped in one iteration of the defragmenter. This change introduces a bucketing system that selects the smallest swap bucket able to cover the number of blocks that actually need to be swapped in the current defragmenter iteration. Bucket sizes range from 8 swaps up to 512 swaps in a single defragmenter run.

Because the defragmenter now has several possible swap bucket sizes instead of a single one, a defragmenter warmup has been added. With the warmup, no additional graph compilations connected to the defragmenter were observed during inference.

cherry-pick #183

Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>
Co-authored-by: Marcin Swiniarski <marcin.swiniarski@intel.com>

ksmusz · 2025-09-19T13:01:36Z

/run-gaudi-tests

ksmusz · 2025-09-22T07:49:57Z

/run-gaudi-tests

ksmusz · 2025-09-23T07:18:59Z

/run-gaudi-tests

ksmusz · 2025-09-23T13:15:57Z

/run-gaudi-tests

ksmusz · 2025-09-24T06:25:43Z

/run-gaudi-tests

ksmusz · 2025-09-24T07:09:54Z

/run-gaudi-tests

Introducing dynamic swap buckets to the defragmenter, together with a defragmenter warmup.

Currently, at most 32 blocks can be swapped in one iteration of the defragmenter. This change introduces a bucketing system that selects the smallest swap bucket able to cover the number of blocks that actually need to be swapped in the current defragmenter iteration. Bucket sizes range from 8 swaps up to 512 swaps in a single defragmenter run.

Because the defragmenter now has several possible swap bucket sizes instead of a single one, a defragmenter warmup has been added. With the warmup, no additional graph compilations connected to the defragmenter were observed during inference.

---------

Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>
Co-authored-by: Marcin Swiniarski <marcin.swiniarski@intel.com>
Signed-off-by: Iryna Boiko <iboiko@habana.ai>

ksmusz added 4 commits September 16, 2025 18:04

Adding dynamic swap number and defragmenter warmup

7555ea4

Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>

Clearing up comment

dddfbc9

Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>

Merge branch 'main' into dev/ksmusz/defrag_warmup_and_dynamic_swap

1f73488

Code cleanup

e4cee01

Signed-off-by: Krzysztof Smusz <ksmusz@habana.ai>

ksmusz marked this pull request as ready for review September 17, 2025 14:13

ksmusz requested review from adobrzyn, kzawora-intel, mswiniarsk and xuechendi as code owners September 17, 2025 14:13

Merge branch 'main' into dev/ksmusz/defrag_warmup_and_dynamic_swap

4578d10

Merge branch 'main' into dev/ksmusz/defrag_warmup_and_dynamic_swap

d192e40

ksmusz mentioned this pull request Sep 19, 2025

Adding dynamic swap number and defragmenter warmup #198

Merged

madamczyk-intel approved these changes Sep 19, 2025

mswiniarsk approved these changes Sep 19, 2025

Merge branch 'main' into dev/ksmusz/defrag_warmup_and_dynamic_swap

8a33199

mswiniarsk requested review from afierka-intel and mgawarkiewicz-intel as code owners September 23, 2025 11:25

Merge branch 'main' into dev/ksmusz/defrag_warmup_and_dynamic_swap

a041564

ksmusz requested a review from vivekgoe as a code owner September 24, 2025 06:25

Merge branch 'main' into dev/ksmusz/defrag_warmup_and_dynamic_swap

a65fd51

mswiniarsk merged commit 60808d7 into vllm-project:main Sep 25, 2025
9 checks passed

3 participants