[Optimization] Use a cheaper cache key in `get_model_architecture` #25682

DarkLight1337 · 2025-09-25T15:20:54Z

Purpose

FIX (partial) #25671

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

gemini-code-assist

Code Review

This pull request introduces a caching mechanism for the model configuration hash to optimize performance. While the caching logic itself is sound, the implementation for cache invalidation via __setattr__ introduces a critical flaw by allowing mutations on the ModelConfig object that can lead to an inconsistent state. This can cause silent misconfigurations and hard-to-debug issues. I've provided detailed comments on this and other potential improvements.

vllm/config/model.py

tests/test_config.py

ProExpertProg

This introduces complexity, it would be better if we can just not recompute model arch or class.

This reverts commit 9fd2542.

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 · 2025-09-25T18:19:58Z

Updated according to #25671 (comment), see if it looks good to you now

DarkLight1337 · 2025-09-25T19:10:51Z

We need both this PR and #25702 to solve #25671

ProExpertProg

Thanks for addressing!

…25682) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: yewentao256 <zhyanwentao@126.com>

…llm-project#25682) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

…llm-project#25682) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

…llm-project#25682) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

[Optimization] Cache the hash of the model config

9fd2542

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 requested review from WoosukKwon and simon-mo as code owners September 25, 2025 15:20

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 25, 2025

DarkLight1337 requested review from ProExpertProg, hmellor, houseroad, mgoin, robertgshaw2-redhat, tlrmchlsmth, yewentao256 and youkaichao as code owners September 25, 2025 15:20

DarkLight1337 mentioned this pull request Sep 25, 2025

[Performance] model_config.compute_hash is computed every time and introduce overhead in each new multi-modal req #25671

Closed

gemini-code-assist bot reviewed Sep 25, 2025

View reviewed changes

vllm/config/model.py Outdated Show resolved Hide resolved

vllm/config/model.py Outdated Show resolved Hide resolved

tests/test_config.py Outdated Show resolved Hide resolved

DarkLight1337 requested a review from Isotr0py September 25, 2025 15:22

Isotr0py approved these changes Sep 25, 2025

View reviewed changes

DarkLight1337 enabled auto-merge (squash) September 25, 2025 16:49

ProExpertProg requested changes Sep 25, 2025

View reviewed changes

DarkLight1337 disabled auto-merge September 25, 2025 17:48

DarkLight1337 changed the title ~~[Optimization] Cache the hash of the model config~~ [Optimization] Use a cheaper cache key in get_model_architecture Sep 25, 2025

DarkLight1337 added 2 commits September 25, 2025 18:14

Revert "[Optimization] Cache the hash of the model config"

45371ce

This reverts commit 9fd2542.

Use a cheaper cache key

fa7579a

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 requested a review from 22quinn as a code owner September 25, 2025 18:19

DarkLight1337 added this to the v0.11.0 milestone Sep 25, 2025

ProExpertProg approved these changes Sep 25, 2025

View reviewed changes

ProExpertProg merged commit 89fa54e into vllm-project:main Sep 25, 2025
43 checks passed

DarkLight1337 deleted the cache-hash branch September 26, 2025 03:41

yewentao256 pushed a commit that referenced this pull request Oct 3, 2025

[Optimization] Use a cheaper cache key in get_model_architecture (#…

b558c3a

…25682) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: yewentao256 <zhyanwentao@126.com>

choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025

[Optimization] Use a cheaper cache key in get_model_architecture (v…

3aa6f87

…llm-project#25682) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025

[Optimization] Use a cheaper cache key in get_model_architecture (v…

4009276

…llm-project#25682) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Optimization] Use a cheaper cache key in `get_model_architecture` #25682

[Optimization] Use a cheaper cache key in `get_model_architecture` #25682

Uh oh!

DarkLight1337 commented Sep 25, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ProExpertProg left a comment

Uh oh!

DarkLight1337 commented Sep 25, 2025

Uh oh!

DarkLight1337 commented Sep 25, 2025

Uh oh!

ProExpertProg left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[Optimization] Use a cheaper cache key in get_model_architecture #25682

[Optimization] Use a cheaper cache key in get_model_architecture #25682

Uh oh!

Conversation

DarkLight1337 commented Sep 25, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ProExpertProg left a comment

Choose a reason for hiding this comment

Uh oh!

DarkLight1337 commented Sep 25, 2025

Uh oh!

DarkLight1337 commented Sep 25, 2025

Uh oh!

ProExpertProg left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Optimization] Use a cheaper cache key in `get_model_architecture` #25682

[Optimization] Use a cheaper cache key in `get_model_architecture` #25682

DarkLight1337 commented Sep 25, 2025 •

edited by github-actions bot

Loading