Update transformers to `v4.55` #21931

hmellor · 2025-07-30T13:00:15Z

Updates to the latest version of Transformers.

Notable changes:

Transformers backend:
- Get the tp_plan from the config of the base model because it is no longer added to the base model itself post init
- Add a default mapping from nn.Linear to ReplicatedLinear to enable weight loaders like BitsAndBytesModelLoader to work with models that do not support TP if TP is disabled
- Decoder modules no longer return tuple as of Refactor the way we handle outputs for new llamas and new models huggingface/transformers#39120 so return_tuple has been removed from PPMissingLayer.forward()

Enables the proper type hinting introduced by #21913

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

github-actions · 2025-07-30T13:00:25Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

gemini-code-assist

Code Review

This pull request updates the transformers library to version v4.54.1 and huggingface-hub to v0.34.3. While this is a necessary update, it introduces potential risks due to reliance on workarounds and private APIs from older transformers versions. I recommend addressing the following points to ensure stability and correctness. First, a critical issue in vllm/model_executor/models/transformers.py is the use of the private _attn_implementation API. With transformers v4.54.1, the public model.set_attn_implementation() should be used to avoid silently disabling vLLM's custom attention, which would impact performance and correctness. Second, there are high-risk, potentially obsolete workarounds in vllm/config.py for gemma2 and gemma3n_text model configs, and in vllm/transformers_utils/tokenizer.py for ChatGLMTokenizer. These may now be fixed in transformers and could cause incorrect behavior if they override updated library logic. Verifying and removing these if they are no longer necessary is highly recommended.

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

DarkLight1337 · 2025-07-30T13:33:00Z

What failures are introduced by that PR? From my understanding those test failures also happen on main.

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

hmellor · 2025-07-30T13:41:07Z

I wasn't aware they were also on main. I assumed they were from that PR because many of them refer to the same configs that were modified.

DarkLight1337 · 2025-07-30T13:43:07Z

Well in any case, it's great to fix them anyway

DarkLight1337 · 2025-07-30T13:46:26Z

Ok I figured out that some of the failures in that PR are caused by missing trust_remote_code=True (because we previously loaded those configs from vLLM directly), let me open a separate PR.

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

hmellor · 2025-07-30T13:50:22Z

Ok, happy to review once it's open

DarkLight1337 · 2025-07-30T13:58:45Z

Opened #21934

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 · 2025-07-30T16:16:09Z

Hope this passes now

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

…t now Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

noooop · 2025-07-31T07:08:59Z

gte-Qwen2-1.5B-instruct hf Implementation does not support v4.54.1

hmellor · 2025-07-31T08:46:16Z

@noooop are you suggesting that we upper bound this model in tests to 4.53.2?

hmellor · 2025-08-05T18:01:22Z

Ok, I'll make that change in the next batch of commits (I'll let the tests run to completion)

zyongye · 2025-08-06T04:16:47Z

How long can we land this? Need this first before merging gpt-oss changes (#22259)

DarkLight1337 · 2025-08-06T04:38:11Z

We just need to get CI to pass

hmellor · 2025-08-06T07:56:26Z

I understand this was a priority for gpt-oss, I'll work on hotfixiing anything that wasn't fully resolved in CI

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: isotr0py <2037008807@qq.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: Isotr0py <2037008807@qq.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: isotr0py <2037008807@qq.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: Isotr0py <2037008807@qq.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: isotr0py <2037008807@qq.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: Isotr0py <2037008807@qq.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: Noam Gat <noamgat@gmail.com>

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: isotr0py <2037008807@qq.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: Isotr0py <2037008807@qq.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: isotr0py <2037008807@qq.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: Isotr0py <2037008807@qq.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: Diego-Castan <diego.castan@ibm.com>

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: isotr0py <2037008807@qq.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: Isotr0py <2037008807@qq.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: isotr0py <2037008807@qq.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: Isotr0py <2037008807@qq.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: Xiao Yu <xiao.yu@amd.com>

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: isotr0py <2037008807@qq.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: Isotr0py <2037008807@qq.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

Update transformers to v4.54.1

a742162

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

hmellor added the ready ONLY add when PR is ready to merge/full CI is needed label Jul 30, 2025

mergify bot added the ci/build label Jul 30, 2025

gemini-code-assist bot reviewed Jul 30, 2025

View reviewed changes

hmellor added 2 commits July 30, 2025 15:06

Use public method to set attn implementation in Transformers backend

97d7f25

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Fix MPT

fa697f5

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Exaone is a remote model

24bb2c4

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Fix solar

ecebd0c

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

hmellor requested review from DarkLight1337 and ywang96 as code owners July 30, 2025 13:51

hmellor added 4 commits July 30, 2025 15:59

Fix telechat

ad38ae2

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Fix skywork

d439137

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Fix hunyuan

1dcf9f4

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

spaces

30bdcde

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

DarkLight1337 mentioned this pull request Jul 30, 2025

[Misc] Use config definitions from Transformers library #21913

Merged

4 tasks

DarkLight1337 added 3 commits July 30, 2025 16:12

Merge branch 'main' into update-transformers-4-54

3ce689f

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Drop min_transformers_version="4.53"

36621f4

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Fix duplicated code

c305846

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

hmellor added 2 commits July 30, 2025 18:53

Revert telechat2 to how it is on main

59cd39e

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Revert public method as it's too brittle to use for our purposes righ…

0af4810

…t now Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

zyongye mentioned this pull request Aug 6, 2025

Support gpt-oss #22259

Closed

3 tasks

Merge branch 'main' into update-transformers-4-54

0a6ff09

WoosukKwon approved these changes Aug 6, 2025

View reviewed changes

WoosukKwon merged commit 796bae0 into vllm-project:main Aug 6, 2025
12 of 16 checks passed

hmellor deleted the update-transformers-4-54 branch August 6, 2025 07:55

myselvess mentioned this pull request Aug 6, 2025

[Model] support new model ovis2.5 #22187

Closed

4 tasks

This was referenced Aug 7, 2025

[CI Failure]: test_reward.py::test_prm_models - AttributeError: 'DynamicCache' object has no attribute 'get_usable_length'. Did you mean: 'get_seq_length'? #22398

Closed

[CI] Skip the pooling models that do not support transformers v4.55 #22411

Merged

hmellor mentioned this pull request Aug 8, 2025

[Bug]: Smollm3M not working anymore #22517

Closed

1 task

hmellor mentioned this pull request Aug 11, 2025

Fix Transformers backend tensor parallel for multimodal models #22673

Merged

Uh oh!

Uh oh!

Update transformers to v4.55 #21931

Update transformers to v4.55 #21931

Uh oh!

Conversation

hmellor commented Jul 30, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jul 30, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

DarkLight1337 commented Jul 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hmellor commented Jul 30, 2025

Uh oh!

DarkLight1337 commented Jul 30, 2025

Uh oh!

DarkLight1337 commented Jul 30, 2025

Uh oh!

hmellor commented Jul 30, 2025

Uh oh!

DarkLight1337 commented Jul 30, 2025

Uh oh!

DarkLight1337 commented Jul 30, 2025

Uh oh!

noooop commented Jul 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hmellor commented Jul 31, 2025

Uh oh!

hmellor commented Aug 5, 2025

Uh oh!

zyongye commented Aug 6, 2025

Uh oh!

DarkLight1337 commented Aug 6, 2025

Uh oh!

Uh oh!

hmellor commented Aug 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Update transformers to `v4.55` #21931

Update transformers to `v4.55` #21931

hmellor commented Jul 30, 2025 •

edited by github-actions bot

Loading

DarkLight1337 commented Jul 30, 2025 •

edited

Loading

noooop commented Jul 31, 2025 •

edited

Loading