[New Model]: Support GteNewModelForSequenceClassification #23524

noooop · 2025-08-25T06:10:31Z

TL;DR

New Model: GteNewModelForSequenceClassification

Alibaba-NLP/gte-multilingual-reranker-base

vllm serve Alibaba-NLP/gte-multilingual-reranker-base --hf-overrides '{"architectures": ["GteNewForSequenceClassification"]}' --trust_remote_code

The second-generation GTE model (mGTE-TRM) is named NewForSequenceClassification. The name NewForSequenceClassification is too generic, you should set --hf-overrides '{"architectures": ["GteNewForSequenceClassification"]}' to specify the use of the GteNewForSequenceClassification architecture.

Purpose

Fix #21595

Test Plan

pytest -s -vvv tests/models/language/pooling/test_gte.py::test_rerank_models_mteb[model_info1]

Test Result

pass

(Optional) Documentation Update

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

Signed-off-by: wang.yuqi <noooop@126.com>

DarkLight1337

Can you split out the classification optimization into a separate PR?

noooop · 2025-08-25T09:21:12Z

Can you split out the classification optimization into a separate PR?

The content is too small, so they are combined. you tend to separate them.

For the following period of time, I will profile and optimize different models.

Their optimizations are all small, but I don't want to wait until all optimizations are completed before submitting,

so the best way is to support similar models and hitchhike.

tests/models/language/pooling/test_gte.py

DarkLight1337 · 2025-08-25T09:28:17Z

The content is too small, so they are combined. you tend to separate them.

It's better to separate them to better isolate the issue in case CI breaks

noooop · 2025-08-25T09:29:27Z

The content is too small, so they are combined. you tend to separate them.

It's better to separate them to better isolate the issue in case CI breaks

For the following period of time, I will profile and optimize different models.

Their optimizations are all small, but I don't want to wait until all optimizations are completed before submitting,

so the best way is to support similar models and hitchhike.

DarkLight1337 · 2025-08-25T09:32:56Z

For the refactoring of num_labels specifically I think that should be in a separate PR. The optimization inside pooler.py is small enough to keep in this PR.

Signed-off-by: wang.yuqi <noooop@126.com>

noooop · 2025-08-28T04:36:43Z

@DarkLight1337

Can we merge this PR first?

DarkLight1337

Sure, LGTM if tests pass

Signed-off-by: wang.yuqi <noooop@126.com>

noooop · 2025-08-28T07:35:36Z

@DarkLight1337

Sorry for disable auto-merge.

…ct#23524) Signed-off-by: wang.yuqi <noooop@126.com>

mergify bot added documentation Improvements or additions to documentation frontend new-model Requests to new models v1 labels Aug 25, 2025

noooop added 2 commits August 25, 2025 14:13

Score API

42fce02

Signed-off-by: wang.yuqi <noooop@126.com>

+ GteNewForSequenceClassification

4d2572e

Signed-off-by: wang.yuqi <noooop@126.com>

noooop force-pushed the gte_seq_cls branch from 96efd88 to 4d2572e Compare August 25, 2025 06:13

noooop added 2 commits August 25, 2025 14:15

Merge branch 'main' into gte_seq_cls

747f8b1

baseline

a8d845f

Signed-off-by: wang.yuqi <noooop@126.com>

mergify bot added the qwen Related to Qwen models label Aug 25, 2025

Removing pooled_data.shape[-1] causes CUDA sync

708962c

Signed-off-by: wang.yuqi <noooop@126.com>

noooop marked this pull request as ready for review August 25, 2025 09:16

noooop requested review from DarkLight1337, WoosukKwon, aarnphm, alexm-redhat, comaniac, hmellor, njhill, robertgshaw2-redhat, sighingnow, simon-mo and ywang96 as code owners August 25, 2025 09:16

Merge branch 'main' into gte_seq_cls

09b6537

DarkLight1337 reviewed Aug 25, 2025

View reviewed changes

tests/models/language/pooling/test_gte.py Outdated Show resolved Hide resolved

noooop added 7 commits August 25, 2025 22:41

Merge branch 'main' into gte_seq_cls

0371c71

fix registry

7c9c0f6

Signed-off-by: wang.yuqi <noooop@126.com>

Merge branch 'main' into gte_seq_cls

1ec16b5

Merge branch 'main' into gte_seq_cls

887a4e4

conflicts

bcbf00e

Signed-off-by: wang.yuqi <noooop@126.com>

Merge branch 'main' into gte_seq_cls

9baa126

add back

112f557

Signed-off-by: wang.yuqi <noooop@126.com>

DarkLight1337 approved these changes Aug 28, 2025

View reviewed changes

DarkLight1337 enabled auto-merge (squash) August 28, 2025 04:53

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 28, 2025

fix docs

3ba9300

Signed-off-by: wang.yuqi <noooop@126.com>

auto-merge was automatically disabled August 28, 2025 05:08
Head branch was pushed to by a user without write access

DarkLight1337 merged commit 11a7faf into vllm-project:main Aug 28, 2025
42 checks passed

zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Aug 28, 2025

[New Model]: Support GteNewModelForSequenceClassification (vllm-proje…

907bd2a

…ct#23524) Signed-off-by: wang.yuqi <noooop@126.com>

noooop deleted the gte_seq_cls branch August 29, 2025 07:38

zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Sep 3, 2025

[New Model]: Support GteNewModelForSequenceClassification (vllm-proje…

f7e9e33

…ct#23524) Signed-off-by: wang.yuqi <noooop@126.com>

eicherseiji pushed a commit to eicherseiji/vllm that referenced this pull request Sep 9, 2025

[New Model]: Support GteNewModelForSequenceClassification (vllm-proje…

a53ad8d

…ct#23524) Signed-off-by: wang.yuqi <noooop@126.com>

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[New Model]: Support GteNewModelForSequenceClassification (vllm-proje…

7d57749

…ct#23524) Signed-off-by: wang.yuqi <noooop@126.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[New Model]: Support GteNewModelForSequenceClassification #23524

[New Model]: Support GteNewModelForSequenceClassification #23524

Uh oh!

noooop commented Aug 25, 2025 •

edited by github-actions bot

Loading

Uh oh!

DarkLight1337 left a comment

Uh oh!

noooop commented Aug 25, 2025 •

edited

Loading

Uh oh!

Uh oh!

DarkLight1337 commented Aug 25, 2025

Uh oh!

noooop commented Aug 25, 2025

Uh oh!

DarkLight1337 commented Aug 25, 2025

Uh oh!

noooop commented Aug 28, 2025

Uh oh!

DarkLight1337 left a comment

Uh oh!

noooop commented Aug 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[New Model]: Support GteNewModelForSequenceClassification #23524

[New Model]: Support GteNewModelForSequenceClassification #23524

Uh oh!

Conversation

noooop commented Aug 25, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TL;DR

Purpose

Test Plan

Test Result

(Optional) Documentation Update

Uh oh!

DarkLight1337 left a comment

Choose a reason for hiding this comment

Uh oh!

noooop commented Aug 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

DarkLight1337 commented Aug 25, 2025

Uh oh!

noooop commented Aug 25, 2025

Uh oh!

DarkLight1337 commented Aug 25, 2025

Uh oh!

noooop commented Aug 28, 2025

Uh oh!

DarkLight1337 left a comment

Choose a reason for hiding this comment

Uh oh!

noooop commented Aug 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

noooop commented Aug 25, 2025 •

edited by github-actions bot

Loading

noooop commented Aug 25, 2025 •

edited

Loading