@noooop
Collaborator

@noooop noooop commented Sep 15, 2025

Purpose

Support BertForTokenClassification / Named Entity Recognition (NER) task

Fix BERT + Flex Attention + torch.compile, by @Isotr0py
(see: #24872 (comment))

Fix #24752
Fix #25060

Test Plan

tests/models/language/pooling/test_token_classification.py

Test Result

pass


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

@mergify mergify bot added the documentation and new-model labels Sep 15, 2025
Signed-off-by: wang.yuqi <noooop@126.com>
@noooop
Collaborator Author

noooop commented Sep 15, 2025

@mlinmg

try: examples/online_serving/ner.py

But it's definitely not very user-friendly.

Are you OK with that, or do we still need to further optimize the API?
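
For readers unfamiliar with the task, here is a minimal sketch of the client-side post-processing that token classification output typically needs, assuming the server returns one logit vector per token. It is not the content of examples/online_serving/ner.py; the function name, label set, tokens, and logit values are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Sequence, Tuple


def decode_token_labels(
    tokens: Sequence[str],
    logits: Sequence[Sequence[float]],
    id2label: Dict[int, str],
) -> List[Tuple[str, str]]:
    """Pair each token with the label whose logit is highest.

    Hypothetical helper: assumes one logit vector per token.
    """
    if len(tokens) != len(logits):
        raise ValueError("expected exactly one logit vector per token")
    decoded = []
    for token, row in zip(tokens, logits):
        # argmax over the label dimension for this token
        best = max(range(len(row)), key=row.__getitem__)
        decoded.append((token, id2label[best]))
    return decoded


# Hypothetical label map and model output, for illustration only.
ID2LABEL = {0: "O", 1: "B-PER", 2: "I-PER", 3: "B-LOC"}
TOKENS = ["Ada", "Lovelace", "visited", "Paris"]
LOGITS = [
    [0.1, 2.3, 0.2, 0.0],
    [0.0, 0.4, 1.9, 0.1],
    [3.0, 0.1, 0.1, 0.2],
    [0.2, 0.1, 0.0, 2.5],
]

RESULT = decode_token_labels(TOKENS, LOGITS, ID2LABEL)
for token, label in RESULT:
    print(f"{token}\t{label}")
```

In a real model the label map would normally come from the model's own configuration rather than being hard-coded as it is here.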

@mlinmg
Contributor

mlinmg commented Sep 15, 2025

That's great! The implementation really depends on the popularity of the token classification API. For now, a basic implementation should be more than enough imo. If more people start using it, it might be worthwhile to create a complete end-to-end flow.

Signed-off-by: wang.yuqi <noooop@126.com>
@noooop noooop marked this pull request as ready for review September 16, 2025 06:41
@noooop
Collaborator Author

noooop commented Sep 16, 2025

@DarkLight1337

Ready for review

Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
@noooop
Collaborator Author

noooop commented Sep 18, 2025

@DarkLight1337
Could you please review this PR?

Member

@DarkLight1337 DarkLight1337 left a comment

LGTM if the tests pass.

@DarkLight1337 DarkLight1337 added the ready label Sep 18, 2025
@noooop
Collaborator Author

noooop commented Sep 18, 2025

@DarkLight1337

You didn't use auto-merge.

@DarkLight1337 DarkLight1337 merged commit 5f696c3 into vllm-project:main Sep 18, 2025
66 checks passed
@noooop noooop deleted the fix_bert_seq_cls branch September 18, 2025 15:26
debroy-rh pushed a commit to debroy-rh/vllm that referenced this pull request Sep 19, 2025
…tion (NER) task (vllm-project#24872)

Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
…tion (NER) task (vllm-project#24872)

Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025
…tion (NER) task (vllm-project#24872)

Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: charlifu <charlifu@amd.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
…tion (NER) task (vllm-project#24872)

Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025
…tion (NER) task (vllm-project#24872)

Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
…tion (NER) task (vllm-project#24872)

Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
…tion (NER) task (vllm-project#24872)

Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Labels

documentation, frontend, new-model, ready, v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: Does not run embedding model sergeyzh/rubert-tiny-turbo
[Bug]: Some Bert models no longer compatible for BertForSequenceClassification

4 participants