♻️ Compatibility with vllm main #338
Conversation
The pooling task changes are already part of the embedding PR. There are more upstream changes around the pooling tasks, and I think they should be part of a follow-up PR. I don't want to pile more changes into a PR that is already big enough.

Yeah, I was trying to get a minimal set of changes in so that the tests pass against vllm:main. I think the pooling tasks are non-breaking, I need to get
Based on upstream vllm changes in vllm-project/vllm#20588 Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
```python
if hasattr(Pooler, "from_config_with_defaults"):
    # TODO: remove this when we no longer support
    # vllm version v0.9.2
    self.pooler = Pooler.from_config_with_defaults(
        pooler_config,
        pooling_type=PoolingType.CLS,
        normalize=True,
        softmax=False)
else:
    self.pooler = Pooler.for_embed(
        pooler_config=pooler_config,
        default_pooling_type=PoolingType.CLS,
        default_normalize=True,
        default_softmax=False)
```
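The `hasattr` check in the snippet above is a general feature-detection pattern for straddling a breaking API change without pinning a version. A minimal self-contained sketch of the same idea (the `PoolerV1`/`PoolerV2` classes below are hypothetical stand-ins, not the real vllm `Pooler` API):

```python
class PoolerV1:
    """Stand-in for an older API exposing from_config_with_defaults."""
    @classmethod
    def from_config_with_defaults(cls, pooler_config, **defaults):
        return cls()


class PoolerV2:
    """Stand-in for a newer API exposing for_embed."""
    @classmethod
    def for_embed(cls, pooler_config, **defaults):
        return cls()


def build_pooler(pooler_cls, pooler_config):
    # Feature-detect the available constructor at runtime instead of
    # branching on a version string: older classes get the old call,
    # everything else falls through to the new one.
    if hasattr(pooler_cls, "from_config_with_defaults"):
        return pooler_cls.from_config_with_defaults(pooler_config,
                                                    normalize=True)
    return pooler_cls.for_embed(pooler_config=pooler_config,
                                default_normalize=True)
```

Detecting the method on the class rather than comparing version numbers keeps the shim working even for installs from source, where the version string may not change.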
@maxdebayser I think we just need this piece of code from your PR to enable the vllm:main breaking changes; I've removed all other pooling changes from this PR. I'm fine with waiting for your PR to get in first, in which case I'll rebase and remove this change.
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
| if "generate" in self.model_config.supported_tasks: | ||
| tasks.extend(["generate"]) |
Ideally we would want this to come from the model directly:

```python
if is_text_generation_model(model):
    supported_tasks.append("generate")
```

but SpyreCausalLM doesn't seem to support it at the moment:

```python
>>> type(self.model)
<class 'vllm_spyre.model_executor.model_loader.spyre.SpyreCausalLM'>
>>> is_text_generation_model(self.model)
False
```
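The fallback discussed here can be sketched as follows: prefer interface-based detection, but trust the tasks declared in the model config when the model class does not pass the check. The helper names and the `is_generation` attribute below are hypothetical illustrations, not the real vllm detection mechanism:

```python
def is_text_generation_model(model) -> bool:
    # Stand-in for vllm's interface check; the real check inspects the
    # model class, which SpyreCausalLM currently does not satisfy.
    return getattr(model, "is_generation", False)


def get_supported_tasks(model, config_supported_tasks):
    # Interface detection first, config declaration as the fallback
    # (mirroring the `"generate" in supported_tasks` branch above).
    tasks = []
    if (is_text_generation_model(model)
            or "generate" in config_supported_tasks):
        tasks.append("generate")
    return tasks


class SpyreLikeModel:
    # Mimics a model that fails the interface check.
    is_generation = False
```

With this shape, a model like `SpyreLikeModel` still reports `"generate"` as long as the config declares it, and the interface check can take over once the model class gains support.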
Signed-off-by: Prashant Gupta <prashantgupta@us.ibm.com>
lgtm! thanks for taking that on!
Changes

- Remove prompt adapter config
- Implement `get_supported_tasks` in model_runner for the online API
Related Issues
fix #336