
Conversation

@coolcloudcol (Contributor)

No description provided.

@zhuohan123 (Member) left a comment

LGTM! Thanks for your contribution!

@zhuohan123 zhuohan123 merged commit 7717d08 into vllm-project:main Jul 3, 2023
@coolcloudcol coolcloudcol deleted the fix-endless-loop branch July 4, 2023 01:37
yukavio pushed a commit to yukavio/vllm that referenced this pull request Jul 3, 2024
This PR updates the benchmarking performed in remote-push and nightly
runs according to the first set of deliverables from our recent meeting:

* Only the `benchmark_serving.json` config is run
  * This is accomplished with a new list, `nm_benchmark_base_config_list.txt`; other lists are untouched
* The `benchmark_serving.json` config has various reductions (sketched below):
  * Model list reduced to `facebook/opt-350m` and `meta-llama/Meta-Llama-3-8B-Instruct`
  * `nr-qps` list reduced to `300,1`
  * Metric tracking reduced to mean TPOT and mean TTFT (other metrics are still recorded/logged as usual)
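
A hypothetical sketch of what the reduced config might look like; the field names here are assumptions for illustration only, not the actual nm-vllm benchmark schema:

```python
import json

reduced_config = {
    "models": [  # model list reduced to two entries
        "facebook/opt-350m",
        "meta-llama/Meta-Llama-3-8B-Instruct",
    ],
    "nr_qps_list": ["300,1"],  # single num-requests,qps pair
    "tracked_metrics": [  # only these two are tracked; others still logged
        "mean_tpot_ms",
        "mean_ttft_ms",
    ],
}

print(json.dumps(reduced_config, indent=2))
```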

There is also a small fix related to server startup: `localhost` was changed to `127.0.0.1`, because `localhost` on these machines is mapped to the IPv6 loopback `::1`, which something in the server stack does not handle.
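
A minimal stdlib sketch (not from the PR) of why this matters: on a host whose hosts file maps `localhost` to `::1`, name resolution may prefer IPv6, so a client can fail to reach a server that only listens on IPv4. Using the literal address bypasses hostname resolution entirely:

```python
import socket

# Show what `localhost` actually resolves to on this machine; on an
# IPv6-first host, ('::1', ...) entries appear before ('127.0.0.1', ...).
for family, _, _, _, sockaddr in socket.getaddrinfo(
    "localhost", 8000, proto=socket.IPPROTO_TCP
):
    print(family, sockaddr)

# Connecting to the literal IPv4 loopback sidesteps the lookup:
# socket.create_connection(("127.0.0.1", 8000), timeout=5)
```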

In a commit prior to opening the PR that contained all of the functional changes, the full `benchmark` job took under 30 minutes:

https://github.com/neuralmagic/nm-vllm/actions/runs/9669361155/job/26709082658
jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Sep 30, 2024
…th LoRA (vllm-project#339)

This PR has the following fixes:
- Increase the size of the indices tensors used to maintain multi-LoRA state information from `max_num_batched_tokens` to `3 * max_num_batched_tokens`. This provides a buffer for the padding applied in the batch and sequence dimensions (see the sketch below).
- Move the logic that removes padding from `lora_logits` out of `execute_model()` and back into the `LogitsProcessorWithLoRA` class; this fixes a race condition caused by updating the multi-LoRA state information directly.

FIX HabanaAI#237
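
A minimal sketch (hypothetical variable names, not the actual vLLM code) of the sizing change described above: the index tensors that map each token slot to a LoRA adapter get 3x headroom so that padding added in the batch and sequence dimensions cannot overrun them.

```python
import torch

max_num_batched_tokens = 4096

# Before: one slot per batched token, with no room for padding.
# base_indices = torch.empty(max_num_batched_tokens, dtype=torch.long)

# After: 3 * max_num_batched_tokens slots absorb batch- and
# sequence-dimension padding without reallocating.
base_indices = torch.empty(3 * max_num_batched_tokens, dtype=torch.long)
sampler_indices = torch.empty(3 * max_num_batched_tokens, dtype=torch.long)
print(base_indices.shape, sampler_indices.shape)
```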
jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Sep 19, 2025
yma11 added a commit to yma11/vllm that referenced this pull request Oct 25, 2025
yma11 added a commit to yma11/vllm that referenced this pull request Oct 26, 2025
yma11 added a commit to yma11/vllm that referenced this pull request Oct 26, 2025
yma11 added a commit to yma11/vllm that referenced this pull request Oct 28, 2025
yma11 added a commit to yma11/vllm that referenced this pull request Oct 30, 2025
yma11 added a commit to yma11/vllm that referenced this pull request Oct 31, 2025