Add synchronization of freeing worker after stream reqiest processing #244

mayabar · 2025-10-30T12:18:03Z

Problem: When handling requests with stream=true, a worker is returned to the pool of free workers before streaming completes. As a result, the same worker may get more requests to process, in addition calculation of decode time is wrong.

Solution: Run processRequest in a separate go routine, and use a WaitGroup to ensure that worker is not released until all streaming chunks have been sent.

Signed-off-by: Maya Barnea <mayab@il.ibm.com>

…lculations for requests in streaming mode Signed-off-by: Maya Barnea <mayab@il.ibm.com>

shmuelk · 2025-10-30T14:03:07Z

/lgtm
/approve

add synchronization of freeing worker after stream reqiest processing

aaab574

Signed-off-by: Maya Barnea <mayab@il.ibm.com>

mayabar requested a review from shmuelk October 30, 2025 12:18

additioal changes which fix e2e request latency and inference time ca…

caa3003

…lculations for requests in streaming mode Signed-off-by: Maya Barnea <mayab@il.ibm.com>

github-actions bot added the lgtm label Oct 30, 2025

github-actions bot approved these changes Oct 30, 2025

View reviewed changes

github-actions bot merged commit 658e3e5 into llm-d:main Oct 30, 2025
4 checks passed

mayabar deleted the streaming-in-queue branch November 4, 2025 09:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add synchronization of freeing worker after stream reqiest processing #244

Add synchronization of freeing worker after stream reqiest processing #244

Uh oh!

mayabar commented Oct 30, 2025

Uh oh!

shmuelk commented Oct 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add synchronization of freeing worker after stream reqiest processing #244

Add synchronization of freeing worker after stream reqiest processing #244

Uh oh!

Conversation

mayabar commented Oct 30, 2025

Uh oh!

shmuelk commented Oct 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants