Conversation

@mayabar mayabar commented Jul 24, 2025

Currently, responses are generated by randomly selecting one sentence from a predefined list. As a result, even when max_tokens is set to a large value, long responses are never returned.

The suggestion to use a lorem ipsum generator was declined because the library we evaluated can produce at most 191 words; requesting a larger word count causes it to panic.

New behavior:

  • Echo mode: The input text is returned as-is if max_tokens or max_completion_tokens is set to a value higher than the number of input tokens; if it is lower, the input is trimmed to that limit. Useful for testing where the exact response must be known in advance.
  • Random mode:
    • if max_tokens or max_completion_tokens is specified, sentences are selected from the predefined collection until the token count reaches the limit
    • if the number of output tokens is not defined in the request, a random token count is generated for the response, drawn from a Gaussian distribution with a mean of 40 and a standard deviation of 20, capped by a maximum response length (currently 128 tokens)

Additional changes:

  • Use the tokenize function, which splits text on spaces and additional characters, in request processing too (not only in the tools-related part)
  • Validate max_tokens and max_completion_tokens when a request arrives and return status 400 if the value is invalid
  • Protect all random value generation with a mutex
  • Fix the tests for the changes above and add a test for random text creation

Signed-off-by: Maya Barnea <mayab@il.ibm.com>

@mayabar mayabar requested a review from shmuelk July 27, 2025 04:56
@shmuelk shmuelk left a comment

/lgtm

/approve

@mayabar mayabar merged commit 2b4a79a into llm-d:main Jul 27, 2025
2 checks passed
@mayabar mayabar deleted the long-responses branch July 29, 2025 11:15