Implement stateful inference session timeout #3263

Merged: 8 commits merged into master on Jul 23, 2024

Conversation

namannandan
Collaborator

@namannandan namannandan commented Jul 22, 2024

Description

Add support for session timeout for stateful inference.

sequenceTimeoutMSec: the maximum duration, in milliseconds, of a sequence inference request for this stateful model. The default value is 0 (i.e., there is effectively no sequence timeout and the sequence does not expire). Once the sequence timeout has been exceeded, TorchServe does not process a new inference request.
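
The semantics described above can be illustrated with a minimal Python sketch. This is not TorchServe's implementation; the class `SequenceSession`, its methods, and the injectable `clock` parameter are hypothetical names used only for illustration. The sketch also assumes the timer starts when the sequence is created, which the description does not state explicitly.

```python
import time
from typing import Callable


class SequenceSession:
    """Hypothetical model of the documented timeout semantics (not TorchServe code)."""

    def __init__(
        self,
        sequence_timeout_msec: int = 0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        if sequence_timeout_msec < 0:
            raise ValueError("sequence_timeout_msec must be >= 0")
        self.sequence_timeout_msec = sequence_timeout_msec
        self._clock = clock
        # Assumption: the timer starts when the sequence is created.
        self._started_at = clock()

    def is_expired(self) -> bool:
        # A value of 0 means the sequence never expires.
        if self.sequence_timeout_msec == 0:
            return False
        elapsed_msec = (self._clock() - self._started_at) * 1000.0
        return elapsed_msec > self.sequence_timeout_msec

    def accept_request(self) -> bool:
        # New requests are rejected once the sequence timeout is exceeded.
        return not self.is_expired()
```

With a timeout of 500, a request arriving 400 ms into the sequence would be accepted and one arriving 600 ms in would be rejected; with the default of 0, requests are accepted regardless of elapsed time.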

Type of change

  • New feature (non-breaking change which adds functionality)

Feature/Issue validation/testing

  • test/pytest/test_example_stateful_sequence_continuous_batching_http.py::test_infer_stateful_idle_timeout
  • test/pytest/test_example_stateful_sequence_continuous_batching_http.py::test_infer_stateful_session_timeout

@namannandan namannandan marked this pull request as ready for review July 23, 2024 17:01
@namannandan namannandan requested review from mreso, lxning and agunapal July 23, 2024 17:01
Collaborator

@mreso mreso left a comment

LGTM, thanks for the quick reaction time

examples/stateful/sequence_batching/Readme.md (outdated, resolved)
examples/stateful/sequence_batching/Readme.md (outdated, resolved)
examples/stateful/sequence_continuous_batching/Readme.md (outdated, resolved)
namannandan and others added 3 commits July 23, 2024 10:30
Co-authored-by: Matthias Reso <13337103+mreso@users.noreply.github.com>
Co-authored-by: Matthias Reso <13337103+mreso@users.noreply.github.com>
Co-authored-by: Matthias Reso <13337103+mreso@users.noreply.github.com>
@namannandan namannandan enabled auto-merge July 23, 2024 18:00
@namannandan namannandan added this pull request to the merge queue Jul 23, 2024
Merged via the queue into master with commit ace390c Jul 23, 2024
12 checks passed
2 participants