Feat: Evict requests if the client has disconnected #208

bhimrazy · 2024-08-21T04:43:04Z

Before submitting

Was this discussed/agreed via a Github issue? (no need for typos and docs improvements)
Did you read the contributor guideline, Pull Request section?
Did you make sure to update the docs?
Did you write any new necessary tests?

What does this PR do?

Fixes #165.

This PR addresses the situation when client requests are disconnected before completion. It tracks canceled requests and stops ongoing/running tasks, thereby saving computational resources.

Approach

The solution involves checking whether the client has disconnected during both predict and stream predict operations.
If a disconnection is detected, the system adds the corresponding request ID to the request_evicted_status multiprocessing dictionary.

This dictionary is then monitored by the running loops (both streaming and non-streaming) in worker process. If the worker loop detects that the current running request ID is present in the request_evicted_status, it immediately terminates the ongoing task associated with that request.

Potential Impacts

This approach might impact performance due to the additional overhead of monitoring and terminating tasks, which could introduce minor delays in processing. Benchmarking

TODO

Handle client request disconnection in streaming mode
Handle client request disconnection in non-streaming mode (In progress)
- Investigating methods to terminate tasks from the worker process (run_single_loop).
Add/Improve tests (In progress)
- Exploring ways to verify task termination from the worker process
Handle client disconnection for batched requests

Any help or guidance on this PR would be greatly appreciated 🙏. Thank you! 😊

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

aniketmaurya · 2024-08-21T15:40:51Z

looking good @bhimrazy so far! We might have to run some benchmarks to verify that we don't lose performance because of multiprocessing synchronization. But really good approach 😄

bhimrazy · 2024-08-21T16:47:43Z

looking good @bhimrazy so far! We might have to run some benchmarks to verify that we don't lose performance because of multiprocessing synchronization. But really good approach 😄

Thanks, @aniketmaurya! for the feedback.
I'm currently exploring ways to test this feature and will keep you updated on the progress.

aniketmaurya · 2024-08-21T16:49:50Z

@bhimrazy I'd suggest to go ahead with this technique and maybe implement is for single non batched loop then we can run some tests. and if everything goes well then we can implement it for other loops too.

bhimrazy · 2024-08-21T17:01:09Z

Sure, that sounds great!

codecov · 2024-08-21T18:43:11Z

Codecov Report

Attention: Patch coverage is 77.77778% with 12 lines in your changes missing coverage. Please review.

Project coverage is 81%. Comparing base (a65fadf) to head (6ffe51c).
Report is 9 commits behind head on main.

Additional details and impacted files

@@         Coverage Diff         @@
##           main   #208   +/-   ##
===================================
- Coverage    82%    81%   -0%     
===================================
  Files        13     13           
  Lines      1048   1084   +36     
===================================
+ Hits        855    881   +26     
- Misses      193    203   +10

for more information, see https://pre-commit.ci

tests/test_lit_server.py

…razy/LitServe into feat/evict-req-on-client-disconnect

src/litserve/server.py

williamFalcon · 2024-08-23T11:07:59Z

@bhimrazy please add a proper PR description. LitServe is live now, so it needs to follow Lightning AI's guidelines for production-ready OSS software.

clear PR descriptions
tests
documentation

thanks!

bhimrazy · 2024-08-26T16:26:13Z

Closing this PR.

Due to the complexity involved, the streaming and non-streaming cases will be handled separately in new PRs (in more better and cleaner way).

You can find the streaming case PR here: #223.

chore: Add request_evicted_status to streaming loop to cancel requests

cdff37b

bhimrazy requested review from lantiga, aniketmaurya, awaelchli and Andrei-Aksionov as code owners August 21, 2024 04:43

bhimrazy marked this pull request as draft August 21, 2024 04:43

Merge branch 'main' into feat/evict-req-on-client-disconnect

7564479

aniketmaurya mentioned this pull request Aug 21, 2024

Evict requests if the client has disconnected #165

Open

Merge branch 'main' into feat/evict-req-on-client-disconnect

99da82b

bhimrazy added 2 commits August 22, 2024 00:17

Merge branch 'main' into feat/evict-req-on-client-disconnect

15fe905

fix failing test

e5565a8

bhimrazy and others added 13 commits August 22, 2024 01:11

fixed: cannot access local variable 'uid'

36429a5

feat: adds test for test_stream_client_disconnection

9c08744

ref: format imports using ruff

f5522fa

fix lint warning for @pytest.mark.asyncio

2f46532

adds a todo in the test for reminder

7aacee6

[pre-commit.ci] auto fixes from pre-commit.com hooks

4327e49

for more information, see https://pre-commit.ci

Merge branch 'main' into feat/evict-req-on-client-disconnect

c054af3

adds cleanup for the dict to prevent leakage

6eeb90d

chore: fix typo in test_lit_server.py

dc041d2

updates the sleep time

18419f1

updated some time

f6763e5

updated prompt len

6dc6454

chore: Remove print statement in stream_predict method

e7b3059

bhimrazy commented Aug 22, 2024

View reviewed changes

tests/test_lit_server.py Outdated Show resolved Hide resolved

bhimrazy added 8 commits August 23, 2024 09:38

Merge branch 'feat/evict-req-on-client-disconnect' of github.com:bhim…

a9d86ce

…razy/LitServe into feat/evict-req-on-client-disconnect

chore: Add delayed prediction support in LitAPI subclasses

34453e9

updated stream test and added test for nonstream case

0069b98

added logic to handle the client disconnection in predict

f3d6bd2

update sleep duration

6029165

Update sleep duration

6e95b30

update sleep time

f6f3e4c

removed sleep

9d47245

bhimrazy commented Aug 23, 2024

View reviewed changes

src/litserve/server.py Show resolved Hide resolved

bhimrazy added 6 commits August 23, 2024 11:09

check if is_disconnected exists

86ca3ce

adds sleep

154cc6c

chore: Update sleep duration

39986bf

chore: Update sleep duration in LitServer

2c7633a

tried another approach to check & handle disconnection

ccaeee9

wrap in try catch

f0b19af

bhimrazy changed the title ~~[WIP]: Evict requests if the client has disconnected~~ Evict requests if the client has disconnected (non-batch) Aug 23, 2024

bhimrazy marked this pull request as ready for review August 23, 2024 08:47

Merge branch 'main' into feat/evict-req-on-client-disconnect

dcab100

bhimrazy changed the title ~~Evict requests if the client has disconnected (non-batch)~~ Feat: Evict requests (single mode) if the client has disconnected Aug 23, 2024

Merge branch 'main' into feat/evict-req-on-client-disconnect

b810a66

bhimrazy marked this pull request as draft August 23, 2024 12:25

aniketmaurya changed the title ~~Feat: Evict requests (single mode) if the client has disconnected~~ Feat: Evict requests if the client has disconnected Aug 23, 2024

bhimrazy and others added 5 commits August 24, 2024 01:07

Merge branch 'main' into feat/evict-req-on-client-disconnect

4edab2c

Merge branch 'main' into feat/evict-req-on-client-disconnect

3d9a9a7

Merge branch 'main' into feat/evict-req-on-client-disconnect

5c0d7fc

Merge branch 'main' into feat/evict-req-on-client-disconnect

919b304

Merge branch 'main' into feat/evict-req-on-client-disconnect

6ffe51c

bhimrazy closed this Aug 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat: Evict requests if the client has disconnected #208

Feat: Evict requests if the client has disconnected #208

bhimrazy commented Aug 21, 2024 •

edited

Loading

aniketmaurya commented Aug 21, 2024

bhimrazy commented Aug 21, 2024

aniketmaurya commented Aug 21, 2024

bhimrazy commented Aug 21, 2024

codecov bot commented Aug 21, 2024 •

edited

Loading

williamFalcon commented Aug 23, 2024

bhimrazy commented Aug 26, 2024 •

edited

Loading

Feat: Evict requests if the client has disconnected #208

Feat: Evict requests if the client has disconnected #208

Conversation

bhimrazy commented Aug 21, 2024 • edited Loading

What does this PR do?

Approach

Potential Impacts

TODO

PR review

Did you have fun?

aniketmaurya commented Aug 21, 2024

bhimrazy commented Aug 21, 2024

aniketmaurya commented Aug 21, 2024

bhimrazy commented Aug 21, 2024

codecov bot commented Aug 21, 2024 • edited Loading

Codecov Report

williamFalcon commented Aug 23, 2024

bhimrazy commented Aug 26, 2024 • edited Loading

bhimrazy commented Aug 21, 2024 •

edited

Loading

codecov bot commented Aug 21, 2024 •

edited

Loading

bhimrazy commented Aug 26, 2024 •

edited

Loading