[Bugfix] Use heartbeats instead of health checks #8583

Merged 15 commits on Sep 25, 2024

Commits on Sep 18, 2024

  1. ♻️ Move health checks to separate thread

    Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
    joerunde committed Sep 18, 2024
    d76d41d

Commits on Sep 19, 2024

  1. ♻️ Change to heartbeat

    Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
    joerunde committed Sep 19, 2024
    95d2028
  2. ♻️ more health_check -> heartbeat

    Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
    joerunde committed Sep 19, 2024
    b01690d
  3. 83f43d0
  4. 🐛 set heartbeat interval based on rpc timeout

    Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
    joerunde committed Sep 19, 2024
    3fcd3b3
  5. 🐛 turn up heartbeats

    Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
    joerunde committed Sep 19, 2024
    33ffbf8
  6. 1add88e
  7. ♻️ refactor failed abort test

    Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
    joerunde committed Sep 19, 2024
    15db07c
  8. ♻️ set last alive threshold based on client timeout

    Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
    joerunde committed Sep 19, 2024
    624c640
  9. 🐛 Check for closed sockets on heartbeat

    Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
    joerunde committed Sep 19, 2024
    4703a3d

Commits on Sep 20, 2024

  1. Update vllm/engine/multiprocessing/engine.py

    Co-authored-by: Nick Hill <nickhill@us.ibm.com>
    joerunde and njhill authored Sep 20, 2024
    b135cff
  2. 🎨 fixes from review comments

    Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
    joerunde committed Sep 20, 2024
    10b646d
  3. 5694c17

Commits on Sep 23, 2024

  1. e0cec81

Commits on Sep 24, 2024

  1. 76835d8