Update DataChain worker liveness probe #325

Closed
wants to merge 3 commits into from

Conversation

dreadatour
Contributor

This updates the liveness probe command for the DVCx worker k8s deployment.

Should be merged after https://github.com/iterative/studio/pull/9174 is merged and released.

@dreadatour dreadatour added the studio Everything related to the Studio Helm chart label Feb 15, 2024
@dreadatour dreadatour requested a review from a team February 15, 2024 11:24
@mjasion mjasion force-pushed the studio-dvcx-worker-liveness-probe branch from 3c85aa9 to 181b4df Compare February 19, 2024 10:21
@mjasion mjasion enabled auto-merge February 19, 2024 10:21
@mjasion mjasion disabled auto-merge February 19, 2024 10:28
@mjasion mjasion force-pushed the studio-dvcx-worker-liveness-probe branch from 9a3f199 to 3e584d0 Compare February 19, 2024 18:31
@dreadatour
Contributor Author

Waiting for this PR to be merged and deployed to dev and prod: https://github.com/iterative/studio/pull/9247

@shcheklein
Member

@dreadatour what is the status of this?

@shcheklein
Member

@dreadatour ping

@shcheklein
Member

folks @dreadatour and @iterative/platform - what is the status here?

@0x2b3bfa0
Member

It looks like https://github.com/iterative/studio/pull/9174 mentioned on #325 (comment) was released 4+ months (!) ago; after #438 we no longer need this liveness probe.

Still, we should have some liveness probe, maybe one that checks that the Celery workers are storing heartbeats in Redis, or something else that assures us the pod is healthy and ready to ingest new tasks.
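To make the heartbeat idea concrete, here is a minimal sketch of what such a check could look like. It is not the existing `/app/bin/liveness-probe.sh` and not anything from the Studio codebase: the key name `worker:heartbeat`, the 60-second threshold, and the assumption that each worker periodically writes a Unix timestamp to that key are all hypothetical. The store is passed in as a plain callable so the logic stays independent of any particular Redis client.

```python
import math
import time
from typing import Callable, Optional

# Hypothetical values for illustration only; neither comes from Studio.
HEARTBEAT_KEY = "worker:heartbeat"
MAX_AGE_SECONDS = 60.0


def is_heartbeat_fresh(raw, now: float, max_age: float = MAX_AGE_SECONDS) -> bool:
    """Return True if `raw` holds a Unix timestamp no older than `max_age`."""
    if raw is None:
        # No heartbeat was ever written, or the key expired.
        return False
    try:
        # Key-value stores often return bytes; accept str and numbers too.
        text = raw.decode() if isinstance(raw, bytes) else raw
        last_beat = float(text)
    except (TypeError, ValueError):
        # Unparseable value: treat the worker as unhealthy.
        return False
    if not math.isfinite(last_beat):
        return False
    return now - last_beat <= max_age


def probe(read_heartbeat: Callable[[str], object], now: Optional[float] = None) -> int:
    """Return a process exit code: 0 when healthy, 1 otherwise."""
    now = time.time() if now is None else now
    try:
        raw = read_heartbeat(HEARTBEAT_KEY)
    except Exception:
        # If the store is unreachable we cannot confirm liveness.
        return 1
    return 0 if is_heartbeat_fresh(raw, now) else 1
```

In a real probe script, `read_heartbeat` could be the `get` method of a Redis client, and the script would end with `sys.exit(probe(client.get))`; a Kubernetes exec probe treats a non-zero exit code as a failure. Whether an unreachable Redis should fail the liveness probe (and restart the pod) or only the readiness probe is a design choice this sketch does not settle.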

@0x2b3bfa0 0x2b3bfa0 changed the title Update DVCx worker liveness probe command for k8s deployment Update DataChain worker liveness probe command for k8s deployment Jul 16, 2024
@0x2b3bfa0 0x2b3bfa0 changed the title Update DataChain worker liveness probe command for k8s deployment Update DataChain worker liveness probe Jul 16, 2024
@0x2b3bfa0
Member

0x2b3bfa0 commented Jul 16, 2024

So there's nothing wrong with this pull request, and we can merge it after re-designing /app/bin/liveness-probe.sh

@shcheklein
Member

There was some code duplication around SA liveness probes and tests on the Studio side that we should also clean up along the way.

@0x2b3bfa0
Member

Maybe https://github.com/iterative/studio/pull/10307#discussion1683252817 can provide better readiness/liveness probes for DataChain than anything else, if it gets merged.

@0x2b3bfa0
Member

Stale; opened #504 to track this

@0x2b3bfa0 0x2b3bfa0 closed this Nov 30, 2024
@0x2b3bfa0 0x2b3bfa0 deleted the studio-dvcx-worker-liveness-probe branch November 30, 2024 02:17
Labels
studio Everything related to the Studio Helm chart
4 participants