-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Jobs] Nightly test - submitting job results in GCS crash #32367
Comments
nvm |
|
so it's likely introduced by b2c5e63 |
We suspect #32213 is caused by the same commit |
Also saw this in
|
Thanks for the help with the investigation! If the check failure |
Same issue as #32213, moving discussion there |
What happened + What you expected to happen
I've been switching over the execution mode of nightly tests to Jobs, and I think I gave Jobs more test coverage :) it seems like
shuffle_20gb_with_state_api
failed right after cluster startup when we attempted to submit a job due to a GCS crash. Here's the stacktrace:Logs from head node:
head-10.0.4.212-i-0086e889d3a72c5b1.zip
Link to cluster: https://console.anyscale-staging.com/o/anyscale-internal/projects/prj_ksaufjuihy7h6ww7abh5gwlqjh/clusters/ses_yzx1g7qa3jrnnhysv8akxz6hgg
Versions / Dependencies
Nightly
Reproduction script
Run
shuffle_20gb_with_state_api
against this PR: #32204Issue Severity
High: It blocks me from completing my task.
The text was updated successfully, but these errors were encountered: