ci(ssh): revert using `ssh-compute` action & increase sshd connection limit
#5367
Motivation: We've been trying multiple solutions to our SSH connection issues; our last attempt at solving them was PR https://github.com/ZcashFoundation/zebra/pull/5367/files
Depends-On: #5367
Expected behavior: An SSH connection should not be terminated by the server; it must be kept alive indefinitely until GitHub Actions kills it.
Solution: Disable TCP keepalive messages from the server and set `ClientAliveCountMax` to 0, which disables connection termination.
Looks great!
I'd like to check that the sshd connection limit adjustment actually worked before we merge this, since we're calling the script a different way now.
We might also want to use the bullseye image for the instances.
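One way to check that the limit was applied is to look at sshd's effective configuration on the instance. The following is only a sketch: `sshd_effective_value` is a hypothetical helper (not part of this PR) that reads `sshd -T` style output on stdin, where each line is a lowercase keyword followed by its value.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper (not from this PR): print the value of one keyword
# from `sshd -T` style output on stdin. Keywords are matched
# case-insensitively, and only the first match is printed.
sshd_effective_value() {
  local key="$1"
  awk -v k="$key" '
    !found && tolower($1) == tolower(k) {
      $1 = ""            # drop the keyword, keep the rest of the line
      sub(/^ +/, "")     # trim the separator left behind
      print
      found = 1
    }
  '
}

# Assumed usage on the VM (needs root and an installed sshd):
#   sudo sshd -T | sshd_effective_value maxstartups
```

The printed value may be in `start:rate:full` form rather than a single number, so it is probably safer to inspect it by eye than to compare it against the literal `500`.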
Sorry for adding chore: fix tj-actions/changed-files file comparison to this PR. But it's annoying having an ❌ in this PR and this will be pulled to
Edit: I had to fix it here anyway, as it wouldn't merge otherwise.
CI is failing with a new (unrelated?) error 🥲
We've changed two things related to the disk in this PR:
Here are some things we could try:
Is there anything else we could try?
That would only work if the instance startup script is causing the disk to be used.
@gustavovalverde I just realised that these disk resize commands are only needed when we change the disk size. Specifically, they are only needed between:
So let's ignore the failure for now, and fix it if it becomes a problem after #5085?
Thanks, looks good, let's get it fixed!
Motivation
We've been having multiple errors (more than before) after implementing ssh-compute
Solution
- Revert the `ssh-compute` implementation
- Use `sudo` in some docker commands, as we might be connecting to the VMs with a non-root user
- Increase the `MaxStartups` setting in sshd to 500, to avoid a failing SSH connection causing the GitHub Action to fail

Note: the `tj-actions/changed-files` file comparison was failing in this PR and has been failing in the `main` branch, so we're adding a fix here too, as this would otherwise halt the PR from merging. The fix is explained in tj-actions/changed-files#639 (comment)

Closes #5358
Closes #5365
Fixes #5362
Fixes #5361
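For reference, the `MaxStartups` change described in the Solution could be sketched roughly as below. This is not necessarily how `gcp-vm-startup-script.sh` does it: `set_sshd_option` is a hypothetical helper, and the sketch assumes the target file has no trailing `Match` block (an appended line would otherwise end up inside it).

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper (an assumption, not the PR's actual script):
# set KEY to VALUE in an sshd_config-style FILE.
# sshd uses the first value it reads for a keyword, so any existing line
# for KEY (active or commented out) is dropped before the new one is appended.
set_sshd_option() {
  local file="$1" key="$2" value="$3" tmp
  tmp="$(mktemp)"
  # grep exits non-zero when it prints nothing, which is fine here.
  grep -Eiv "^[#[:space:]]*${key}([[:space:]]|\$)" "$file" > "$tmp" || true
  printf '%s %s\n' "$key" "$value" >> "$tmp"
  # Write back through the original path to keep its owner and mode.
  cat "$tmp" > "$file"
  rm -f "$tmp"
}

# Assumed usage as root on the VM, followed by an sshd restart:
#   set_sshd_option /etc/ssh/sshd_config MaxStartups 500
```

The helper is idempotent, so re-running the startup script would leave a single `MaxStartups` line in place.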
Review
If CI passes, anyone can review this PR
Reviewer Checklist
gcp-vm-startup-script.sh
Follow Up Work
We might also want to wait up to 90 seconds after a VM has been created, just so we're sure all configurations are complete.
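A possible shape for that wait is to poll a readiness check until it passes or the 90 seconds run out, instead of sleeping for a fixed time. In this sketch `wait_until_ready` is a hypothetical helper, and the readiness command, `$INSTANCE` and `$ZONE` in the usage comment are assumptions.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: run a readiness command repeatedly until it succeeds
# or TIMEOUT seconds have elapsed, pausing INTERVAL seconds between attempts.
# Returns 0 as soon as the command succeeds, 1 on timeout.
wait_until_ready() {
  local timeout="$1" interval="$2"
  shift 2
  local deadline=$(( SECONDS + timeout ))
  while true; do
    if "$@"; then
      return 0
    fi
    if (( SECONDS >= deadline )); then
      return 1
    fi
    sleep "$interval"
  done
}

# Assumed usage from the workflow, waiting up to 90 seconds for SSH to respond
# (INSTANCE and ZONE are placeholder variables):
#   wait_until_ready 90 5 gcloud compute ssh "$INSTANCE" --zone "$ZONE" --command 'true'
```

Polling returns as soon as the VM is reachable, so the full 90 seconds would only be spent on instances that are slow to configure.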