-
Notifications
You must be signed in to change notification settings - Fork 4.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Grpc update breaks with Bazel 3.4.0 in docker container #11756
Comments
Looks like something changed so bazel frontend is unable to communicate with the server inside the docker container for RBE autoconfig. |
We see this too with our projects and RBE and 3.4.0:
|
Bazel 3.4.0 broke something and now rbe_autoconfig fails somewhere inside Docker where it tries to start Bazel: https://buildkite.com/bazel/google-bazel-presubmit/builds/36688#262e09df-19ea-47cc-96bf-3c5a4e2d7352 This CL should be rolled back as soon as we figured out how to fix this. The breakage is tracked in: #11756 RELNOTES: None. PiperOrigin-RevId: 320977135
Yes, we're trying to diagnose and get out a patch as fast as we can. |
Same issue here in a Docker runner on CircleCI: jobs:
build:
docker:
- image: circleci/node:12.16.1
environment:
NODE_OPTIONS: --max_old_space_size=4096
resource_class: medium+
|
@nathanhleung Can you confirm that in your case no remote execution or |
Repro without RBE:
|
culprit seems to be dfbf87c |
Not sure how to confirm, but this is our Bazel install step: install_bazel:
# From https://docs.bazel.build/versions/master/install-ubuntu.html
steps:
- run: |
sudo apt install curl gnupg apt-transport-https
curl https://bazel.build/bazel-release.pub.gpg | sudo apt-key add -
echo "deb [arch=amd64] https://storage.googleapis.com/bazel-apt \
stable jdk1.8" | sudo tee /etc/apt/sources.list.d/bazel.list
- run: sudo apt-get update && sudo apt-get -y install bazel
#!/bin/bash
# Exit on error
set -e
echo "# Generated by ./scripts/generate_bazelrc.sh" > .bazelrc
echo "build --remote_cache=https://$BAZEL_CACHE_USER:$BAZEL_CACHE_PASSWORD@cache.company.xyz" >> .bazelrc (nothing else) And build command:
|
We have confirmed that 0415511 fixes the issue |
To avoid conflict, we should also cherry-pick 08bf906 |
(My observation has been that enabling IPv6 in the container fixes this issue.) |
Patch release can be found here: https://releases.bazel.build/3.4.1/rc1/index.html |
Downstream pipeline for Bazel 3.4.1rc1 running here: https://buildkite.com/bazel/bazel-at-head-plus-downstream/builds/1564 |
The bug is mitigated and Bazel 3.4.1 is released. Assigning to Yun who offered to look into creating a test (and probably has an incentive to roll forward the Grpc update). |
FYI @olekw |
*** Reason for rollback *** Bazel 3.4.1 is released and fixes the issue. *** Original change description *** Disable tests on RBE in Bazel's presubmit and postsubmit. Bazel 3.4.0 broke something and now rbe_autoconfig fails somewhere inside Docker where it tries to start Bazel: https://buildkite.com/bazel/google-bazel-presubmit/builds/36688#262e09df-19ea-47cc-96bf-3c5a4e2d7352 This CL should be rolled back as soon as we figured out how to fix this. The breakage is tracked in: #11756 RELNOTES: None. PiperOrigin-RevId: 321120981
This is now fixed with the release of Bazel 3.4.1. |
BTW, netty/netty#10402 is the underlying issue. It could be worked around by not attempting to bind |
This is a workaround for a netty bug netty/netty#10402 that caused the rollback of upgrading grpc-java to 1.26.0 (#11756) Closes #11776. PiperOrigin-RevId: 321342799
This will make io.netty.channel.unix.Socket.isIPv6Preferred() available for fixing bazelbuild#11756
This will make io.netty.channel.unix.Socket.isIPv6Preferred() available for fixing #11756
bazel 3.4.0 is basically broken in docker containers: bazelbuild/bazel#11756 After this lands, I will cut a jazelle release that uses bazel 3.4.1, and upgrade fusion CI to use that new jazelle release. I am hopeful that this will fix our bazel server startup issues.
This will make io.netty.channel.unix.Socket.isIPv6Preferred() available for fixing bazelbuild#11756 # Conflicts: # third_party/BUILD
The underlying issue has been worked around in bazelbuild#11776 Fixes bazelbuild#11756 Closes bazelbuild#11792
https://github.com/uber/fusionjs/pull/1120 bazel 3.4.0 is basically broken in docker containers: bazelbuild/bazel#11756 After this lands, I will cut a jazelle release that uses bazel 3.4.1, and upgrade fusion CI to use that new jazelle release. I am hopeful that this will fix our bazel server startup issues.
https://github.com/uber/fusionjs/pull/1120 bazel 3.4.0 is basically broken in docker containers: bazelbuild/bazel#11756 After this lands, I will cut a jazelle release that uses bazel 3.4.1, and upgrade fusion CI to use that new jazelle release. I am hopeful that this will fix our bazel server startup issues.
https://github.com/uber/fusionjs/pull/1120 bazel 3.4.0 is basically broken in docker containers: bazelbuild/bazel#11756 After this lands, I will cut a jazelle release that uses bazel 3.4.1, and upgrade fusion CI to use that new jazelle release. I am hopeful that this will fix our bazel server startup issues.
Follow up on: bazelbuild/bazel#11776 Fixes bazelbuild/bazel#11756 RELNOTES: None PiperOrigin-RevId: 321541781
Error log:
https://buildkite.com/bazel/google-bazel-presubmit/builds/36688#7500893b-351a-4549-86ae-19803c73cbd1
The text was updated successfully, but these errors were encountered: