Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bazel CI: RBE builds are broken after grpc java upgrade #12264

Closed
meteorcloudy opened this issue Oct 14, 2020 · 23 comments
Closed

Bazel CI: RBE builds are broken after grpc java upgrade #12264

meteorcloudy opened this issue Oct 14, 2020 · 23 comments
Labels
breakage P1 I'll work on this now. (Assignee required) team-Remote-Exec Issues and PRs for the Execution (Remote) team type: bug

Comments

@meteorcloudy
Copy link
Member

https://buildkite.com/bazel/bazel-auto-sheriff-face-with-cowboy-hat/builds/306

ERROR: /var/lib/buildkite-agent/.cache/bazel/_bazel_buildkite-agent/cfad747ece6c2992c5b867a14a43555e/external/org_golang_x_crypto/curve25519/BUILD.bazel:3:11: GoCompilePkg external/org_golang_x_crypto/curve25519/curve25519.a failed (Exit 34): com.google.devtools.build.lib.remote.BulkTransferException
	at com.google.devtools.build.lib.remote.RemoteCache.waitForBulkTransfer(RemoteCache.java:225)
	at com.google.devtools.build.lib.remote.RemoteCache.download(RemoteCache.java:331)
	at com.google.devtools.build.lib.remote.RemoteSpawnRunner.downloadAndFinalizeSpawnResult(RemoteSpawnRunner.java:486)
	at com.google.devtools.build.lib.remote.RemoteSpawnRunner.exec(RemoteSpawnRunner.java:306)
	at com.google.devtools.build.lib.exec.SpawnRunner.execAsync(SpawnRunner.java:240)
	at com.google.devtools.build.lib.exec.AbstractSpawnStrategy.exec(AbstractSpawnStrategy.java:134)
	at com.google.devtools.build.lib.exec.AbstractSpawnStrategy.exec(AbstractSpawnStrategy.java:102)
	at com.google.devtools.build.lib.actions.SpawnStrategy.beginExecution(SpawnStrategy.java:47)
	at com.google.devtools.build.lib.exec.SpawnStrategyResolver.beginExecution(SpawnStrategyResolver.java:65)
	at com.google.devtools.build.lib.analysis.actions.SpawnAction.beginExecution(SpawnAction.java:331)
	at com.google.devtools.build.lib.actions.Action.execute(Action.java:127)
	at com.google.devtools.build.lib.skyframe.SkyframeActionExecutor$4.execute(SkyframeActionExecutor.java:859)
	at com.google.devtools.build.lib.skyframe.SkyframeActionExecutor$ActionRunner.continueAction(SkyframeActionExecutor.java:1019)
	at com.google.devtools.build.lib.skyframe.SkyframeActionExecutor$ActionRunner.run(SkyframeActionExecutor.java:978)
	at com.google.devtools.build.lib.skyframe.ActionExecutionState.runStateMachine(ActionExecutionState.java:129)
	at com.google.devtools.build.lib.skyframe.ActionExecutionState.getResultOrDependOnFuture(ActionExecutionState.java:81)
	at com.google.devtools.build.lib.skyframe.SkyframeActionExecutor.executeAction(SkyframeActionExecutor.java:469)
	at com.google.devtools.build.lib.skyframe.ActionExecutionFunction.checkCacheAndExecuteIfNeeded(ActionExecutionFunction.java:845)
	at com.google.devtools.build.lib.skyframe.ActionExecutionFunction.compute(ActionExecutionFunction.java:314)
	at com.google.devtools.build.skyframe.AbstractParallelEvaluator$Evaluate.run(AbstractParallelEvaluator.java:438)
	at com.google.devtools.build.lib.concurrent.AbstractQueueVisitor$WrappedRunnable.run(AbstractQueueVisitor.java:398)
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
	at java.base/java.lang.Thread.run(Unknown Source)
	Suppressed: java.io.IOException: io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

Verified by building with Bazel@d4cd4e7ab18ebeae4152dafc113367289ffebb12 and its previous commit:
https://buildkite.com/bazel/culprit-finder/builds/581
https://buildkite.com/bazel/culprit-finder/builds/582

Culprit: d4cd4e7

@meteorcloudy meteorcloudy added breakage P1 I'll work on this now. (Assignee required) type: bug labels Oct 14, 2020
@meteorcloudy
Copy link
Member Author

meteorcloudy commented Oct 14, 2020

FYI @dmivankov, can you help look into this issue? Is this a bug in protobuf?

@meteorcloudy
Copy link
Member Author

Similar issue and fix: pravega/pravega#1621, pravega/pravega#1622

@meteorcloudy
Copy link
Member Author

meteorcloudy commented Oct 14, 2020

@dmivankov
Copy link
Contributor

dmivankov commented Oct 14, 2020

Interestingly I couldn't immediately find keep-alive default timeout changes in https://github.com/grpc/grpc-java v1.26.0->v1.31.1 or in https://github.com/netty/netty netty-4.1.42.Final -> netty-4.1.48.Final

20sec in GrpcUtil
https://github.com/grpc/grpc-java/blob/v1.31.1/core/src/main/java/io/grpc/internal/GrpcUtil.java#L205
vs
https://github.com/grpc/grpc-java/blob/v1.26.0/core/src/main/java/io/grpc/internal/GrpcUtil.java#L203

But this looks interesting
grpc/grpc-java#7015 auto flow control was turned on between 1.26.0 and 1.31.1
grpc/grpc-java#7302 users reported problems

Does setting environment variable

GRPC_EXPERIMENTAL_AUTOFLOWCONTROL=false

fix the issue on client side?
If yes (or maybe even if it doesn't?), we can try

NettyServerBuilder builder;
..
builder.flowControlWindow(NettyServerBuilder.DEFAULT_FLOW_CONTROL_WINDOW)

in https://github.com/bazelbuild/bazel/search?q=NettyServerBuilder
note: there seems to be no method to directly disable auto flow control other than manually setting default window https://github.com/grpc/grpc-java/blob/master/netty/src/main/java/io/grpc/netty/NettyServerBuilder.java#L387

dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 14, 2020
Was turned on by default during 1.26.0->1.31.1 grpc-java bump
It seems that it may be causing errors in RBE:

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

bazelbuild#12264
grpc/grpc-java#7302
@dmivankov
Copy link
Contributor

Drafted a PR #12266
Disclaimer: I neither tested it myself nor have reproduced the error in question

@dmivankov
Copy link
Contributor

dmivankov commented Oct 14, 2020

We can also try a forward fix going for v1.32.2
https://github.com/grpc/grpc-java/releases/tag/v1.32.2

netty: BDP ping accounting should occur after flow control. This resolves an incompatibility issue introduced in v1.30.0 and could be worked around via GRPC_EXPERIMENTAL_AUTOFLOWCONTROL=false introduced later. The symptom was a GOAWAY with “too_many_pings” without an aggressive keepalive configured. The environment variable is still available, but will be removed in the future

grpc/grpc-java#7446
grpc/grpc-java#7503

@ulfjack
Copy link
Contributor

ulfjack commented Oct 14, 2020

The original default keepalive was to send no keepalive. Maybe that has also changed upstream? That seems more likely to explain the problem.

@aiuto aiuto added the team-Remote-Exec Issues and PRs for the Execution (Remote) team label Oct 15, 2020
@coeuvre
Copy link
Member

coeuvre commented Oct 15, 2020

Since gRPC v1.32.2 fixes this, can we upgrade to that version instead?

@dmivankov
Copy link
Contributor

yes, auto flow enables pinging
https://github.com/grpc/grpc-java/blob/v1.26.0/netty/src/main/java/io/grpc/netty/AbstractNettyHandler.java#L141 - this is where auto flow pinging gets enabled in v1.26.0 (same in v1.31.1, but v1.31.1 enables auto flow by default for both client&server)

Given that auto flow control is a new feature and there's some indication that it caused the regression I'd rather try disabling it first #12266 as a more solid option.

v1.32.2 has fixes in that area, but it takes more PRs to bump again, unless there's an easy way to check whether it really helps before merging probably a good idea to try a faster fix.

I will prepare v1.32.2 though

@dmivankov
Copy link
Contributor

interesting bit in grpc core 1.32.0
grpc/grpc#23313 fixing grpc/grpc#16210
hopefully it also improves situation rather than introduces new bugs around same area :)

dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 1: add v1.32.x version to third_party/grpc
Note: partly switches to v1.32.x too as not all bits are versioned and
      some of unversioned bits are used from other third_party targets

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 1: add v1.32.x version to third_party/grpc
Note: partly switches to v1.32.x too as not all bits are versioned and
      some of unversioned bits are used from other third_party targets

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 1: add v1.32.x version to third_party/grpc
Note: partly switches to v1.32.x too as not all bits are versioned and
      some of unversioned bits are used from other third_party targets

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 2: switch to v1.32.x

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 3: remove 1.31.1 from third_party/grpc

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
bazel-io pushed a commit that referenced this issue Oct 15, 2020
Was turned on by default during 1.26.0->1.31.1 grpc-java bump
It seems that it may be causing errors in RBE:

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

#12264
grpc/grpc-java#7302

Closes #12266.

PiperOrigin-RevId: 337254515
@coeuvre
Copy link
Member

coeuvre commented Oct 15, 2020

@meteorcloudy The fix is merged as 6e94b05. Can you run the tests with it?

@meteorcloudy
Copy link
Member Author

Just launched a downstream test here: https://buildkite.com/bazel/bazel-at-head-plus-downstream/builds/1701

@coeuvre
Copy link
Member

coeuvre commented Oct 15, 2020

dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 1: add v1.32.x version to third_party/grpc
Note: partly switches to v1.32.x too as not all bits are versioned and
      some of unversioned bits are used from other third_party targets

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 2: switch to v1.32.x

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 3: remove 1.31.1 from third_party/grpc

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 1: add v1.32.x version to third_party/grpc
Note: partly switches to v1.32.x too as not all bits are versioned and
      some of unversioned bits are used from other third_party targets

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 2: switch to v1.32.x

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 3: remove 1.31.1 from third_party/grpc

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 2: switch to v1.32.x

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 3: remove 1.31.1 from third_party/grpc

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
@dmivankov
Copy link
Contributor

Ok, so we can try next one of two things

Is there a way to run those RBE tests on PRs?

@meteorcloudy
Copy link
Member Author

Yes, I'll help test #12273

@meteorcloudy
Copy link
Member Author

@meteorcloudy
Copy link
Member Author

The RBE build seems to be fixed by upgrading to 1.32.x.
Another test with auto flow control feature enabled: https://buildkite.com/bazel/bazel-at-head-plus-downstream/builds/1703

@meteorcloudy
Copy link
Member Author

The rules_haskell failure is caused by something else. So it looks like upgrading grpc to 1.32.x does fix the issue and allows us to safely bring back auto flow control.

@dmivankov
Copy link
Contributor

Great, then I can make the PRs: add 1.32.x, switch to 1.32.x & bring auto flow control back, drop 1.31.1

dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 15, 2020
Part 1: add v1.32.x version to third_party/grpc
Note: partly switches to v1.32.x too as not all bits are versioned and
      some of unversioned bits are used from other third_party targets
Composed PR: bazelbuild#12273

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
bazel-io pushed a commit that referenced this issue Oct 16, 2020
Part 1: add v1.32.x version to third_party/grpc
Note: partly switches to v1.32.x too as not all bits are versioned and
      some of unversioned bits are used from other third_party targets
Composed PR: #12273

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

#12264

Note: also an attempt and disabling auto flow by default is made in
#12266

Closes #12279
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 16, 2020
Part 2: switch to v1.32.x

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
bazel-io pushed a commit that referenced this issue Oct 16, 2020
Part 2: switch to v1.32.x

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

#12264

Note: also an attempt and disabling auto flow by default is made in
#12266

Also turn auto flow control feature back on

This reverts commit 6e94b05.

Closes #12288.

PiperOrigin-RevId: 337485572
dmivankov added a commit to dmivankov/bazel that referenced this issue Oct 16, 2020
Part 3: remove 1.31.1 from third_party/grpc

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266
@meteorcloudy
Copy link
Member Author

This is fixed by upgrading grpc java version

bazel-io pushed a commit that referenced this issue Oct 19, 2020
Part 3: remove 1.31.1 from third_party/grpc

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

#12264

Note: also an attempt and disabling auto flow by default is made in
#12266

Closes #12289
coeuvre pushed a commit to coeuvre/bazel that referenced this issue Oct 22, 2020
Part 1: add v1.32.x version to third_party/grpc
Note: partly switches to v1.32.x too as not all bits are versioned and
      some of unversioned bits are used from other third_party targets
Composed PR: bazelbuild#12273

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266

Closes bazelbuild#12279
coeuvre pushed a commit to coeuvre/bazel that referenced this issue Oct 22, 2020
Part 2: switch to v1.32.x

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266

Also turn auto flow control feature back on

This reverts commit 6e94b05.

Closes bazelbuild#12288.

PiperOrigin-RevId: 337485572
coeuvre pushed a commit to coeuvre/bazel that referenced this issue Oct 22, 2020
Part 3: remove 1.31.1 from third_party/grpc

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266

Closes bazelbuild#12289
coeuvre pushed a commit to coeuvre/bazel that referenced this issue Oct 22, 2020
Part 1: add v1.32.x version to third_party/grpc
Note: partly switches to v1.32.x too as not all bits are versioned and
      some of unversioned bits are used from other third_party targets
Composed PR: bazelbuild#12273

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266

Closes bazelbuild#12279
coeuvre pushed a commit to coeuvre/bazel that referenced this issue Oct 22, 2020
Part 2: switch to v1.32.x

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266

Also turn auto flow control feature back on

This reverts commit 6e94b05.

Closes bazelbuild#12288.

PiperOrigin-RevId: 337485572
coeuvre pushed a commit to coeuvre/bazel that referenced this issue Oct 22, 2020
Part 3: remove 1.31.1 from third_party/grpc

grpc-java transition from v1.26.0 to v1.31.1 enabled auto flow control
which  started failing in RBE with

io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
HTTP/2 error code: ENHANCE_YOUR_CALM
Received Goaway
too_many_pings

grpc-java v1.32.2 has a bugfix attempt on that
grpc v1.32.0 also has something new around keepalive pings

Hopefully version bump to those helps

bazelbuild#12264

Note: also an attempt and disabling auto flow by default is made in
bazelbuild#12266

Closes bazelbuild#12289
luca-digrazia pushed a commit to luca-digrazia/DatasetCommitsDiffSearch that referenced this issue Sep 4, 2022
    Was turned on by default during 1.26.0->1.31.1 grpc-java bump
    It seems that it may be causing errors in RBE:

    io.grpc.StatusRuntimeException: RESOURCE_EXHAUSTED: Bandwidth exhausted
    HTTP/2 error code: ENHANCE_YOUR_CALM
    Received Goaway
    too_many_pings

    bazelbuild/bazel#12264
    grpc/grpc-java#7302

    Closes #12266.

    PiperOrigin-RevId: 337254515
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
breakage P1 I'll work on this now. (Assignee required) team-Remote-Exec Issues and PRs for the Execution (Remote) team type: bug
Projects
None yet
Development

No branches or pull requests

5 participants