Replies: 1 comment 5 replies
-
This is a kind to timeout. It means your server(oap) can't process the data in time. |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Description:
We encountered a situation where the SkyWalking Java agent logged repeated
DEADLINE_EXCEEDED
errors while trying to callServiceManagementClient
. Shortly after those errors appeared, multiple threads began consuming 100% of the CPU indefinitely. Thread dumps show these busy threads are stuck inorg.apache.skywalking.apm.dependencies.io.netty.util.internal.shaded.org.jctools.queues.BaseMpscLinkedArrayQueue.poll
(i.e.,MpscUnboundedArrayQueue.poll
) within the Netty event loop.Below are the relevant log messages and a snippet of one such thread stack trace:
And one of the 100% CPU threads shows the following stack:
These threads appear to be in a busy-spin loop (
RUNNABLE
state) underBaseMpscLinkedArrayQueue.poll()
, using 100% CPU.Environment:
Additional Context or Screenshots:
Collector traceSegment service doesn't response in xxx seconds. ...
grpc-nio-worker-ELG-*
threads go intoRUNNABLE
state in a loop.Could you please advise if this is a known bug or a configuration issue? Any recommended workaround or fix would be appreciated.
Beta Was this translation helpful? Give feedback.
All reactions