Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bug: barrier stuck for minutes #9723

Closed
Tracked by #6640
fuyufjh opened this issue May 10, 2023 · 3 comments
Closed
Tracked by #6640

bug: barrier stuck for minutes #9723

fuyufjh opened this issue May 10, 2023 · 3 comments
Assignees
Milestone

Comments

@fuyufjh
Copy link
Member

fuyufjh commented May 10, 2023

Describe the bug

The metrics diagram looks broken, because no barriers are collected during a long time.

Both barrier number and latency are very high.

image

During the whole running https://g-2927a1b4d9.grafana-workspace.us-east-1.amazonaws.com/d/EpkBw5W4k/risingwave-test-dashboard?org[…]90631&from=1683537069000&to=1683623809000

image

To Reproduce

Longevity test

Expected behavior

No response

Additional context

Await-Tree Dump of Compute Node 3.txt
Await-Tree Dump of Compute Node 4.txt
Await-Tree Dump of Compute Node 5.txt

See more context in Slack channel: https://risingwave-labs.slack.com/archives/C034TRPKN1F/p1683627627166979

@kwannoel
Copy link
Contributor

@kwannoel
Copy link
Contributor

Test configuration:

Screenshot 2023-05-12 at 1 48 54 PM

@fuyufjh
Copy link
Member Author

fuyufjh commented May 26, 2023

The problem is caused by:

  1. The state of distinct table of aggregation has bad locality
  2. When the amount of data reached become larger than meta cache, the meta cache miss caused high read latency
  3. The read latency caused longer barrier latency, since the number of rows in one epoch is constant (not well backpressured, which may be a potential problem to be improved)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants