Use Heatmap panel to represent Tablets status #2111

Closed
tzach opened this issue Nov 6, 2023 · 10 comments
Labels: enhancement (New feature or request)

Comments

@tzach
Contributor

tzach commented Nov 6, 2023

Real-time status of tablets, similar to tablet-mon.py, but as part of a dashboard.
image

@tzach tzach added the enhancement New feature or request label Nov 6, 2023
@tzach
Contributor Author

tzach commented Nov 6, 2023

@tgrabiec FYI

@amnonh
Collaborator

amnonh commented Nov 6, 2023

@tzach besides being a cool representation, can you describe a scenario where this is useful?

@tzach
Contributor Author

tzach commented Nov 6, 2023

It would be beneficial to see, for a specific table, how its tablets are distributed across the nodes.
See tablet-mon.py, an in-terminal tool already in use.

@tgrabiec
Contributor

Sending detailed tablet metrics may be too much. We disable per-table stats by default, and per-tablet stats will be heavier than that.

We can present aggregate metrics for migrating tablets per shard.
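
To illustrate the aggregate approach, here is a minimal Python sketch. It assumes a hypothetical list of in-flight migration records; the `TabletMigration` type and its field names are invented for illustration and are not ScyllaDB's schema or metric names. It collapses per-tablet detail into one count per (node, shard), so the number of values to export grows with the number of shards rather than the number of tablets.

```python
from collections import Counter
from typing import Iterable, NamedTuple


class TabletMigration(NamedTuple):
    # Hypothetical record; field names are illustrative only.
    table: str
    tablet_id: int
    node: str
    shard: int


def migrating_tablets_per_shard(migrations: Iterable[TabletMigration]) -> Counter:
    """Count in-flight migrations per (node, shard), dropping table and tablet identity."""
    return Counter((m.node, m.shard) for m in migrations)


# Made-up sample data: four migrating tablets across two nodes.
migrations = [
    TabletMigration("ks.t1", 0, "node1", 0),
    TabletMigration("ks.t1", 1, "node1", 0),
    TabletMigration("ks.t1", 2, "node1", 1),
    TabletMigration("ks.t2", 0, "node2", 0),
]

per_shard = migrating_tablets_per_shard(migrations)
for (node, shard), count in sorted(per_shard.items()):
    print(f"{node} shard {shard}: {count} migrating")
```

Four per-tablet records become three per-shard counters here; in a real cluster the gap would be far larger, which is the point of aggregating before export.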

@amnonh
Collaborator

amnonh commented Nov 21, 2023

I was thinking: if we can expose the number of tablets per shard/node, we can draw bars that show the distribution.
Naturally, showing every individual tablet in a real cluster is too much.
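
As a rough sketch of the bar idea, the Python below counts tablet replicas per node and renders one text bar per node. The input format (a list of `(node, shard)` pairs) and the helper names `tablets_per_node` and `render_bars` are assumptions made for this example; in a dashboard the counts would come from a metric and the bars from a panel.

```python
from collections import Counter


def tablets_per_node(replicas):
    """Count tablet replicas per node from (node, shard) pairs."""
    return Counter(node for node, _shard in replicas)


def render_bars(counts, width=20):
    """Return one text bar per node, scaled so the largest count fills `width`."""
    if not counts:
        return []
    peak = max(counts.values())
    lines = []
    for node in sorted(counts):
        # Integer scaling keeps the bar length deterministic.
        filled = counts[node] * width // peak if peak else 0
        lines.append(f"{node:<8}{'#' * filled} {counts[node]}")
    return lines


# Made-up placement: one (node, shard) pair per tablet replica.
replicas = [
    ("node1", 0), ("node1", 1), ("node1", 0),
    ("node2", 0),
    ("node3", 0), ("node3", 1),
]

counts = tablets_per_node(replicas)
bars = render_bars(counts, width=6)
print("\n".join(bars))
```

Grouping by `(node, shard)` instead of `node` would give the per-shard view with the same code.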

@tzach
Contributor Author

tzach commented Nov 21, 2023

> Sending detailed tablet metrics may be too much. We disable per-table stats by default, and per-tablet stats will be heavier than that.

Even if it is too much for production, it would be helpful for debugging and demos.

@tgrabiec
Contributor

@tzach So what would you like the heat map to show exactly?

@tgrabiec
Contributor

I feel that what Grafana provides is too limited in its presentation and update frequency. What tablet-mon is generally useful for is getting an intuition about the behavior of the system in real time by observing the movement of its parts, and for that you need (1) a high frequency of updates and (2) animation. Grafana has neither; it's good at presenting coarse-grained statistics.

@tgrabiec
Contributor

For example, with tablet-mon I was able to immediately identify stalls in tablet phase transitions that shouldn't be there. That would be lost in the noise in Grafana. I think the two tools serve different purposes.

@amnonh amnonh changed the title Use Hashmap panel to represent Tablets status Use Heatmap panel to represent Tablets status Feb 13, 2024
@amnonh
Collaborator

amnonh commented Feb 13, 2024

closed by #2170

@amnonh amnonh closed this as completed Feb 13, 2024
@amnonh amnonh added this to the Monitoring 4.7 milestone Mar 7, 2024