-
Notifications
You must be signed in to change notification settings - Fork 145
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add storage related metrics #2044
Comments
@raphaelsc : scylla_sstables_currently_open_for_reading add to detiled |
sum(rate(scylla_sstables_index_page_misses[120s])) by (instance) / (sum(rate(scylla_sstables_single_partition_reads[120s])) by (instance)) -> read amplification due to promoted index reads. Maybe need to add range scans to the divisor since they also generate index reads. |
I would like to export the |
#2044 (comment) would already give us this, on a per-scheduling group basis. |
the name of the metric is a bit misleading. it's actually sstables_currently_available_for_reading (i.e total number of sstables in the system). |
I see, so it is something else then. Maybe both are valuable then, see how much of the total sstables we need for each read. |
indeed. In my latest adventures, I have been using it a lot to correlate growth in non lsa with number of sstables (e.g. after a node op). |
I see that the |
@michoecho can you please look 2044#issuecomment-1680476322 I saw that you are the last one who touched the relevant code. I need a clear explenation on what the calculation should be, if possible with reasoning. |
I can't give a clear explanation for the calculation without a clear understanding of what is being calculated. |
@michoecho @avikivity ping |
@michoecho @avikivity ping I'm about to branch 4.6 and would like to add it to the release |
@michoecho @avikivity ping |
1 similar comment
@michoecho @avikivity ping |
What are you pinging me for? |
@michoecho pining you and @avikivity |
@avikivity @denesb @raphaelsc @michoecho I will branch 4.7 soon and would like to have it in the release. Can we make a decision on what to include? |
I have nothing more, other than my existing comments: #2044 (comment) As for the decision, I don't know what you mean, which decision? |
@denesb I'm looking for a bottom line regarding what to add. That should be an actual metric/calculation; too many options are floating around with no concrete resolution. |
I mentioned two metrics in the comment: |
and @raphaelsc had his thoughts around them, I'm looking for the bottom line after all conversations end. |
Right, he requested |
I see that the class label is not the same as the scheduling_group_name label (tested with scylla 2023.1.2) what's the relation between them (if any) and what does the user understand? |
Class is one of user, system or maintenance. These refer to the context in which some work (query) is being done. User refers to all work done on behalf of a user (driver) request. Maintenance refers to all work done as a background, maintenance work (think compaction, repair, streaming etc.). System is everything else. |
Add disk and sstable related metrics either to an existing dashboard or to a new dashboard
The text was updated successfully, but these errors were encountered: