-
Notifications
You must be signed in to change notification settings - Fork 25.6k
Enhanced segment files sizes information in Nodes Stats/Indices Stats APIs #71643
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Pinging @elastic/es-core-features (Team:Core/Features) |
ywelsch
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think that these information give a better view of the segment files and are useful in many cases, specially with searchable snapshots whose segment stats can now be introspected thanks to the include_unloaded_segments parameter.
agree. LGTM
jpountz
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 to make index stats about files. I think that some users found it useful to extract CFS files because it helped reason about what takes disk space, which hopefully #68508 will address.
|
Thanks Yannick and Adrien! |
… APIs (elastic#71643) Since elastic#16661 it is possible to know the total sizes for some Lucene segment files by using the Node Stats or Indices Stats API with the include_segment_file_sizes parameter, and the list of file extensions has been extended in elastic#71416. This commit adds a bit more information about file sizes like the number of files (count), the min, max and average file sizes in bytes that share the same extension. Here is a sample: "cfs" : { "description" : "Compound Files", "size_in_bytes" : 2260, "min_size_in_bytes" : 2260, "max_size_in_bytes" : 2260, "average_size_in_bytes" : 2260, "count" : 1 } This commit also simplifies how compound file sizes were computed: before compound segment files were extracted and sizes aggregated with regular non-compound files sizes (which can be confusing and out of the scope of the original issue elastic#6728), now CFS/CFE files appears as distinct files. These new information are provided to give a better view of the segment files and are useful in many cases, specially with frozen searchable snapshots whose segment stats can now be introspected thanks to the include_unloaded_segments parameter.
|
Is it breaking enough that it should only go in 8.0? |
I think it can go in 7.13.0. The REST response contains more information now but the previous fields are unchanged; the set of files has been already extended in #71416 but existing extensions and descriptions remain the same (just more extensions now); CFS/CFE is computed differently but the memory infos are still the same. |
|
@ywelsch do you have an opinion? |
|
I think this is not breaking for the reasons that you've outlined, so 7.13 is ok |
|
@tlrx I trust your judgement, I only wanted to make sure this question had been considered given the change to how CFS files are treated. |
… Stats APIs (#71725) Since #16661 it is possible to know the total sizes for some Lucene segment files by using the Node Stats or Indices Stats API with the include_segment_file_sizes parameter, and the list of file extensions has been extended in #71416. This commit adds a bit more information about file sizes like the number of files (count), the min, max and average file sizes in bytes that share the same extension. Here is a sample: "cfs" : { "description" : "Compound Files", "size_in_bytes" : 2260, "min_size_in_bytes" : 2260, "max_size_in_bytes" : 2260, "average_size_in_bytes" : 2260, "count" : 1 } This commit also simplifies how compound file sizes were computed: before compound segment files were extracted and sizes aggregated with regular non-compound files sizes (which can be confusing and out of the scope of the original issue #6728), now CFS/CFE files appears as distinct files. These new information are provided to give a better view of the segment files and are useful in many cases, specially with frozen searchable snapshots whose segment stats can now be introspected thanks to the include_unloaded_segments parameter. Backport of #71643
Since #16661 it is possible to know the total sizes for some Lucene segment files by using the Node Stats or Indices Stats API with the
include_segment_file_sizesparameter, and the list of file extensions has been extended in #71416.This pull request adds a bit more information about file sizes like the number of files (
count), themin,maxand average file sizes in bytes that share the same extension. Here is a sample:This pull request also simplifies how compound file sizes were computed: before compound segment files were extracted and sizes aggregated with regular non-compound files sizes (which I find confusing and out of the scope of the original issue #6728), now CFS/CFE files appears as distinct files.
I think that these information give a better view of the segment files and are useful in many cases, specially with searchable snapshots whose segment stats can now be introspected thanks to the
include_unloaded_segmentsparameter.