-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Extend SimpleMemoryCheck
service to report jemalloc and smaps information, and on early termination signal
#46859
Conversation
… file Also include information on huge pages
cms-bot internal usage |
@cmsbuild, please test |
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-46859/42888 |
A new Pull Request was created by @makortel for master. It involves the following packages:
@Dr15Jones, @makortel, @smuzaffar can you please review it and eventually sign? Thanks. cms-bot commands are listed here |
+1 Size: This PR adds an extra 32KB to repository Comparison SummarySummary:
|
test parameters:
|
@cmsbuild, please test |
@Dr15Jones please review |
+1 Size: This PR adds an extra 12KB to repository Comparison SummaryThere are some workflows for which there are errors in the baseline: Summary:
|
test parameters:
|
@cmsbuild, please test |
Pull request #46859 was updated. @Dr15Jones, @cmsbuild, @makortel, @smuzaffar can you please check and sign again. |
@cmsbuild, please test |
+1 Size: This PR adds an extra 28KB to repository
Comparison SummarySummary:
|
Here is the new endJob output from 13034.0 (2024 TTBar+PU) step 3 (RECO)
|
test parameters: |
@cmsbuild, please test Final round without the customization |
+1 Size: This PR adds an extra 12KB to repository
Comparison SummarySummary:
|
Comparison differences are related to #46416 |
+core |
This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @sextonkennedy, @mandrenguyen, @rappoccio, @antoniovilela (and backports should be raised in the release meeting by the corresponding L2) |
+1 |
PR description:
This PR extends the
SimpleMemoryCheck
reporting withSIGUSR2
signal)AnonHugePages
, and RSS and VSIZE of mmapped files from/proc/<pid>/smaps
(if can open the file).pcm
files (generated by ROOT), because I noticed the.pcm
contribution was quite large (hundreds of MB)I hope these numbers will help to figure out if the application itself is allocating a lot of memory, if the allocator is using a lot more memory than the application asks for, or if the operating system ends up using a lot of memory e.g. because of fragmentation of transparent huge pages (see #42387).
This PR was motivated by #46040, especially the behavior of steep rise of RSS.
Resolves cms-sw/framework-team#1082
PR validation:
Tested privately with an example job in #46040 that the jemalloc and smaps information gets added, also in the case the job is terminated early with
SIGUSR2
signal. Tested also theSimpleMemoryCheck
works when run throughcmsRunGlibC
andcmsRunTC
(with jemalloc information missing, of course).