performGC before getting RTS/GC stats #177

avieth · 2018-01-09T23:52:51Z

performGC before getting RTS/GC stats

The deprecated getGCStats had some nice documentation: "If you would
like your statistics as recent as possible, first run a performGC"

This is sadly missing from getRTSStats but I believe it still holds.
When regressing allocated over iters, I'd see a believable slope, but an
unbelievable y intercept, something like -300000 when the slope is 150
or so. That's because the first hundred or so measurements were 0 for
bytes allocated, as the RTS didn't bother to run a GC before the RTS
stats were sampled.

Now we do 3 samples:

performGC before the first one, to ensure it's up-to-date.
Do the second one after the action, without a performGC, so we can
get legit readings on the GC-related stats.
performGC and then sample again, so we can get up-to-date readings
on other metrics.
Carefully choose whether to diff start stats against the end stats
per- or post-GC.

Also included is a fix to the ToJSON Measurements instance, which
duplicated the mutator cpu seconds where GC cpu seconds should go.

The deprecated getGCStats had some nice documentation: "If you would like your statistics as recent as possible, first run a performGC" This is sadly missing from getRTSStats but I believe it still holds. When regressing allocated over iters, I'd see a believable slope, but an unbelievable y intercept, something like -300000 when the slope is 150 or so. That's because the first hundred or so measurements were 0 for bytes allocated, as the RTS didn't bother to run a GC before the RTS stats were sampled. Now we do 3 samples: 1. performGC before the first one, to ensure it's up-to-date. 2. Do the second one after the action, without a performGC, so we can get legit readings on the GC-related stats. 3. performGC and then sample again, so we can get up-to-date readings on other metrics. 4. Carefully choose whether to diff start stats against the end stats per- or post-GC. Also included is a fix to the ToJSON Measurements instance, which duplicated the mutator cpu seconds where GC cpu seconds should go.

RyanGlScott

Thanks for noticing this, @avieth. It's unfortunate timing that this requires another breaking change after we just released criterion-1.3.0.0, but I suppose that's just how it is sometimes.

I've left some comments inline.

RyanGlScott · 2018-01-10T05:24:24Z

Criterion/Measurement.hs

 -- | Try to get GC statistics, bearing in mind that the GHC runtime
 -- will throw an exception if statistics collection was not enabled
 -- using \"@+RTS -T@\".
+-- If you need guaranteed up-to-date stats, call performGC first.


Put singlequotes around performGC (i.e., 'performGC') so that Haddock will link it.

RyanGlScott · 2018-01-10T05:26:09Z

Criterion/Measurement.hs

-                  -- ^ Statistics gathered at the __end__ of a run.
+                  -- ^ Statistics gathered at the __end__ of a run, post GC.
+                  -> Maybe GCStatistics
+                  -- ^ Statistics gathered at the __end__ of a run, pre GC.


The order here (first post-GC statistics, then pre-GC statistics) feels all wonky to me. Why not have the first argument be pre-GC statistics, then following by the post-GC statistics? We're going to need a breaking change anyway, so we might as well set things straight in the process.

My reasoning was that the "later" argument always came first, so I kept with the trend. It doesn't matter so much to me so I'll change it up if you insist.

Ah, that's a good point I hadn't considered. In that case, your order is fine, since we'd want to preserve the existing convention.

RyanGlScott · 2018-01-10T05:27:01Z

Criterion/Measurement.hs

+  , measMutatorCpuSeconds  = diff endPostGC start gcStatsMutatorCpuSeconds
+  , measGcWallSeconds      = diff endPreGC  start gcStatsGcWallSeconds
+  , measGcCpuSeconds       = diff endPreGC  start gcStatsGcCpuSeconds
+  } where diff a b f = f a - f b


This b argument will always be start, yes? Why not keep it inline in diff?

RyanGlScott · 2018-01-10T05:27:48Z

Criterion/Measurement.hs

+  , measMutatorWallSeconds = diff endPostGC start gcStatsMutatorWallSeconds
+  , measMutatorCpuSeconds  = diff endPostGC start gcStatsMutatorCpuSeconds
+  , measGcWallSeconds      = diff endPreGC  start gcStatsGcWallSeconds
+  , measGcCpuSeconds       = diff endPreGC  start gcStatsGcCpuSeconds


In the comments, can you include a justification for which things are diffed with endPreGC, and which things are diffed with endPostGC?

avieth · 2018-01-10T05:44:40Z

Thanks for noticing this, @avieth. It's unfortunate timing that this requires another breaking change after we just released criterion-1.3.0.0, but I suppose that's just how it is sometimes.

Then again, apparently I'm the only person who uses regressions on the GC stats :) else someone would probably have noticed this by now.

RyanGlScott · 2018-01-10T19:07:03Z

Thanks!

Mention #177

avieth added 2 commits January 9, 2018 18:52

fix shadowed binding warning

6f18c08

avieth force-pushed the avieth/fix_rts_stats branch from 52eaf27 to 6f18c08 Compare January 10, 2018 00:09

RyanGlScott requested changes Jan 10, 2018

View reviewed changes

avieth added 2 commits January 10, 2018 00:47

comment applyGCStatistics and simplify diff

277ea30

update comment for getGCStatistics

46fc8c8

RyanGlScott merged commit b93bc12 into haskell:master Jan 10, 2018

RyanGlScott added a commit that referenced this pull request Jan 10, 2018

Start changelog entry for next major version

9a2c3e1

Mention #177

patrickdoc mentioned this pull request Feb 10, 2018

GC drastically reducing benchmark performance #185

Closed

RyanGlScott mentioned this pull request Feb 25, 2018

Decrease impact of gc on benchmark performance (fixes #185) #187

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

performGC before getting RTS/GC stats #177

performGC before getting RTS/GC stats #177

Uh oh!

avieth commented Jan 9, 2018

Uh oh!

RyanGlScott left a comment

Uh oh!

RyanGlScott Jan 10, 2018

Uh oh!

RyanGlScott Jan 10, 2018

Uh oh!

avieth Jan 10, 2018

Uh oh!

RyanGlScott Jan 10, 2018

Uh oh!

RyanGlScott Jan 10, 2018

Uh oh!

avieth Jan 10, 2018

Uh oh!

RyanGlScott Jan 10, 2018

Uh oh!

avieth commented Jan 10, 2018

Uh oh!

RyanGlScott commented Jan 10, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

performGC before getting RTS/GC stats #177

performGC before getting RTS/GC stats #177

Uh oh!

Conversation

avieth commented Jan 9, 2018

Uh oh!

RyanGlScott left a comment

Choose a reason for hiding this comment

Uh oh!

RyanGlScott Jan 10, 2018

Choose a reason for hiding this comment

Uh oh!

RyanGlScott Jan 10, 2018

Choose a reason for hiding this comment

Uh oh!

avieth Jan 10, 2018

Choose a reason for hiding this comment

Uh oh!

RyanGlScott Jan 10, 2018

Choose a reason for hiding this comment

Uh oh!

RyanGlScott Jan 10, 2018

Choose a reason for hiding this comment

Uh oh!

avieth Jan 10, 2018

Choose a reason for hiding this comment

Uh oh!

RyanGlScott Jan 10, 2018

Choose a reason for hiding this comment

Uh oh!

avieth commented Jan 10, 2018

Uh oh!

RyanGlScott commented Jan 10, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants