CBMC/JBMC --cover: only store traces with --trace to avoid memory exhaustion #5714

tautschnig · 2021-01-02T23:43:55Z

Not everyone needs test inputs, especially when programs lack INPUT
instructions anyway. This helped me avoid running out of memory on a
768GB system for a particular verification problem. The same problem can
now be run with just 75GB of memory.

Each commit message has a non-empty body, explaining why the change was made.
n/a Methods or procedures I have added are documented, following the guidelines provided in CODING_STANDARD.md.
The feature or user visible behaviour I have added or modified has been documented in the User Guide in doc/cprover-manual/
Regression or unit tests are included, or existing tests cover the modified code (in this case I have detailed which ones those are in the commit message).
n/a My commit message includes data points confirming performance improvements (if claimed).
My PR is restricted to a single feature or bugfix.
n/a White-space or formatting changes outside the feature-related changed lines are in commits of their own.

codecov · 2021-01-03T00:22:34Z

Codecov Report

Merging #5714 (87e8440) into develop (d613b01) will decrease coverage by 0.01%.
The diff coverage is 100.00%.

@@             Coverage Diff             @@
##           develop    #5714      +/-   ##
===========================================
- Coverage    69.67%   69.66%   -0.02%     
===========================================
  Files         1248     1248              
  Lines       100836   100854      +18     
===========================================
  Hits         70262    70262              
- Misses       30574    30592      +18

Flag	Coverage Δ
cproversmt2	`43.37% <70.83%> (-0.06%)`	⬇️
regression	`66.64% <100.00%> (-0.02%)`	⬇️
unit	`32.20% <0.00%> (-0.01%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
src/goto-checker/goto_trace_storage.h	`100.00% <ø> (ø)`
src/goto-instrument/cover.h	`100.00% <ø> (ø)`
src/goto-programs/goto_trace.h	`100.00% <ø> (ø)`
src/cbmc/cbmc_parse_options.cpp	`77.67% <100.00%> (+0.04%)`	⬆️
...-checker/cover_goals_verifier_with_trace_storage.h	`93.33% <100.00%> (+1.66%)`	⬆️
src/goto-checker/goto_trace_storage.cpp	`100.00% <100.00%> (ø)`
src/goto-instrument/cover.cpp	`85.41% <100.00%> (+0.10%)`	⬆️
src/goto-programs/goto_trace.cpp	`81.40% <100.00%> (+0.53%)`	⬆️
src/cbmc/c_test_input_generator.cpp	`60.00% <0.00%> (-30.00%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update d613b01...87e8440. Read the comment docs.

peterschrammel

Good workaround in the meanwhile. Storing traces and inputs should be decoupled, though, I think.
Besides, I should probably revive the PR that enables trace output as traces are produced rather than storing them at all (but that's quite a UI change which needs to be phased in properly).

hannes-steffenhagen-diffblue · 2021-01-07T12:42:26Z

regression/cbmc-cover/branch-loop1/test.desc

@@ -1,6 +1,6 @@
 CORE
 main.c
--xml-ui --cover branch
+--xml-ui --cover branch --trace


Seems like this is broken somehow?

Yes, it was broken as the code generating XML output is used for both trace and test-suite output. I have therefore changed the patch to introduce a new command-line option "show-test-suite."

martin-cs

Given the performance numbers this sounds very much like the kind of thing we want and I am in favour of not storing things we don't need. But I am a little confused over the use-case for this. --cover creates a bunch of assertions for the relevant coverage metric and then solves for all of them. Then, orthogonally(? is this what @peterschrammel is suggesting?) we could output for each location:

A. reached / unreachable.
B. inputs that would cause the relevant path to be taken.
C. the execution trace that reaches each one.

So this is only fro case C? Are there users of this? Also, can't we output these as we compute them? I'm not sure I see why they need to be stored at all.

goto traces consume a lot of memory, and re-allocating them for each incremental addition of a goto trace does not seem to be a good approach. This is at the expense of more costly indexed access, which now requires iterator increments.

tautschnig · 2021-01-20T13:41:09Z

@peterschrammel @martin-cs I'd appreciate you taking another look at this for I've made substantial changes over the version you previously reviewed/approved:

I've introduced merge_irept to effectively compress the amount of memory consumed by a trace. This alone would have solved my out-of-memory problem as I've tested, but it seems the computational cost of building all the traces is still substantial (and often unnecessary).
Instead of using (abusing?) the "trace" command-line option, I've instead added a new option "show-test-suite."

It may still be desirable to print test inputs as-you-go, but that's not something on my priority queue (as I don't care about those inputs values at the moment).

A goto trace includes freshly constructed expressions, which thus lack sharing. This results in excessive memory use when storing multiple traces, as is done for coverage computation. On a particular benchmark, coverage computation previously ran out of memory at 768 GB. This same benchmark can now be run with just 80 GB of memory.

Not everyone needs test inputs, especially when programs lack INPUT instructions anyway. Test inputs are found via goto traces, and computing those is expensive in both time and memory required. On one benchmark, not building goto traces reduced the overall time from 36000 seconds to 2800 seconds (a saving of 92%).

martin-cs · 2021-01-22T16:24:22Z

@tautschnig Apologies for the lag; first week of term. I'm guessing by the merge that you are happy with this.

tautschnig requested review from chrisr-diffblue, hannes-steffenhagen-diffblue, kroening, peterschrammel and smowton as code owners January 2, 2021 23:43

tautschnig self-assigned this Jan 3, 2021

peterschrammel approved these changes Jan 4, 2021

View reviewed changes

hannes-steffenhagen-diffblue reviewed Jan 7, 2021

View reviewed changes

martin-cs reviewed Jan 11, 2021

View reviewed changes

tautschnig force-pushed the cover-trace branch from 65d381f to 0fe9312 Compare January 20, 2021 13:34

tautschnig force-pushed the cover-trace branch from 0fe9312 to a18c47d Compare January 20, 2021 13:42

tautschnig assigned peterschrammel and martin-cs and unassigned tautschnig Jan 20, 2021

tautschnig force-pushed the cover-trace branch from a18c47d to b3025b1 Compare January 20, 2021 15:25

tautschnig force-pushed the cover-trace branch from b3025b1 to 87e8440 Compare January 20, 2021 20:18

tautschnig merged commit 4820ec3 into diffblue:develop Jan 21, 2021

tautschnig deleted the cover-trace branch January 21, 2021 22:59

tautschnig mentioned this pull request Mar 25, 2024

introduce 'fatal assertions' #8226

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CBMC/JBMC --cover: only store traces with --trace to avoid memory exhaustion #5714

CBMC/JBMC --cover: only store traces with --trace to avoid memory exhaustion #5714

Uh oh!

tautschnig commented Jan 2, 2021

Uh oh!

codecov bot commented Jan 3, 2021 •

edited

Loading

Uh oh!

peterschrammel left a comment

Uh oh!

hannes-steffenhagen-diffblue Jan 7, 2021

Uh oh!

tautschnig Jan 20, 2021

Uh oh!

martin-cs left a comment

Uh oh!

tautschnig commented Jan 20, 2021

Uh oh!

martin-cs commented Jan 22, 2021

Uh oh!

Uh oh!

CBMC/JBMC --cover: only store traces with --trace to avoid memory exhaustion #5714

CBMC/JBMC --cover: only store traces with --trace to avoid memory exhaustion #5714

Uh oh!

Conversation

tautschnig commented Jan 2, 2021

Uh oh!

codecov bot commented Jan 3, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

peterschrammel left a comment

Choose a reason for hiding this comment

Uh oh!

hannes-steffenhagen-diffblue Jan 7, 2021

Choose a reason for hiding this comment

Uh oh!

tautschnig Jan 20, 2021

Choose a reason for hiding this comment

Uh oh!

martin-cs left a comment

Choose a reason for hiding this comment

Uh oh!

tautschnig commented Jan 20, 2021

Uh oh!

martin-cs commented Jan 22, 2021

Uh oh!

Uh oh!

codecov bot commented Jan 3, 2021 •

edited

Loading