RFT (request for tips): tweaking .cov files, aka code_coverage output #7541

timholy · 2014-07-07T22:28:00Z

In these issues:
JuliaCI/Coverage.jl#4
JuliaCI/Coverage.jl#11
it was noticed that it's difficult to accurately calculate coverage, given the format of the .cov files. @IainNZ suggested that this is best fixed in julia, since its parsing is definitive.

I have a general idea of how to fix this, but I could use some help with the details:

1. When code is expanded, for each file, create a list storing

struct {
    int firstlinenumber;
    int lastlinenumber;
    int iscompiled;
}

for each method in the file. The initial setting for iscompiled is false. Methods are stored in the order of appearance in the file.
2. Have coverageVisitLine, which gets called when functions are compiled, set iscompiled to true
3. In jl_write_coverage_data, when you get to the first line of the next method in the list, check iscompiled. If false, prepend 0 instead of - for each line of the function. This will even penalize comments, end statements, etc, but I think it's better to err in this direction---it's extra incentive to make sure all functions are tested.
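The three steps above can be sketched roughly as follows. This is only an illustration of the proposal, not Julia's actual implementation: the names `method_range_t`, `mark_compiled`, and `line_prefix` are hypothetical, and the convention that a count of -1 means "line never instrumented" is an assumption made for the example.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical per-method record, mirroring the struct proposed above. */
typedef struct {
    int firstlinenumber;  /* first source line of the method (inclusive) */
    int lastlinenumber;   /* last source line of the method (inclusive) */
    int iscompiled;       /* 0 until the method is compiled */
} method_range_t;

/* Step 2 (sketch): when a line is visited at compile time, flag the
 * method whose line range contains it. */
static void mark_compiled(method_range_t *methods, int nmethods, int line)
{
    for (int i = 0; i < nmethods; i++) {
        if (line >= methods[i].firstlinenumber &&
            line <= methods[i].lastlinenumber) {
            methods[i].iscompiled = 1;
            return;
        }
    }
}

/* Step 3 (sketch): pick the prefix to write in front of one source line.
 * count < 0 is assumed to mean "no counter exists for this line". */
static void line_prefix(const method_range_t *methods, int nmethods,
                        int line, long count, char *buf, size_t buflen)
{
    if (count >= 0) {
        /* Instrumented line: write its execution count. */
        snprintf(buf, buflen, "%ld", count);
        return;
    }
    for (int i = 0; i < nmethods; i++) {
        if (line >= methods[i].firstlinenumber &&
            line <= methods[i].lastlinenumber &&
            !methods[i].iscompiled) {
            /* Inside a method that was never compiled: count it as missed. */
            snprintf(buf, buflen, "0");
            return;
        }
    }
    /* Outside any method, or inside a compiled one: line does not count. */
    snprintf(buf, buflen, "-");
}
```

With this shape, every line of a never-compiled method (including its comments and `end`) gets a `0`, which matches the deliberate over-penalizing described in step 3.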

The part I'm stuck on is finding the best place for step 1. My favorite candidate is here, but I'm not at all sure about this.


mbauman · 2014-07-09T02:22:18Z

Another big part of the overestimation comes from the often-used x==1 && do_something() idiom. Perhaps short-circuiting lines should only count if you get to the end of the circuit?

timholy · 2014-07-09T02:40:09Z

True. Though when it's i < n || throw(BoundsError()) I'm not sure there's any reason to be particularly interested in triggering the BoundsError.

hayd · 2014-07-09T02:44:28Z

Similarly for the ternary operator: x == 1 ? do_something : do_something_else.

...you may want to check that the BoundsError is correctly triggered.

StefanKarpinski · 2014-07-09T04:18:51Z

It seems to me that the most interesting coverage question is of all the branch points in your program, how many of the possible branches do you take? I'm not sure if this is equivalent to fraction of basic blocks covered. At that point, you can consider 2-grams of basic blocks, 3-grams of basic blocks, etc.

timholy · 2014-07-09T09:06:00Z

The branches question is indeed relevant, but we shouldn't let ourselves get distracted by it: browsing through the *.cov files, by far the bigger issue is the fact that uncompiled functions don't get counted against the coverage fraction. Untested lines should be shown in red, and "irrelevant" lines (comments, etc) are uncolored. The biggest problem is determining which lines are relevant. For example, here are two files that are (laughably) listed as having 100% coverage: https://coveralls.io/files/238719979, https://coveralls.io/files/238719982.

You can see this result again and again throughout the complete results.

simonster · 2014-07-09T14:35:47Z

How will we stop functions that are tested but always inlined from counting against the coverage percentage?

timholy · 2014-07-09T15:38:01Z

Disable inlining when performing --code-coverage? #7464 (comment)

JeffBezanson · 2014-07-29T03:47:21Z

Disabling inlining is really not realistic.

timholy · 2014-07-29T09:36:57Z

If one cares, then I suppose the alternative would be to have an analysis tool that flags all functions that will be inlined. But if this is the only application, that seems a little excessive---this is just supposed to be a guide for helping people write more comprehensive tests, and that is something that already requires a little intelligence. I care less about what the actual coverage number is.

mlhetland · 2014-08-08T14:17:01Z

If we do want to add branch coverage, Ned Batchelder's much-used coverage.py might be of interest. (I've had some issues with it complaining of uncovered branches that aren't really “there”—stuff that couldn't have been reached anyway. But that's why he has the # pragma comments.)

As for the inlining issue: It seems to me that that could be important. If one wants 100% coverage (which could be useful, if one wants to add a test for it, for example—or we want a command-line switch for ensuring 100% coverage, like dlang), the plethora of tiny functions that seem idiomatic to Julia would seriously mess with that, no? If disabling inlining is unrealistic—is it possible to record where a function is inlined, and work with that?

Or … if functions are going to be inlined, if we know that when we compile them (as in @timholy's suggestion, here), could we not just use that as a reason to not flag them as compiled? In a sense, they haven't been compiled as individual functions (from the perspective of coverage) anyway. Then we will "sort of" get 100% coverage if and only if we have executed all the code, including the inlined functions. (Not talking about branch coverage here.) The only issue is that we won't be able to count individual lines in the inlined functions, but that's perhaps less crucial, if they're going to be tiny anyway, in general?

timholy · 2014-08-08T14:30:55Z

Currently the - that gets written in front of the line means, as far as Coveralls.jl is concerned, "this line doesn't count." So a function that is always inlined won't prevent you from reaching something that Coveralls thinks is 100% coverage. The only lines that count against your total are those with a 0 out front. So currently I'd guess that the numbers reported for most packages are inflated, and the number reporting something > 0 is an underestimate.
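To make the counting rule concrete, here is a minimal sketch of how a tool might tally lines under that convention. It is not the actual Coveralls.jl code: `cov_tally_t`, `tally_line`, and `coverage_fraction` are hypothetical names, the line layout (optional leading whitespace, then either `-` or a count, then the source text) is an assumption about the .cov format, and reporting 1.0 when no lines count at all is likewise an assumption.

```c
#include <stdio.h>
#include <stdlib.h>
#include <ctype.h>

/* Hypothetical running tally of lines that count toward coverage. */
typedef struct {
    int hit;     /* lines with a count > 0 */
    int missed;  /* lines with a count of exactly 0 */
} cov_tally_t;

/* Classify one line of a .cov-style listing (assumed layout:
 * leading whitespace, then "-" or an integer count, then the source). */
static void tally_line(const char *line, cov_tally_t *t)
{
    while (*line == ' ' || *line == '\t')
        line++;
    if (*line == '-')
        return;                          /* "this line doesn't count" */
    if (!isdigit((unsigned char)*line))
        return;                          /* unrecognized prefix: ignore */
    long n = strtol(line, NULL, 10);
    if (n > 0)
        t->hit++;
    else
        t->missed++;                     /* only a 0 counts against the total */
}

/* Fraction of counted lines that were hit; 1.0 if nothing was counted. */
static double coverage_fraction(const cov_tally_t *t)
{
    int total = t->hit + t->missed;
    return total == 0 ? 1.0 : (double)t->hit / total;
}
```

Under this rule, a file whose never-called functions are all prefixed with `-` still reports full coverage as long as every counted line was hit, which is the inflation described above.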

Besides inlining, the other big ticket item is precompilation, which currently has the same effect. You can work around this by deleting sys.so before you run, but a better solution would be for --code-coverage=all and --track-allocation=all to temporarily disable it. A few minutes spent searching through the code did not reveal to me how that file gets loaded, so ATM I haven't figured out how one might go about doing that.

timholy · 2014-10-23T13:20:01Z

For the inlining problem, perhaps we could have inference.jl tag that struct I mentioned up top.

timholy · 2015-01-06T22:25:22Z

In combination with other work in base (--inline=no, fixing the detection of base vs user code) and JuliaCI/Coverage.jl#36, this seems to basically be done.

timholy mentioned this issue Jul 9, 2014

More instrumentation (memory allocation, also fixes #7259) #7464

Merged

timholy referenced this issue in JuliaAttic/Color.jl Aug 27, 2014

Add badges. Bring on the shame.

a973c36

timholy mentioned this issue Oct 23, 2014

WIP: Code coverage reports for base #8781

Closed

ghost mentioned this issue Dec 15, 2014

Add --noinline startup option #9354

Merged

timholy closed this as completed Jan 6, 2015
