-
-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP: Code coverage reports for base #8781
Conversation
IMO this would be better to do within @staticfloat's army of buildbots and posted to status.julialang.org and/or coveralls on a nightly or few-times-a-day basis, please no coveralls spam after every commit to every PR (I find it annoying already in the few packages that have it turned on, the results have value but I don't think they need to be tracked at that frequency). It is interesting to look at and work on improving coverage if it's remotely accurate (hence question 2, I think Tim knows the most about that). |
@tkelman: Very good points, I take it from your comments that the notifications can not be turned off (annoying...). The reason I took the time to look into this was pretty much along the lines of what you say, when looking to find holes in our current coverage it is really hard to do without some sort of tracking and visualisation of the current coverage. I would love to help setting this up as a cronjob or similar if I can be trusted with access to a machine. |
I have no idea there, I haven't mucked around with coveralls settings at all. The notifications are enabled in several packages I've submitted PR's to, but that may be because the package authors like the notifications better than I do, and that's up to them. Elliot has almost all of his code up at either https://github.com/staticfloat/julia-buildbot or other repositories. I'm not sure exactly which file corresponds to which builder, but if you can find where the tests get run it might be not that hard to fit in a set with coverage enabled too. |
FWIW, the big part of getting it to work for base was #7464, and in that issue you'll see more comments expressing concern about the amount of noise generated. Also, my view is currently that |
I think it'd be great to have Coveralls for Base, and I don't think its bad to have it on PRs necessarily (although sometimes it'd be best to turn it off). If someone adds new functionality and the coverage number goes down, that should be recognized as a bad thing. Conversely, I think @timholy is right that if there is a coverage number then people will submit little fixes, because they get feedback quickly and it "game-ifies" it (works for me, stackoverflow...). |
It's not so much a bug or oversight as a very tricky design question. I sketched what might possibly be a way forward in #7541, but I don't know enough about parsing/lowering for me to interested in figuring this out on my own. But I still don't think we want this for every PR. |
Well, I would not consider myself to be a legion, but I do want to plug some holes. I think I understand the problems outlined in #7541 and I will see what I can make out of the I am somewhat split whether pursuing 100% coverage is really all that relevant, but I strongly suspect that we have a bunch of modules that are largely untested. What I personally want to do is find them, read them (reading the Julia codebase is always a blast) and write some tests for them. |
See, you are a legion, all by yourself 😄. |
That's a fantastic idea, and it would be good if more people felt that urge when contributing. I can tell you for example the sparse linear algebra code is far less tested than the dense counterparts, but digging into coverage reports to find holes and devise new tests is not something I've found myself wanting to do. The status quo has been identifying bugs when they come up in real-world use cases and adding tests as part of fixing the bugs. Needs more sustained effort to do much differently.
New functionality is a good thing - if it's untested that's a problem, but doesn't completely invalidate the usefulness of the functionality. It's pretty easy when reviewing a feature addition to say, "needs tests." We don't need constant notifications and bot comments saying "+1%" or "-1%" to figure that out. Feature additions of this type also seem to be a fairly small portion of the pull requests that get opened. Most of what I tend to work on is build fixes or platform differences, for which a coverage test that only runs on one platform is pretty useless.
I like that idea, having something opt-in to only run when it makes sense would be good to start with, and we could start getting a sense of what portion of PR's we end up wanting to look at this information. |
You are right about the nature of PRs to base, they aren't often of the "needs test" type |
FWIW, code-coverage was extremely helpful to me in catching problems with the recent |
I will retract my noise concerns if we can get Coveralls reports integrated into the new official-from-Github multi-status indicators (https://github.com/blog/1935-see-results-from-all-pull-request-status-checks), instead of replying as comments. |
I think this is an excellent idea. However, I am currently pushing two papers for a deadline and probably won't be able to work on this before Christmas. |
I don't think this is something we can do on our side, I think Coveralls would have to implement it. I don't know whether Coveralls' service is as extensible by community contributors as Travis is. |
Is this closed by #9493 ? |
Wooah team: |
:D On Tue, Feb 3, 2015 at 8:22 PM, Iain Dunning notifications@github.com
|
I do think this is largely done, but I'm sure it could be set up to be a bit more convenient. Currently I have to manually push master to my fork of julia on a daily basis to keep Coveralls in sync with this repo. However, keep in mind that you basically can't use Travis to generate accurate coverage reports. You hit the 50-minute timeout due to these requirements:
So we can't do this as a byproduct of testing PRs, it has to be a separate pass. I think @staticfloat has been taking what I did in |
I had things working a little while ago, but with the 32-bit failures the
tarballs haven't been getting built. I'll try to make sure things are
working well by this weekend, as I love having coverage reports for Base!
-E
|
We have buildbots for this now, closing. We aren't getting coverage reports for PR's since nothing in #8781 (comment) has changed, but given a bit of webhook scripting we could possibly do so on-demand if there ends up being a need for that. |
The last part of #8781 (comment) is about to change once I can test this on linux I swear!!! |
A long time ago I noticed a comment by simonster about code coverage in base and finally took a stab at it. Little did I know that I ended up with almost exactly what timholy hacked together several moons ago.
I put the coverage as a separate entry in the build matrix on Travis and explicitly allow for it to fail in order not to slow down the Travis reports. As a consequence of this we will only run coverage on Linux, but I suspect that this does not matter.
However, there are several issues to consider before merging something like this (thus the WIP).
For 1., if it is not possible to disable them on the Coveralls dashboard I can tuck the coverage code and Coveralls submission as optional make targets that can be run upon request rather than on Travis. As for 2., I hope that someone else has more insights into exactly what the current shortcomings of the coverage code is.
Another note on the WIP status, I am yet to fire off the script onto Travis and may push to this branch shortly if I messed something up.