-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to communicate CI failures? #2
Comments
I prefer to migrate the existing flaky test issues here, just so we can reduce the noise in the core issue tracker because the biggest problem with flakes is noise anyway. Also the automation stuff should probably reply to issues and send PRs here and it just makes more sense if those are all done in one repo. It also makes it easier to search for stuff. I think we should create subteams for nodejs/build. nodejs/build is not for fixing flaky tests, also we sometimes ping them for fixing build files but I don't think (everyone in) that team is for that either. Hence I suggested nodejs/build-infra and nodejs/build-files. The difference is: if someone from the build team would need SSH access or account access or sending PRs to the build repo to fix this, ping nodejs/build-infra, if someone would need to open a PR to the build files in core to fix this, ping nodejs/build-files And yes, I think notification should be a ping. If it's urgent, like for infra failures, we can hop on the #nodejs-build IRC as well. |
Maybe we can restart https://github.com/nodejs/testing / nodejs/testing? nodejs/build should only be for infrastructure (at the moment I don't see nodejs/build-infra being anything other than an alias for nodejs/build). I don't think nodejs/build-files belongs under the Build WG. |
Ping @nodejs/testing |
I had been working on a separate UI for reporting Jenkins CI statuses at http://node-builder.herokuapp.com/, with the idea that at some point we could point out what failures were flaky tests/infrastructure issues/actual failing tests. IMHO, the current Jenkins UI makes it tough to easily and quickly identify sources of failures (right now you have to cross check nodejs/node, nodejs/build, and possibly other repos too). This might be something to look into with the commit queue work.
This would be great, to help reduce pings of |
Just my 2 cents, I'd be a bit worried about removing the "noise" from the main repo. I'm thinking we want more focus/help on getting these kinds of issues. Making them less visible/troublesome may result in less focus/outside help other than those already interested in the problem. |
I don't think there will be a difference because those who are not interested in the problem are already ignoring the noise generated by flaky test issues. But IMO having a single place to easily view the state of flakiness + infra issues is more important than reducing noise. |
@richardlau I agree, I've always been feeling weird when we ping the build WG for changes to Makefile, etc. If people are OK with it, I can create |
By the way there is a new automation tool for walking through the CI and pattern-match all the failure reasons (flakes, build file failures, infra failures) to print to the console. I think it's similar to http://node-builder.herokuapp.com/ , although it's a CLI. The demo is here: https://asciinema.org/a/184727 To make it automatically report flakes or display a failure as potential flakes to users, the easiest solution is to have a flaky test database in place. The tool can be smarter and compare file changes against failures or look into previous failures to identify flakes itself, but in the end we still need human to triage and confirm. |
IMO best way to track infra failures and incidents is in the build repo. It's easy to check with "the action items board" and open a new issue if needed. |
How should collaborators report unrelated CI failures?
For the instructions, "notifying the team" means a @ ping?
The text was updated successfully, but these errors were encountered: