Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Project Velocity and PR reviews #64

Closed
benjamingr opened this issue Apr 19, 2018 · 53 comments
Closed

Project Velocity and PR reviews #64

benjamingr opened this issue Apr 19, 2018 · 53 comments

Comments

@benjamingr
Copy link

Following an internal discussion about concerns of high project velocity and recent PRs landing and then being reverted, I'm copying my response publicly here (below).

I'd like to discuss what we can do to improve the PR process this during the summit if anyone is interested.


Thinking a little more about this - I definitely think it's a matter of not getting enough eyes to review the code rather than bigger changes.

I think this is also very discouraging to a collaborator working on a PR because:

  • Having a change you worked on (and landed) reverted because someone who hasn't looked at it now looked at it is discouraging. To be clear - reverting is fine, making mistakes is fine and we should encourage experimentation - but the fact something gets approved and then pulled is frustrating.
  • Having a change you worked on mostly ignored as a PR is (probably?) our San Francisco - August 6/7 2015 #1 driver of collaborator burnout. It's very frustrating and I recall collaborators that have left the org because of it.

I think that in order to improve the # of eyes on PRs we have two strategies:

  • Make reviewing more rewarding somehow (list reviewers after the author in the changelog, mention noteworthy reviewers etc).
  • Ping nodejs/collaborators much less and smaller teams more - so that people don't get to that vicious "I have so many node notifications in my GitHub and I'm super busy and close to burnout - let's mark all as read" which leads to stale objections and reviews.

I think we should work towards a more organized review system where:

  • Relevant people who own various parts of the code form teams (like an http team etc)
  • Teams report to the TSC (or write a report) once a month about the subsystem, teams would be self organizing and hopefully meet once a month.
  • Teams get pinged on PRs to subsystems (mostly the case) and we stop pinging nodejs/collaborators on many things.

Optimally, a collaborator joining would automatically join the teams related to the PRs said collaborator worked on when joining - we should also likely ping existing collaborators.

Would it make sense? It's really just broad ideas.

@Tiriel
Copy link

Tiriel commented Apr 19, 2018

Lots to discuss there, and I'm definitely not the best to discuss most of it, but I'm deeply interested in hearing/reading discussions about this. Burnout is a very real problem, and I feel it's even worse in OSS since we're supposed to do this because we love it.

TL;DR: huge +1 on discussing this at Summit. I'll leave the rest to competent people.

@benjamingr
Copy link
Author

So, here is actionable stuff that might encourage reviews (some from the post above):

  • List reviewers in changelog.
  • Celebrate reviewing as a form of contribution by mentioning it explicitly as a form of contribution in contributing.md - our current guide is very nice (props to the authors) - we should emphasize it as a welcome form of contribution.
  • Make case studies of burnout related to lack of attention or stagnation of pull requests (I can help with that and name a few).
  • Discourage pings to nodejs/collaborators in collaborator guide and encourage unsubscribing from relevant issues.
  • Encourage people to join subsystem teams during the on-boarding process and establish a process for the teams meeting and filing monthly/bi-monthly reports to the TSC about subsystem status and pending issues.
  • Create a more digested form of ongoing efforts so that people know better where their help is needed. Some example ongoing projects that could be 'digested' so interesting parties can help review: modules, http2, fs/promises and promise APIs.

@mafintosh
Copy link
Contributor

Great idea to bring this up at the summit 👍

@Tiriel
Copy link

Tiriel commented Apr 19, 2018

I'll throw in my two cents again since some of this is pretty much like things currently in discussions in the CommComm restructuring effort.

Encourage people to join subsystem teams during the on-boarding process and establish a process for the teams meeting and filing monthly/bi-monthly reports to the TSC about subsystem status and pending issues.

This is precisely the point, at least in the CommComm. We've noticed people tend to think you have to join the CommComm/contribute in the CommComm repo to be a "CommComm contributor". I think there may be a bit of the same thing at work here, and that may prove to be an org-wide problem. We may have to improve the recognition of work done in the subteams as "real contribution".

Anyway, real discussion material there 😅

@mhdawson
Copy link
Member

+1, several good ideas suggested already and definitely a good topic for the summit.

@benjamingr
Copy link
Author

benjamingr commented Apr 21, 2018

Another thing we can suggest is letting teams decide on a tracking policy.

That way teams can take longer than 72h (over weekends) or less (fast-stack) to land PRs.

Some use cases:

  • A (the) documentation team can decide to allow fast-tracking documentation changes that fix typos
  • The modules team can decide to postpone landing PRs until a modules team meeting.
  • The test team can decide to fast-track changes as part of a particular effort (changing assert to assert.strictEquals for example.

All such request would be part of the monthly (by monthly) interaction of the team with the TSC who'd approve/vote on them.

@ryzokuken
Copy link
Contributor

A hard +1 for stronger, more independent subsystem teams. We should definitely encourage people to join any subsystem teams they're interested in working with, but IMHO we'd need to properly incentivize in order to make it more successful.

This could be easily done by giving the subsystem teams perhaps a little more autonomy (they're still very much answerable to the TSC, more on below).

This could be done by allowing the subsystem teams to have atleast a bi-weekly meeting where they triage relevant issues and discuss PRs. Only members of the concerned subsystem team would land a given PR, making sure it aligns well with the current position of the team. Each team would send a single member to represent them in TSC meetings (we could rotate people or elect them, may vary from team to team) who would report the TSC on the current status of the subsystem and take directions in case they need some.

How does that sound?

@mcollina
Copy link
Member

Considering that this would be a short presentation and a workshop at the summit, it would be fantastic if in the presentation you sum up the current pain points and the goals for the workshop session. I think it would be very nice if at the end of it we can come up with some process recommendation for the TSC to adopt and iterate on.

@apapirovski
Copy link

This could be done by allowing the subsystem teams to have atleast a bi-weekly meeting where they triage relevant issues and discuss PRs. Only members of the concerned subsystem team would land a given PR, making sure it aligns well with the current position of the team. Each team would send a single member to represent them in TSC meetings (we could rotate people or elect them, may vary from team to team) who would report the TSC on the current status of the subsystem and take directions in case they need some.

I'm strongly -1 on this approach. It will just add an extra thing to keep track of on top of the many PRs, and create an in-group of those who can afford to attend 3+ meetings every two weeks — adding to many collaborators already busy schedules. As someone that contributes to 3 teams regularly and another 2-3 fairly often, I would just have to quit any paid work and do open source full time.

@Trott
Copy link
Contributor

Trott commented Apr 22, 2018

I agree with @apapirovski above. More generally, when trying to address any issues (not just this one) at the summit, it would be best to avoid if at all possible:

  • more procedures (we already have more than anyone bothers to keep track of and many already get ignored)
  • more meetings (anything that can be done asynchronously should be done asynchronously--yes, I acknowledge the irony of saying this in an issue tracker devoted to a synchronous meeting of worldwide participants--the twice-a-year face-to-face meeting is important but we should still try to do as much as possible outside of such events)
  • more in-groups (which is not to say more groups is a problem, and differentiating in-groups from groups that aren't in-groups can be tricky, so this is unfortunately subjective)

Now that I've made my "don't" list, here's my "do" list of things to try to do whenever feasible:

  • streamlining, clarifying, or eliminating procedures
  • leaning more heavily on widely-adopted tools for collaboration like GitHub and Google Docs and less heavily on tools specific to our project that only a few people understand
  • decreasing reliance on small groups of people--the fact that Important Person X didn't see a pull request shouldn't mean that it can't land or that it is at high risk of getting reverted if it does land
  • automation

These are guidelines and not hard rules. (For example, I'm much more open to new procedures if we are emulating another project that has already solved a problem we are trying to solve.)

@Trott
Copy link
Contributor

Trott commented Apr 22, 2018

Possible things that might help with this specific issue:

  • Consider trying to solve the "getting from first-contribution to Collaborator" issue because our conversion/retention there is not great. (It's getting better, I believe. But there's room for improvement.)

  • Figure out how to deploy the people we already have more effectively. Like, there are clearly some knowledgable folks with limited time and maybe we can talk directly to them about what we can do to make things more predictable for them. Some ideas that probably won't work but just to start: Focused groups can maybe try to have a rotation for paying attention to pull requests and the issue tracker. (Like I said, these ideas probably won't fly.) Or maybe people are available but get frustrated with something specific and we can at least identify what it is and what could change to increase their participation.

  • Perhaps someone can brainstorm an automated tool that would help with notifications? Or maybe one already exists?

  • We could work with GitHub. I know several of us are in the early-adopter program thing they have set up to talk about features. For this, the one thing I wish GitHub would let me do is filter mention-based notifications so that I can have direct mentions and one or two teams I'm on show up in a high-priority list, but have all other team mentions show up in a different list. (I need to be on some teams for privileges, but I don't need the notifications as much as I do for other teams.) Now, this may be something I want and no one else would find useful, but the point is that we can actually ask for these kinds of things, and we might even get them.

  • Maybe there's some way to mark PRs as mostly-cosmetic (doc formatting, test refactoring)?

  • Or maybe we need a more clearly-defined RFC type process for new features? (I know, I said avoid new procedures. But in this case, if there is a community that has done this well that we want to emulate, it is worth learning from them. I'm looking at you, Rust people.)

@richardlau
Copy link

richardlau commented Apr 22, 2018

We could work with GitHub.

I wish there was a more formal way to conduct and record the result of polls. The current ways seems to either be reactions (from which you can't tell if someone changes their vote after a decision has been reached) or tallying comments (which adds lots of noise to the PR/issue). I'm thinking here about approvals for fast tracking, but this could equally be useful for TSC votes.

Maybe there's some way to mark PRs as mostly-cosmetic (doc formatting, test refactoring)?

Labels?

@benjamingr
Copy link
Author

I also prefer not adding something like a biweekly meeting as it would require a lot more engagement. I was thinking giving the # of meetings to each with teams presenting to the TSC once every two months.

I agree about automation @Trott, the idea of making teams more autonomous is to do less nodejs/collaborators pings - in general I think we should aim to greatly reduce the amount of churn and things someone has to read in order to contribute. That was the point of #64 (comment) at least.

Perhaps someone can brainstorm an automated tool that would help with notifications? Or maybe one already exists?

I don't think we're there yet - I think we should discuss and decide how many notifications people would expect. Stuff that is not urgent can be digested and pulled instead of pushed - stuff that is (like stuff stalling PRs) should be pushed and there should be an obvious way to tell the difference.

decreasing reliance on small groups of people--the fact that Important Person X didn't see a pull request shouldn't mean that it can't land or that it is at high risk of getting reverted if it does land

👍

We could work with GitHub. ...

That would help - but I think this is more of a cultural decision of when to ping who.

@vsemozhetbyt
Copy link

Can we make github-bot visit stalled PRs and cc focused teams according to labels?

@benjamingr
Copy link
Author

@vsemozhetbyt that would trigger more pings overall - I think there we should be aiming for more focused pings rather than more pings.

@joyeecheung
Copy link
Collaborator

@vsemozhetbyt The CODEOWNERS file can do that https://help.github.com/articles/about-codeowners/ but I am not sure pinging teams helps the issue or simply makes it worse

@richardlau
Copy link

The CODEOWNERS file can do that https://help.github.com/articles/about-codeowners/ but I am not sure pinging teams helps the issue or simply makes it worse

Interesting, it looks like we could use this to enforce things like nodejs/node#6178? I wouldn't want to do it across all of core, but it might make sense for things such as streams.

@trivikr
Copy link
Contributor

trivikr commented Apr 23, 2018

Can we make github-bot visit stalled PRs and cc focused teams according to labels?

This is a very good idea @vsemozhetbyt
@benjamingr If more pings are an issue, we can set up frequency of the github-bot visits accordingly. For example

  • the bot can run once a week - may be couple of hours before collaborators are most active during the week.
  • the bot can ping on PRs which have no active for last n days - where n can be 4 or 7 or 10
  • there's an option to unsubscribe bot updates by tagging the bot and saying "unsubscribe" or similar word

What do you think?
These suggestions are only for PRs inactive for long. You can propose different frequencies, if you feel that's good.

@Trott
Copy link
Contributor

Trott commented Apr 23, 2018

Here's the problem that I think needs to be addressed:

There are people in leadership positions in the project who don't have sufficient time to review all the things that they have a stake in and need to care about.

AFAICT, the problem is not "people aren't getting pinged" or "people aren't getting pinged enough" or "we aren't moving fast enough". On the contrary, people seem to be ignoring notifications because they're already overwhelming.

Is the fact that some PRs stall a problem? Maybe. Do some PRs take too long to get adequate review? Sometimes. But those are (IMO) mere nuisances compared to the after-the-fact "What? When did this happen? This needs to be reverted!" from Someone Everyone Else Listens To.

I think the problems here are very difficult:

  • How can we make it so people can be more available to the project?
  • How can we make it so people can get more done with the project in less time?
  • Specifically, how can we make reviewing PRs more efficient? (Smaller and fewer PRs, but that would negatively impact velocity and other things. But maybe there are other possibilities here?)
  • How can we make it so people who feel broad ownership over parts of the project feel comfortable with a narrower focus and/or trusting other people to look out for the same things they would? (Or does that last one even matter?)
  • How can we make it easier for people to temporarily or permanently remove themselves from positions of responsibility without it being A Big Deal when it happens? (I think the Emeritus stuff has been a good step for this.) What if we make it too easy and we end up with a brain-drain for the project?
  • Is there a point where velocity is too fast and we should slow down? If so, how do we know when we've reached that point and what do we do about it? Are we possibly at that point already?

@Trott
Copy link
Contributor

Trott commented Apr 23, 2018

I don't feel 100% on these suggestions but I suspect they're worth at least considering:

  • Change the 48/72 hour rule to a flat 72 hour rule. (Small doc and test changes can still get fast-tracked, of course.)

  • Implement the CODEOWNERS thing for parts of the code, if feasible. So streams changes would require at least one member of the nodejs/streams team to sign off. Maybe do the same for timers and process. It's possible for PRs to get stalled but they can always be escalated to the TSC if a GitHub team stalls out on one of them.

@benjamingr
Copy link
Author

I'd like just to say that I completely agree with @Trott :

AFAICT, the problem is not "people aren't getting pinged" or "people aren't getting pinged enough" or "we aren't moving fast enough". On the contrary, people seem to be ignoring notifications because they're already overwhelming.

The reason I suggested team pings is in order to greatly reduce overall pings and removing most of nodejs/collaborators pings.

I think that if we make interaction less overwhelming people would naturally be more available and willing to engage in the project.

@trivikr
Copy link
Contributor

trivikr commented Apr 23, 2018

Some of the answers which I can think of:

How can we make it so people can be more available to the project?

Encourage companies who use Node.js to employ some developers to work full-time or part-time on Node core. We should brainstorm on how to incentivize these companies to contribute back to Open Source.

How can we make it so people can get more done with the project in less time?

  • While writing the code - Form teams for making Node.js core contributions easy with their favorite editors. For example, I use this launch config to run single tests in VSCode and it is very helpful. I would love to join a team who can make Node.js code development in VSCode simpler, and would love if someone else does it for their favorite editor - like Atom/SublimeText etc.
  • After the code is written - Can we reduce build times some way?
  • When PR is posted - one-click option to run CI with default rebase with master? one-click WebUI to rebase from upstream master for PRs?
  • Reviewing the code - check with Github if we can contribute to their Code Review system to make it better?
  • Landing the commit - node-core-utils has done a great job!

Specifically, how can we make reviewing PRs more efficient? (Smaller and fewer PRs, but that would negatively impact velocity and other things. But maybe there are other possibilities here?)

May be we should revisit why there's so much code in a single repository, and split it to multiple ones? I just became a collaborator last month, so don't know the history on why these decisions were taken.

@devsnek
Copy link

devsnek commented Apr 23, 2018

On the subject of incentives, the majority of the node collabs are not paid to be here and as much as we all enjoy programming i think being a node collab sometimes goes beyond enjoyment and into "work." maybe we can look into getting some small incentives for collabs such as time on hosting services for x prs reviewed or something like that.

Going back to the "work" aspect, i think we should also think about ways to make contributing to node more enjoyable, maybe even fun. I'm here just 'cause i enjoy working on the code but getting a change into node is not what i would describe as an enjoyable process. Over time node seems to becoming more and more fixed and official and business-y which is fine 'cause we're not exactly a small project but i think we need to find ways to help reduce the stress of the process.

@benjamingr
Copy link
Author

benjamingr commented Apr 23, 2018

maybe we can look into getting some small incentives for collabs such as time on hosting services for x prs reviewed or something like that.

Honestly anything financial would not incentivize me (it would mostly just take the fun out of it). The only financial instrument that actually helps (IMO) is the travel fund which I personally am going to use this May for the first time in order to attend the summit.

but i think we need to find ways to help reduce the stress of the process.

I think that a significant amount of discussion in the CC is geared towards that. It's a broader overall issue.

@apapirovski
Copy link

Is there a point where velocity is too fast and we should slow down? If so, how do we know when we've reached that point and what do we do about it? Are we possibly at that point already?

This is only somewhat related but I think it's worth considering having formal stretches of time where we dedicate all collaborator time to fixing issues from the tracker rather than adding new features / refactoring. Perhaps these could take a form of an online mini-summit where people can also pair up based on their interests and areas of expertise.

Debugging and/or fixing some of these edge case issues isn't super fun (especially when it's some obscure single platform specific issue) but I think having collaborators working together to do so might be quite engaging and enable people to learn something new.

We're at 593 issues right now and I don't think there are that many left that can easily be closed down without providing an actual fix. That's pretty intimidating and overwhelming. I don't think we'll get to 0 but I think everyone would feel like they have a slightly better handle on the project if we got it down to 200-300.

@Trott
Copy link
Contributor

Trott commented Apr 23, 2018

Encourage companies who use Node.js to employ some developers to work full-time or part-time on Node core. We should brainstorm on how to incentivize these companies to contribute back to Open Source.

This is an important thing to talk about from time to time in general. For this specific issue, here's the thing: The people we're talking about already get paid to work on Node.js.

Aside: The "get more companies to pay people to work on the project" thing is more complicated than it may appear at first, even putting aside the practical challenges of persuading companies to donate significant resources for the common good. I don't want a project where the only way you can contribute more than marginally is with the support of a business entity. On the other hand, I also don't want a project where big companies benefit from free volunteer labor without giving back. How do you improve the balance from where it is now? (It's not an easy question. Our current coexistence of volunteers, partially-paid-by-their-employers folks, a small number of fully-paid-by-their-employers folks, and corporate foundation membership providing financial resources...it's a good setup compared to some of the easily-imaginable alternatives.)

While writing the code - Form teams for making Node.js core contributions easy with their favorite editors. For example, I use this launch config to run single tests in VSCode and it is very helpful. I would love to join a team who can make Node.js code development in VSCode simpler, and would love if someone else does it for their favorite editor - like Atom/SublimeText etc.

Just in case this isn't obvious: There is nothing stopping you from forming a group like that. If it's something you'd love, consider trying to make it happen. Groups often form ad hoc to do work and only get officially chartered at a later date, or don't even bother getting officially chartered at all. If you would love a team like that, you can make it happen as long as you're willing to do the work to get the team up and running and keep it running. (That can be the hard part!) If you're super-motivated to make it happen, maybe open a discussion board topic to find like-minded folks and/or invite like-minded folks if you already know some.

@benjamingr
Copy link
Author

I don't want a project where the only way you can contribute more than marginally is with the support of a business entity.

Precisely, I can name a project or two that realistically work that way (think browsers) and I don't think that we want to follow that path.

or don't even bother getting officially chartered at all.

I'd like to emphasize this point - there was lot of interesting discussion in teams that never made it to working groups. Chartering is only important if you need to make decisions autonomously - it's typically better to go through the TSC and get feedback.

@benjamingr
Copy link
Author

Ping @nodejs/collaborators - I'd like to work on this this weekend and I would like to solicit opinions.

I'm planning for this to be mostly a discussion where we hear out the thoughts collaborators have on this issue and discuss what to optimize for. I am not aiming to present solutions yet.

My hope is to work on something to present to the TSC and CommComm after the summit that would be actionable.

@benjamingr
Copy link
Author

Thanks a lot for that! We'll discuss this in the summit.

Nitpick comments are demotivating

I'm definitely in the camp of "these help me improve" and I enjoy reading nitpicks, but maybe that's just me.

@mhdawson
Copy link
Member

I definitely think using part of the time at the summit to discuss what we do when the CI is unstable is important. I think we need some criteria (not sure exactly what) that when hit (or maybe a process through which collaborators can request it) we say "stop" and focus on getting the CI back to green.

@mmarchini
Copy link

We might consider looking how other OSS projects handle this.

@refack
Copy link

refack commented May 29, 2018

regarding CI and test fail traceability - most subjobs have been moved to a better Jenkins test parsing plugin, so the need to check console output should be much reduced.
image
Gives you a direct link to failure output

@mmarchini
Copy link

@refack it's definitely a nice start! What do we need to do to have this on all node-test-commit subjobs (I can help with that)? Also, what happens if there's a build errors or infra issue?

BTW, @joyeecheung has a PR to node-core-utils to make traceability easier too: nodejs/node-core-utils#161

@mmarchini
Copy link

mmarchini commented May 30, 2018

I think we need some criteria (not sure exactly what) that when hit (or maybe a process through which collaborators can request it) we say "stop" and focus on getting the CI back to green

We could have two criterias ¹ ²

  • If the CI is red, don't land a PR until it's green or yellow
    • If we identify a flaky test during an unrelated PR, either add a flaky status to it or fix it (if it's an easy fix)
    • If it's an infra issue, wait until it's fixed.
  • When we reach a certain flakiness treshold (let's say, 20 known flaky tests), block everything from landing until we get back to < 20.

Again, we're talking about the project "moving too fast", there's no problem in slowing it down in exchange for a more stable CI.

It would be nice if marking a test as flaky was smoother (today we need to open a PR, get approvals + usually fast-track approvals, then go through the landing process).

¹ Both criterias should be enforced by a tool (for example, GitHub interface). We shouldn't have to rely on a policy to enforce them, because it won't work (waiting for 2 approvals to fast-track a PRs sometimes is ignores, for example)
² We might have to consider a lighter policy for security patches.

@MylesBorins
Copy link
Contributor

MylesBorins commented May 30, 2018 via email

@tniessen
Copy link

People shouldn't land PRs. To enforce the last item, CI runs and PR landing should be done by a bot. Collaborators could either add a label requesting a CI run or add a comment mentioning the bot, and once there's enough approvals and the minimum time has passed, the bot could try to land the PR.

I believe we briefly discussed this at the last summit in October, a commit queue sounds good in theory and thanks to @joyeecheung and many other great people, we now have git node land which is a great first step towards automation. There is still a ton of work to do in this area, though.

@joyeecheung
Copy link
Collaborator

Something we should discuss during the collaboration summi

@joyeecheung
Copy link
Collaborator

Another idea (I am not sure how we will make an agenda for this session so I will just log it here in case I forgot): we should make a repo for the flaky tests - right now the related issues are scattered in build and the core. We need some effort focusing on this, some records tracking the flakiness maybe, that should happen in a dedicated space to reduce the noise.

@mmarchini
Copy link

@joyeecheung I agree, although we might need to think how to handle tests we think are flaky but are actually highlighting a bug/design flaw in our code (see nodejs/node#21012 (comment) for example). Maybe if we identify there's an actual bug in place, we should move the issue from the flaky-tracking repo to nodejs/node?

@benjamingr
Copy link
Author

@joyeecheung leaving a comment here works, I want to go through the suggestions in this issue and discuss them during the session.

@joyeecheung
Copy link
Collaborator

@mmarchini SGTM. The repo is there to reduce the noise and it would be easier for people to quickly understand what's going on in the CI in case they have not checked it out for a while.

@richardlau
Copy link

Do we really need a separate repo? Why not labels and/or a project?

@joyeecheung
Copy link
Collaborator

joyeecheung commented May 31, 2018

@richardlau Because being scattered around in the core repo makes them harder to find (the flaky label is only applied to flaky tests, not build flakes). Also a lot of issues are build issues or something combined, it would be easier to search for relevant failures if the issues are in one repo when you see failures in the CI and don't know where to start (I search in my emails but not everybody subscribes to both nodejs/node and nodejs/build)

@joyeecheung
Copy link
Collaborator

Also the flaky test issues are making the core repo even more noisy, they could be easier to work with if we deal with those in a separate repo that people can actually subscribe to.

@richardlau
Copy link

https://github.com/nodejs/node/projects/8 already exists. My concerns with a separate repository is that it reinforces the idea that flaky tests are someone else's problem to fix.

@joyeecheung
Copy link
Collaborator

joyeecheung commented May 31, 2018

My concerns with a separate repository is that it reinforces the idea that flaky tests are someone else's problem to fix.

@richardlau I am afraid people who don't work on flaky tests don't work on them already (I sometimes work on those but when I don't have the time to specifically work on flakes and stop following, I am completely ignorant about what's going on in the CI).

The project reminds me that if we have a separate repo for the issue, we could apply different sets of labels for them and find relevant failures - e.g. some tests crash in a similar way, some tests are sensitive to GC and timing, etc. Being able to find the relationship would help fixing those if we have difficulty reproducing flakes.

@joyeecheung
Copy link
Collaborator

Speaker queue: https://tcq.app/meeting/6tS3

@Trott Trott closed this as completed Jun 1, 2018
@gtewallace
Copy link

@benjamingr have a look at Review Board open source code review https://www.reviewboard.org/

@mmarchini
Copy link

Is there a minute or list of actions from this session?

@Trott
Copy link
Contributor

Trott commented Jun 1, 2018

@mmarchini Not really. (Someone correct me if I'm wrong, though!) I think it's one of the downsides of the plenary sessions. They tend to generate a lot of conversation but not much in terms of ongoing action the way you get with small focused groups working on problems.

In the afternoon, we had a related break-out session on CI and came up with these action items:

@mmarchini
Copy link

Thanks @Trott!

I'm interested in participating more in the Build WG (currently participating in a few issues, working on a CI Job Health Dashboard and llnode jobs on Jenkins (not sure we still want to pursue this last one though).

I'm also interested in helping the Commit Queue Prototyping Team, if possible.

@mmarchini
Copy link

Template for reporting flakes: last known good revision

Would be nice to have a combined stress single test job with a git blame-like feature to find out the lkgr for a given flaky test.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests