[HOLD for payment 2024-11-11] [HOLD for payment 2024-11-01] E2E Testing: PRs merged after a regression all get marked as regression as well #48085

hannojg · 2024-08-27T13:07:06Z

With the current architecture of the e2e tests there is a problem where:

A new release is created; it becomes the baseline for the e2e performance regression testing
PR1 is merged, which introduces a performance regression
PR2 is merged with no performance regression
PR3 is merged with no performance regression

However, all PRs are compared against the same baseline. Thus not only PR1 gets marked as a deploy blocker for having introduced a performance regression, but also PR2 and PR3.

This creates confusion and additional spam in Slack.

As discussed here, the proposed solution is, for each merge commit, to create a new baseline build from the previous merge commit and compare the new commit against that build.
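To illustrate the difference between the two baseline strategies, here is a minimal sketch. The durations, the 10% threshold, and all names (`Commit`, `flag_against_release`, `flag_against_previous`) are hypothetical and not taken from the actual e2e pipeline.

```python
from dataclasses import dataclass


@dataclass
class Commit:
    name: str
    duration_ms: float  # hypothetical measured duration of one e2e scenario


THRESHOLD = 0.10  # assumption: flag anything more than 10% slower than the baseline


def is_regression(current_ms: float, baseline_ms: float) -> bool:
    return current_ms > baseline_ms * (1 + THRESHOLD)


def flag_against_release(release: Commit, commits: list[Commit]) -> list[str]:
    """Current behaviour: every merge is compared against the release build."""
    return [c.name for c in commits if is_regression(c.duration_ms, release.duration_ms)]


def flag_against_previous(release: Commit, commits: list[Commit]) -> list[str]:
    """Proposed behaviour: every merge is compared against the previous merge."""
    flagged = []
    previous = release
    for c in commits:
        if is_regression(c.duration_ms, previous.duration_ms):
            flagged.append(c.name)
        previous = c  # the next merge uses this commit as its baseline
    return flagged


release = Commit("release", 100.0)
merged = [Commit("PR1", 130.0), Commit("PR2", 131.0), Commit("PR3", 129.0)]

print(flag_against_release(release, merged))   # all three PRs are flagged
print(flag_against_previous(release, merged))  # only PR1 is flagged
```

With a shared release baseline the slowdown introduced by PR1 is attributed to every later merge, while the previous-commit baseline attributes it only to the merge that caused it.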

hannojg · 2024-08-27T13:07:30Z

cc @kirillzyusko can you comment here so it can be assigned to you? Thanks!

mountiny · 2024-08-27T13:21:06Z

cc @AndrewGable does this sound good to you?

AndrewGable · 2024-08-27T15:58:44Z

Yes sounds great!

mountiny · 2024-08-27T22:05:54Z

@kirillzyusko, are you going to take this one on? Can you comment here if so? thanks

kirillzyusko · 2024-08-28T08:18:54Z

Yeah, feel free to assign this to me 🙌

mountiny · 2024-09-06T10:57:28Z

not overdue

kirillzyusko · 2024-09-11T10:26:25Z

I think we discussed this internally with Hanno, but I'll post here as well.

We think that we should postpone this PR because the current e2e tests are not very stable (i.e. sometimes they pass, sometimes they don't), and with such an unstable pipeline, assembling builds for each new commit may lead to uncaught regressions, e.g.:

A <- first commit in chain, e2e passed, no regression detected
|
|
B <- commit that introduces a regression, e2e tests fail
|
|
C <- e2e tests pass again, but we compare against a build that already contains the regression, so the regression goes unnoticed

So I think we need to have a good e2e pipeline first and only then go ahead with this PR 👀
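One way to read the A/B/C chain above is sketched below. It assumes that the run for B fails for pipeline reasons and therefore yields no performance verdict. The durations, the 10% threshold, and the function names are hypothetical.

```python
from typing import Optional

THRESHOLD = 0.10  # assumption: more than 10% slower than the baseline counts as a regression


def verdict(current_ms: float, baseline_ms: float, run_completed: bool) -> Optional[bool]:
    """Return True/False for regression/no regression, or None if the run gave no result."""
    if not run_completed:
        return None  # unstable pipeline: the run failed before producing a comparison
    return current_ms > baseline_ms * (1 + THRESHOLD)


# (name, duration in ms, whether the e2e run completed)
chain = [("A", 100.0, True), ("B", 130.0, False), ("C", 130.0, True)]


def verdicts_vs_previous(chain, release_ms: float) -> dict:
    results = {}
    baseline_ms = release_ms
    for name, duration_ms, completed in chain:
        results[name] = verdict(duration_ms, baseline_ms, completed)
        baseline_ms = duration_ms  # next commit is compared against this one
    return results


def verdicts_vs_release(chain, release_ms: float) -> dict:
    return {name: verdict(ms, release_ms, completed) for name, ms, completed in chain}


print(verdicts_vs_previous(chain, release_ms=100.0))
print(verdicts_vs_release(chain, release_ms=100.0))
```

In this sketch the previous-commit strategy never reports the slowdown: B produces no verdict and C looks fine relative to B. The fixed release baseline would still flag C, which is why the stability of the pipeline matters before switching strategies.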

mountiny · 2024-09-15T23:39:44Z

Sounds good, are you or someone else actively working on fixing the pipeline?

kirillzyusko · 2024-09-16T10:42:58Z

> Sounds good, are you or someone else actively working on fixing the pipeline?

@mountiny the current e2e pipeline is slightly unstable, but we need to gather logs/videos for all the latest failures to determine what needs to be fixed first.

From what I saw - we have several major problems:

sometimes tests fail after 2-3 minutes - looks like a job cannot be started properly;
sometimes tests fail after 9-10 minutes - it means that something is wrong with the first test;
sometimes the third test fails.

I think we need to gather some analytics on which failure is the most frequent, collect logs for that failure and try to fix it. How does that sound to you?

I can prepare analytics based on the latest failures (and attach links to all of them), but I need someone to help me get the logs + videos. Would you be able to help me? 👀
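The kind of failure tally described above could look roughly like this. It is only a sketch: the `Failure` record, the category names, the time cut-off, and the sample data are all hypothetical and do not come from real pipeline logs.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass
class Failure:
    run_id: str                        # placeholder identifier, not a real run
    minutes_until_failure: float
    failed_test_index: Optional[int]   # 1-based index of the failing test, if known


def classify(failure: Failure) -> str:
    # Assumption: a failure within the first few minutes means the job never started properly.
    if failure.minutes_until_failure < 4:
        return "job start"
    if failure.failed_test_index == 1:
        return "first test"
    if failure.failed_test_index == 3:
        return "third test"
    return "other"


# Made-up sample data, only to show the shape of the report.
failures = [
    Failure("run-1", 2.5, None),
    Failure("run-2", 9.0, 1),
    Failure("run-3", 3.0, None),
    Failure("run-4", 25.0, 3),
    Failure("run-5", 10.0, 1),
    Failure("run-6", 2.0, None),
]

tally = Counter(classify(f) for f in failures)
for category, count in tally.most_common():
    print(f"{category}: {count}")
```

Sorting by frequency makes it easy to pick the failure category to investigate first and to attach the matching runs to the report.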

mountiny · 2024-09-16T12:09:28Z

Provided some more logs in DM and also created an issue to give you and some other people from Margelo access to the device farm

mountiny · 2024-09-27T15:49:44Z

@kirillzyusko how is this looking?

kirillzyusko · 2024-09-30T13:47:00Z

@mountiny I prepared two PRs:

[NoQA] e2e: allow warmup failures #49649 (should fix a problem with failure after 8-10 minutes);
[NoQA] fix: add retry builds #49925 (should fix random build failures).

The remaining problem will be AWS scheduling - my assumption is that we'll need to add a retry mechanism to that step (I think we can try to use https://github.com/marketplace/actions/retry-action).

But overall, with these 2 fixes the tests should become pretty stable (we should have 1-2 failures per week, which I think is acceptable).
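The general idea of retrying a flaky step, such as the scheduling step mentioned above, can be sketched as follows. This is a generic illustration and not how the linked action or the real workflow is implemented. `retry`, `make_flaky_step`, and the attempt counts are hypothetical.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry(step: Callable[[], T], attempts: int = 3, delay_seconds: float = 0.0) -> T:
    """Run `step` until it succeeds or `attempts` tries have been used."""
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return step()
        except Exception as error:  # a real pipeline would catch narrower errors
            last_error = error
            if attempt < attempts:
                time.sleep(delay_seconds)
    raise RuntimeError(f"step failed after {attempts} attempts") from last_error


def make_flaky_step(failures_before_success: int) -> Callable[[], str]:
    """Hypothetical stand-in for a step that fails a few times before succeeding."""
    state = {"calls": 0}

    def step() -> str:
        state["calls"] += 1
        if state["calls"] <= failures_before_success:
            raise ConnectionError("scheduling failed")
        return "scheduled"

    return step


print(retry(make_flaky_step(2), attempts=3))  # succeeds on the third attempt
```

Retries hide transient failures, so it is worth keeping the number of attempts small and logging every failed attempt so that persistent problems remain visible.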

mountiny · 2024-10-07T14:23:40Z

Bumped the PRs

mountiny · 2024-10-16T08:59:37Z

How is this looking? Can @chrispader help here? I read he has some spare cycles

kirillzyusko · 2024-10-18T10:02:28Z

@mountiny I will work on it 👀 I'll just finish react-native-app-logs first and then switch back to the e2e tests!

melvin-bot · 2024-10-25T02:16:55Z

Reviewing label has been removed, please complete the "BugZero Checklist".

melvin-bot · 2024-10-25T02:16:58Z

The solution for this issue has been 🚀 deployed to production 🚀 in version 9.0.53-1 and is now subject to a 7-day regression period 📆. Here is the list of pull requests that resolve this issue:

[NoQA] feat: build e2e baseline from previous commit #48251

If no regressions arise, payment will be issued on 2024-11-01. 🎊

For reference, here are some details about the assignees on this issue:

@kirillzyusko does not require payment (Contractor)

melvin-bot · 2024-11-01T18:02:12Z

Skipping the payment summary for this issue since all the assignees are employees or vendors. If this is incorrect, please manually add the payment summary SO.

melvin-bot · 2024-11-04T14:20:02Z

The solution for this issue has been 🚀 deployed to production 🚀 in version 9.0.56-9 and is now subject to a 7-day regression period 📆. Here is the list of pull requests that resolve this issue:

[NoQA] fix: fetch main only if we are not on main #51751

If no regressions arise, payment will be issued on 2024-11-11. 🎊

For reference, here are some details about the assignees on this issue:

@kirillzyusko does not require payment (Contractor)

melvin-bot · 2024-11-11T15:03:11Z

Skipping the payment summary for this issue since all the assignees are employees or vendors. If this is incorrect, please manually add the payment summary SO.

mountiny · 2024-11-11T15:12:29Z

This was a follow-up to improve the E2E test suite. I don't think we need a checklist in this case, as Margelo keeps slowly improving the suite. No external review was done, so no payment is required either.

mountiny self-assigned this Aug 27, 2024

mountiny added the Weekly KSv2 label Aug 27, 2024

mountiny added this to [#whatsnext] #quality Aug 27, 2024

muttmuure moved this to MEDIUM in [#whatsnext] #quality Aug 27, 2024

muttmuure assigned kirillzyusko Aug 28, 2024

kirillzyusko mentioned this issue Aug 29, 2024

[NoQA] feat: build e2e baseline from previous commit #48251

Merged

48 tasks

melvin-bot bot added the Overdue label Sep 5, 2024

melvin-bot bot removed the Overdue label Sep 6, 2024

melvin-bot bot added the Overdue label Sep 24, 2024

melvin-bot bot removed the Overdue label Sep 27, 2024

muttmuure moved this from MEDIUM to HIGH in [#whatsnext] #quality Oct 15, 2024

melvin-bot bot added the Overdue label Oct 15, 2024

melvin-bot bot removed the Overdue label Oct 16, 2024

melvin-bot bot added Reviewing Has a PR in review and removed Weekly KSv2 labels Oct 21, 2024

melvin-bot bot added Weekly KSv2 Awaiting Payment Auto-added when associated PR is deployed to production and removed Weekly KSv2 labels Oct 21, 2024

melvin-bot bot changed the title ~~E2E Testing: PRs merged after a regression all get marked as regression as well~~ [HOLD for payment 2024-11-01] E2E Testing: PRs merged after a regression all get marked as regression as well Oct 25, 2024

melvin-bot bot removed the Reviewing Has a PR in review label Oct 25, 2024

mountiny mentioned this issue Oct 30, 2024

[NoQA] fix: fetch main only if we are not on main #51751

Merged

47 tasks

melvin-bot bot added Daily KSv2 and removed Weekly KSv2 labels Oct 31, 2024

melvin-bot bot added Overdue Weekly KSv2 and removed Daily KSv2 labels Nov 4, 2024

melvin-bot bot changed the title ~~[HOLD for payment 2024-11-01] E2E Testing: PRs merged after a regression all get marked as regression as well~~ [HOLD for payment 2024-11-11] [HOLD for payment 2024-11-01] E2E Testing: PRs merged after a regression all get marked as regression as well Nov 4, 2024

melvin-bot bot removed the Overdue label Nov 4, 2024

mountiny closed this as completed Nov 11, 2024

github-project-automation bot moved this from HIGH to Done in [#whatsnext] #quality Nov 11, 2024
