-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[HOLD for payment 2024-11-11] [HOLD for payment 2024-11-01] E2E Testing: PRs merged after a regression all get marked as regression as well #48085
Comments
cc @kirillzyusko can you comment here so it can be assigned to you? Thanks! |
cc @AndrewGable does this sound good to you? |
Yes sounds great! |
@kirillzyusko, are you going to take this one on? Can you comment here if so? thanks |
Yeah, feel free to assign this to me 🙌 |
not overdue |
I think we discussed this internally with Hanno, but I'll post here as well. We think that we should postpone this PR, because current e2e tests are not very stable (i. e. sometimes they pass, sometimes not) and having such unstable pipeline with assembling builds for each new commit may lead to uncaught regressions, i. e.:
So I think we need to have a good e2e pipeline first and only then go ahead with this PR 👀 |
Sounds good, are you or someone else actively working on fixing the pipeline? |
@mountiny current e2e pipeline slightly unstable, but we need to have gather logs/videos for all latest failures to detect what needs to be fixed first. From what I saw - we have several major problems:
I think we need to gather kind of analytics on what is the most frequent failure, gather logs for this failure and try to fix it. How does it sound for you? I can prepare analytic based on latest failures (and attach links for all of them) but I need someone to help me to get logs + videos. Would you be able to help me? 👀 |
Provided some more logs in DM and also created an issue to give you and some other people from Margelo access to the device farm |
@kirillzyusko how is this looking |
@mountiny I prepared two PRs:
The remaining problem will be AWS scheduling - my assumption is that we'll need to add retry mechanism to that step (I think we can try to use https://github.com/marketplace/actions/retry-action). But overall with these 2 fixes tests should become pretty stable (we should have 1-2 failures per week, which I think is kind of acceptable). |
Bumped the PRs |
How is this looking? Can @chrispader help here? I read he has some spare cycles |
@mountiny I will work on it 👀 Just will finish |
|
The solution for this issue has been 🚀 deployed to production 🚀 in version 9.0.53-1 and is now subject to a 7-day regression period 📆. Here is the list of pull requests that resolve this issue: If no regressions arise, payment will be issued on 2024-11-01. 🎊 For reference, here are some details about the assignees on this issue:
|
Skipping the payment summary for this issue since all the assignees are employees or vendors. If this is incorrect, please manually add the payment summary SO. |
The solution for this issue has been 🚀 deployed to production 🚀 in version 9.0.56-9 and is now subject to a 7-day regression period 📆. Here is the list of pull requests that resolve this issue: If no regressions arise, payment will be issued on 2024-11-11. 🎊 For reference, here are some details about the assignees on this issue:
|
Skipping the payment summary for this issue since all the assignees are employees or vendors. If this is incorrect, please manually add the payment summary SO. |
This was a follow up to improve the E2E test suite, I dont think we need a checklist in this case as Margelo keeps slowly improving the suite. No external review was done so no payment required either |
With the current architecture of the e2e tests there is a problem where:
however, all PRs will be compared against the same baseline. Thus not only PR1 gets marked as deploy blocker for having introduced a performance regression, but also PR2 and PR3.
This creates confusion and additional spam in slack:
As discussed here, the proposed solution is to create for each merge commit to make a new baseline build from the previous merge commit and compare it against that one.
The text was updated successfully, but these errors were encountered: