Skip to content

Conversation

@github-actions
Copy link

@github-actions github-actions bot commented Oct 2, 2025

  • Extract component for expand and collapse button group

  • Refactor toggle groups buttons
    (cherry picked from commit 6d1991d)

Co-authored-by: Pierre Jeambrun pierrejbrun@gmail.com

* Extract component for expand and collapse button group

* Refactor toggle groups buttons
(cherry picked from commit 6d1991d)

Co-authored-by: Pierre Jeambrun <pierrejbrun@gmail.com>
@boring-cyborg boring-cyborg bot added the area:UI Related to UI/UX. For Frontend Developers. label Oct 2, 2025
@pierrejeambrun pierrejeambrun marked this pull request as ready for review October 2, 2025 23:45
@pierrejeambrun
Copy link
Member

Still the same unrelated CI error, merging

@pierrejeambrun pierrejeambrun merged commit 1e45030 into v3-1-test Oct 3, 2025
42 of 43 checks passed
@pierrejeambrun pierrejeambrun deleted the backport-6d1991d-v3-1-test branch October 3, 2025 08:08
@potiuk
Copy link
Member

potiuk commented Oct 3, 2025

Still the same unrelated CI error, merging

Proposal (i feel like a ghost writing it still from vacations).

I think (and I might want to send a message to devlist after I am back from vacations - or rather after we come back from the summit) - that i is it's a bad idea to just "merge" PRs to main/vX-N-test with "same error as usual" especially if such error causes "no tests running at all" - without actually:

  • knowing that the error is being worked on by someone
  • attempting to find out which merge PR caused an issue (and reverting it if we can find out easily and revert easily)
  • opening an issue explaining the error - and raising it in #internal-airflow-ci-cd if it's not easy/obvious what caused the issue

I belive we discussed it long time ago and this was more or less what we agreed.

For now cc:-ing those who had some relations with backporting in the last few days: @bbovenzi @eladkal @kaxil @gopidesupavan @amoghrajesh @ashb

In this case likely rebase would have helped (at least with image building issue that were there for the last few days). I fixed failing CI build few hours ago (from vacations) #56352 (comment) explaining the reason - with comment that it's likely not a good idea to merge PRs to v3-1-test before you solve the underlying issue.

This is possibly making release manager job harder.

I think some attempt should be made when we merge such PRs before merging it blindly without any tests running to v3--test similarly as we usually do for main.

It literally makes no sense to merge such change when NONE of the tests were actually running. Merging it has no verification whatsoever, so you might as well wait with merging it until the issue in v3-1-test is fixed. I could understand if there is a "known" issue and our PR has unrelated errors - but in this cases no tests were running at all.

And In this case it was pretty clear that the offending PR was #56208 (that was first that failed in v3-1-test branch (previous one was green - this is quite visible here:

When you look at things merged to v3-1-test

*https://github.com/apache/airflow/commits/v3-1-test/

You can see that in #56208 number of tests run dropped dramatically:

Screenshot 2025-10-03 at 01 20 41

So in this case, getting rid of the big issue was as easy as reverting the #56208. I think otherwise, release manager might have really difficult job to chase and fix all the issues instroduced by no test run at all.

@pierrejeambrun
Copy link
Member

pierrejeambrun commented Oct 3, 2025

I understand.

I didn't have time to look into it at the time, so it was either merge it because failure were unreleated (and I'm confident about the change introduced here), or do nothing and also add work to release manager because he would have to cherry pick much more.

I figured merging was more helpful, also if I recall correctly the release process, we cherry pick to the test branch manually without running the CI at all, and fix the CI at the end when we do the sync PR with stable branch. (but maybe that's reserved to the RM?)

But I'm also fine letting the PRs open, or closing them and let the RM to the cherry picking later when the backport CI is fixed. (With all that implies, letting them open until someone actually do something about it by specifically targeting those PRs. (I usually don't but will have to if we want to work like this)).

@potiuk
Copy link
Member

potiuk commented Oct 3, 2025

we cherry pick to the test branch manually without running the CI at all

This also effectively runs PR on schedule now (few times a day) and when things fail, the errors are posted to #internal-airflow-ci-cd. This was happening continuously - see alerts here for example https://apache-airflow.slack.com/archives/C015SLQF059/p1759424572437079 - this is what dragged my attention, I saw that those failures were continuously happening day after day over last few days.

@potiuk
Copy link
Member

potiuk commented Oct 3, 2025

and fix the CI at the end when we do the sync PR with stable branch. (but maybe that's reserved to the RM?)

That often takes an order of magnitude more effort, because you have many failures and you have no idea were they came from. Which is exactly what I am afraid and try to protect the time of release manager who should generally not chase issues from days or weeks ago, I think (that's more or less the effect and why we introduced the notifications in slack as well)

Note that I am not criticising, just trying to show the mechanism "while it's hot" to get a little more empathising (with RM) and raise understanding / awareness why this is important.

@potiuk
Copy link
Member

potiuk commented Oct 3, 2025

Also.... We talk about it with @amoghrajesh and @gopidesupavan (remotely) at the Summit ... so .... come to listen to us :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area:UI Related to UI/UX. For Frontend Developers.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants