Forbid failing incidents from being scheduled in aggregates #154

foursixnine · 2024-01-30T07:11:46Z

codecov-commenter · 2024-01-30T07:12:48Z

Codecov Report

Attention: 2 lines in your changes are missing coverage. Please review.

Comparison is base (9872fbc) 66.84% compared to head (e551352) 67.49%.
Report is 6 commits behind head on master.

Files	Patch %	Lines
openqabot/types/incident.py	96.29%	1 Missing ⚠️
openqabot/types/incidents.py	50.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #154      +/-   ##
==========================================
+ Coverage   66.84%   67.49%   +0.64%     
==========================================
  Files          24       25       +1     
  Lines        1659     1692      +33     
==========================================
+ Hits         1109     1142      +33     
  Misses        550      550

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

openqabot/types/incident.py

openqabot/types/aggregate.py

foursixnine · 2024-01-30T07:31:10Z

@okurz since you're requesting changes already, can you guide me in the test/code paths? the_demon_incident in theory should not show up, any thoughts?

openqabot/types/incident.py

openqabot/types/aggregate.py

foursixnine · 2024-01-30T11:49:25Z

@okurz can you please tag this pr as ai-assisted?

Martchus · 2024-01-30T11:57:33Z

We don't have that kind of label yet (in that repo). Are you saying you've been using AI here? If yes, why is that relevant and what would adding that tag change?

foursixnine · 2024-01-30T12:26:54Z

We don't have that kind of label yet (in that repo). Are you saying you've been using AI here? If yes, why is that relevant and what would adding that tag change?

It add traceability, of things that have been done using some sort of assistance, while not important for you, SUSE-wise it is.

Martchus

Looks generally good.

tests/test_incident.py

openqabot/types/incident.py

tests/test_incident.py

okurz · 2024-01-31T10:15:58Z

@okurz can you please tag this pr as ai-assisted?

As I explained I don't think it's a good idea but as you insist I created and added that label.

Makefile

openqabot/types/aggregate.py

tests/test_aggregate.py

mergify · 2024-01-31T12:27:58Z

This pull request is now in conflicts. Could you fix it? 🙏

openqabot/types/incidents.py

Martchus · 2024-01-31T15:07:19Z

Makefile

+# devel: environment
+#   maybe use Makefile.VENV instead to get a shell with virtualenv
+# 	# we need to detect what shell we are using
+# 	shell=$$(basename $$SHELL); \
+# 	echo "Activating virtualenv for $$shell"; \
+# 	. $(VENV) && \
+# 	exec $($(SHELL))


Maybe keep it on a different branch for now?

Not really, updated the comment though; Not adding another TODO, to avoid causing a heart attack to @okurz

Actually this might be worse than a TODO as it's dead/disabled code and nobody will know why it's not enabled. I recommend you just remove that

@okurz thank you for the recommendation, however same comment applies:

#154 (comment)

It would make sense to at least state why this code has been disabled; e.g. why it is not good/useful enough for general use and in what situations it would make sense to use it nevertheless. The comments

# Developers have bad memory, so we need to remind them to activate the virtualenv # maybe use Makefile.VENV instead to get a shell with virtualenv

don't make that clear to me at all.

Additionally, also if that code was not commented-out I'd frankly struggle to make sense of its intended use and purpose. So that should probably be clarified anyway.

Additionally, also if that code was not commented-out I'd frankly struggle to make sense of its intended use and purpose. So that should probably be clarified anyway.

Good point, updated the comment

openqabot/types/incident.py

foursixnine · 2024-01-31T21:52:39Z

tests/test_incident.py

+            {"status": "passed", "job_id": 1},
+            {"status": "failed", "job_id": 1777},  # Accept the turk
+            {
+                "status": "softfailed",
+                "job_id": 2020,
+            },  # 2020 is the genesys of dark fate
+            {"status": "failed", "job_id": 2042},  # This one has a dark fate
+            {"status": "passed", "job_id": 3},


Python linting can be ugly at times 🗡️

mimi1vx · 2024-02-07T22:17:12Z

openqabot/types/incident.py

+#   - remove almost duplicated code from Approver.is_job_marked_acceptable_for_incident
+#   as approver does not seem to operate over incidents
+#   about the TODO see discussion at https://github.com/openSUSE/qem-bot/pull/154#discussion_r1472721681
+@staticmethod


static method of what ??

Of class Incident is suppose. I'm not that familiar with Python so I'm wondering what are you getting at. Can you provide a concrete suggestion?

OF which class? look at place whole has_ignored_comment is a function not a method .. and isn't part of any class

In perl class is usualy whole file , In python identaton and place matter :D

mimi1vx · 2024-02-07T22:22:54Z

openqabot/types/incident.py

+                )
+
+        if not results:
+            raise NoResultsError(


is this exception anywhere caught? and resolved?, btw aggregates could be scheduled before any results are available ..

mimi1vx

few questions ..

mergify · 2024-02-08T16:32:21Z

This pull request is now in conflicts. Could you fix it? 🙏

The code used by approver doesn't seem to use Incidents class and a rewrite at this point has less benefit than simply extracting the duplicated regular expression. A TODO has been left in place to keep track for subsequent PRs steming from discussion in [1] [1] openSUSE#154 (comment)

foursixnine · 2024-02-27T16:21:38Z

@okurz as agreed last week, here is the Pull request to deploy Forbid failing incidents from being scheduled in aggregates #154 which would have helped with the sudo update of yesterday

Martchus

There are still pending questions. If that was an opt-in we would be able to merge it more easily (without everything being perfect but we could try it out in production without changing the deployment).

Martchus · 2024-03-04T12:38:23Z

openqabot/types/incident.py

+#   - remove almost duplicated code from Approver.is_job_marked_acceptable_for_incident
+#   as approver does not seem to operate over incidents
+#   about the TODO see discussion at https://github.com/openSUSE/qem-bot/pull/154#discussion_r1472721681
+@staticmethod


Of class Incident is suppose. I'm not that familiar with Python so I'm wondering what are you getting at. Can you provide a concrete suggestion?

Martchus · 2024-03-04T12:39:33Z

openqabot/types/incident.py

+# TODO:
+#   - move to utils.py or a better place
+#   - remove almost duplicated code from Approver.is_job_marked_acceptable_for_incident


These two points don't seem to hard to implement. Am I overlooking something or can we maybe just do them before merging this PR?

can we maybe just do them before merging this PR?

I'll leave it for a follow-up when addressing the rest of the changes, as that would need a bigger refactor, due to the incidents class not being used in the approver. thingie.

Martchus · 2024-03-04T12:47:50Z

openqabot/types/incident.py

+    for comment in ret:
+        if regex.match(comment["text"]):
+            # leave comment for future debugging purposes
+            # log.debug("matched comment incident %s: with comment %s", inc, comment)


In fact, after last changes, this is not necessary anymore so I dropped them

It looks like the current version on GitHub still has the disabled line.

Martchus · 2024-03-05T10:25:55Z

@Mergifyio rebase

Let the build system die if any errors are found, this is intended for local development only.

Leave early when filtering incidents to schedule, as incidents that have failures don't need further processing. Adjust tests accordingly fixing that ugly off by one

- While incidents are less likely to have an exception comment, there are cases where a failing aggregate from day before, might impact an incident on its own to be scheduled - We want to accept only passed results without questioning, anything else, will need to have an acceptable_for, following the discussion in [1] [1] https://github.com/openSUSE/qem-bot/pull/154/files#r1474042954

The code used by approver doesn't seem to use Incidents class and a rewrite at this point has less benefit than simply extracting the duplicated regular expression. A TODO has been left in place to keep track for subsequent PRs steming from discussion in [1] [1] openSUSE#154 (comment)

qem-bot's data is normalized, so either passed or failed.

mergify · 2024-03-05T10:27:34Z

rebase

✅ Branch has been successfully rebased

okurz · 2024-03-06T13:46:09Z

I also called the application locally but found no relevant logs are output:

./bot-ng.py --configs metadata --singlearch metadata/bot-ng/singlearch.yml -t 1234 --debug --dry updates-run

Possibly the relevant steps are not executed due to dry-run or something else preventing the evaluation of what products to trigger:

2024-03-06 12:34:58 INFO     Bot schedule starts now
2024-03-06 12:34:58 INFO     Project SUSE:Maintenance:17818 has empty channels - check incident in SMELT
2024-03-06 12:34:58 INFO     Project SUSE:Maintenance:17958 has empty channels - check incident in SMELT
…
2024-03-06 12:35:28 INFO     Project SUSE:Maintenance:18479 can't calculate repohash  .. skipping
2024-03-06 12:35:28 INFO     Project SUSE:Maintenance:18485 has empty channels - check incident in SMELT
…
2024-03-06 12:35:59 INFO     Project SUSE:Maintenance:19102 can't calculate repohash  .. skipping
2024-03-06 12:35:59 INFO     Project SUSE:Maintenance:24734 has empty channels - check incident in SMELT
…
2024-03-06 12:36:31 INFO     Project SUSE:Maintenance:28369 can't calculate repohash  .. skipping
…
2024-03-06 12:37:02 INFO     Project SUSE:Maintenance:28667 can't calculate repohash  .. skipping
…
2024-03-06 12:37:32 INFO     Project SUSE:Maintenance:28784 can't calculate repohash  .. skipping
2024-03-06 12:37:32 INFO     Project SUSE:Maintenance:29248 has empty packages - check incident in SMELT
2024-03-06 12:37:32 INFO     Project SUSE:Maintenance:30071 has empty channels - check incident in SMELT
2024-03-06 12:37:41 INFO     Project SUSE:Maintenance:31645 has empty channels - check incident in SMELT
2024-03-06 12:37:42 INFO     Project SUSE:Maintenance:32086 has empty channels - check incident in SMELT
2024-03-06 12:37:59 INFO     Project SUSE:Maintenance:32288 has empty channels - check incident in SMELT
2024-03-06 12:38:09 INFO     Project SUSE:Maintenance:32462 has empty channels - check incident in SMELT
2024-03-06 12:38:33 INFO     Project SUSE:Maintenance:32613 has empty channels - check incident in SMELT
2024-03-06 12:39:13 INFO     Project SUSE:Maintenance:32782 has empty channels - check incident in SMELT
2024-03-06 12:39:33 INFO     Project SUSE:Maintenance:32808 has empty channels - check incident in SMELT
2024-03-06 12:39:43 INFO     Project SUSE:Maintenance:32824 has empty channels - check incident in SMELT
2024-03-06 12:40:16 INFO     Project SUSE:Maintenance:32877 has empty channels - check incident in SMELT
…
2024-03-06 12:40:47 INFO     Project SUSE:Maintenance:32879 can't calculate repohash  .. skipping
2024-03-06 12:40:47 INFO     … incidents loaded from qem dashboard
2024-03-06 12:40:47 DEBUG    Skipping invalid config metadata/.gitlab-ci.yml
2024-03-06 12:40:47 DEBUG    Skipping invalid config metadata/products.yml
2024-03-06 12:40:47 INFO     Starting bot mainloop
2024-03-06 12:40:47 INFO     Would trigger 0 products in openQA
2024-03-06 12:40:47 INFO     End of bot run

EDIT: I also called

for i in full-run incidents-run updates-run inc-approve inc-sync-results aggr-sync-results; do echo "### $i" && ./bot-ng.py --configs metadata --singlearch metadata/bot-ng/singlearch.yml -t 1234 --debug --dry $i; done 2>&1 | tee qem_bot_dry_run-master-$(date -Is).log && hub pr checkout 154 && for i in full-run incidents-run updates-run inc-approve inc-sync-results aggr-sync-results; do echo "### $i" && ./bot-ng.py --configs metadata --singlearch metadata/bot-ng/singlearch.yml -t 1234 --debug --dry $i; done 2>&1 | tee qem_bot_dry_run-pr154-$(date -Is).log

and compared both output logs to see if there is any reasonable difference. "inc-sync-results" provide a way too huge list of results to process, inc-approve shows a lot of difference but due to changed realtime results, not related to this pull request. The relevant commands (if at all) are "full-run incidents-run updates-run" and there are no differences at all in the output (except for timestamps) meaning what I stated in before: Possibly the relevant steps are not executed due to dry-run or something else preventing the evaluation of what products to trigger and further more significant reverse-engineering would be necessary to change that.

mergify · 2024-04-16T12:20:57Z

This pull request is now in conflicts. Could you fix it? 🙏

Sparked from discussion in: openSUSE/qem-bot#154 (review)

okurz requested changes Jan 30, 2024

View reviewed changes

openqabot/types/incident.py Outdated Show resolved Hide resolved

openqabot/types/incident.py Outdated Show resolved Hide resolved

openqabot/types/aggregate.py Outdated Show resolved Hide resolved

foursixnine force-pushed the veteolvidamicaramicasaminombreypegalavuelta branch 2 times, most recently from 0d80e14 to 851afb1 Compare January 30, 2024 07:25

okurz requested changes Jan 30, 2024

View reviewed changes

openqabot/types/incident.py Outdated Show resolved Hide resolved

openqabot/types/incident.py Outdated Show resolved Hide resolved

openqabot/types/aggregate.py Outdated Show resolved Hide resolved

Martchus requested changes Jan 30, 2024

View reviewed changes

tests/test_incident.py Outdated Show resolved Hide resolved

okurz requested changes Jan 31, 2024

View reviewed changes

openqabot/types/incident.py Outdated Show resolved Hide resolved

tests/test_incident.py Show resolved Hide resolved

foursixnine force-pushed the veteolvidamicaramicasaminombreypegalavuelta branch from 51e7792 to 091b1cf Compare January 31, 2024 10:11

okurz added the ai-assisted label Jan 31, 2024

okurz reviewed Jan 31, 2024

View reviewed changes

Makefile Outdated Show resolved Hide resolved

Makefile Outdated Show resolved Hide resolved

openqabot/types/aggregate.py Show resolved Hide resolved

tests/test_aggregate.py Outdated Show resolved Hide resolved

foursixnine force-pushed the veteolvidamicaramicasaminombreypegalavuelta branch from ccb9ef0 to 5622827 Compare January 31, 2024 11:49

foursixnine force-pushed the veteolvidamicaramicasaminombreypegalavuelta branch from 8f2a737 to d436336 Compare January 31, 2024 12:37

foursixnine requested review from okurz and Martchus January 31, 2024 12:41

okurz reviewed Jan 31, 2024

View reviewed changes

openqabot/types/incidents.py Show resolved Hide resolved

foursixnine requested review from perlpunk and okurz January 31, 2024 13:54

Martchus reviewed Jan 31, 2024

View reviewed changes

foursixnine force-pushed the veteolvidamicaramicasaminombreypegalavuelta branch 2 times, most recently from ffb63a0 to 905101f Compare January 31, 2024 21:51

foursixnine commented Jan 31, 2024

View reviewed changes

foursixnine force-pushed the veteolvidamicaramicasaminombreypegalavuelta branch from 905101f to b194cd7 Compare January 31, 2024 21:56

foursixnine requested a review from Martchus January 31, 2024 21:58

Martchus approved these changes Jan 31, 2024

View reviewed changes

mimi1vx reviewed Feb 7, 2024

View reviewed changes

mimi1vx suggested changes Feb 7, 2024

View reviewed changes

foursixnine force-pushed the veteolvidamicaramicasaminombreypegalavuelta branch from e551352 to 5c9d366 Compare February 27, 2024 16:20

Martchus requested changes Mar 4, 2024

View reviewed changes

foursixnine and others added 16 commits March 5, 2024 10:27

Forbid failing incidents from being scheduled in aggregates

df781d1

Add test for Incidents.has_failures()

b0b104a

Be lazy and setup the environment via Makefile

d2d50ab

Let the build system die if any errors are found, this is intended for local development only.

Add a bit of debugging help to Aggregate object

6cbf0da

Improve logging and test flow

43c19fd

Adjust comments to match the code

593e320

Fix bug in initial implementation of has_failures

335bfb8

Leave early when filtering incidents to schedule, as incidents that have failures don't need further processing. Adjust tests accordingly fixing that ugly off by one

Install all of the requirements properly

6c51898

Cosmetic fixes to make the tests pass

3a5d974

Make Rename venv and make target

6e8ac98

Update commented make target to be explicit

9ae8960

Fix VENV variable in Makefile

44ad49f

Update tests to new changes introduced to 62cb3d2

d5b05e2

qem-bot's data is normalized, so either passed or failed.

Be explicit about the TODO

36ed6db

Martchus force-pushed the veteolvidamicaramicasaminombreypegalavuelta branch from 5c9d366 to 36ed6db Compare March 5, 2024 10:27

dzedro pushed a commit to dzedro/os-autoinst-distri-opensuse that referenced this pull request Jun 10, 2024

Update suggestion about dead code

dd01ba6

Sparked from discussion in: openSUSE/qem-bot#154 (review)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Forbid failing incidents from being scheduled in aggregates #154

Forbid failing incidents from being scheduled in aggregates #154

foursixnine commented Jan 30, 2024 •

edited

Loading

codecov-commenter commented Jan 30, 2024 •

edited

Loading

foursixnine commented Jan 30, 2024

foursixnine commented Jan 30, 2024

Martchus commented Jan 30, 2024

foursixnine commented Jan 30, 2024

Martchus left a comment

okurz commented Jan 31, 2024

mergify bot commented Jan 31, 2024

Martchus Jan 31, 2024

foursixnine Jan 31, 2024

okurz Feb 1, 2024

foursixnine Feb 1, 2024

Martchus Feb 2, 2024

foursixnine Feb 2, 2024

foursixnine Jan 31, 2024

mimi1vx Feb 7, 2024

Martchus Mar 4, 2024

mimi1vx Mar 5, 2024 •

edited

Loading

mimi1vx Feb 7, 2024

mimi1vx left a comment

mergify bot commented Feb 8, 2024

foursixnine commented Feb 27, 2024 •

edited

Loading

Martchus left a comment

Martchus Mar 4, 2024

Martchus Mar 4, 2024

foursixnine Mar 5, 2024

Martchus Mar 4, 2024

Martchus commented Mar 5, 2024 •

edited

Loading

mergify bot commented Mar 5, 2024

okurz commented Mar 6, 2024 •

edited

Loading

mergify bot commented Apr 16, 2024

Forbid failing incidents from being scheduled in aggregates #154

Are you sure you want to change the base?

Forbid failing incidents from being scheduled in aggregates #154

Conversation

foursixnine commented Jan 30, 2024 • edited Loading

codecov-commenter commented Jan 30, 2024 • edited Loading

Codecov Report

foursixnine commented Jan 30, 2024

foursixnine commented Jan 30, 2024

Martchus commented Jan 30, 2024

foursixnine commented Jan 30, 2024

Martchus left a comment

Choose a reason for hiding this comment

okurz commented Jan 31, 2024

mergify bot commented Jan 31, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mimi1vx Mar 5, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mimi1vx left a comment

Choose a reason for hiding this comment

mergify bot commented Feb 8, 2024

foursixnine commented Feb 27, 2024 • edited Loading

Martchus left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Martchus commented Mar 5, 2024 • edited Loading

mergify bot commented Mar 5, 2024

✅ Branch has been successfully rebased

okurz commented Mar 6, 2024 • edited Loading

mergify bot commented Apr 16, 2024

foursixnine commented Jan 30, 2024 •

edited

Loading

codecov-commenter commented Jan 30, 2024 •

edited

Loading

mimi1vx Mar 5, 2024 •

edited

Loading

foursixnine commented Feb 27, 2024 •

edited

Loading

Martchus commented Mar 5, 2024 •

edited

Loading

okurz commented Mar 6, 2024 •

edited

Loading