WIP: Add special handling for critical customer #301

Open
wants to merge 3 commits into base: main

RaphaelBut
Contributor

We no longer post LimitedSupport if the target cluster belongs to an organization with the managed_critical_customer capability. Instead, we redirect SRE to take additional steps for those clusters.

See https://issues.redhat.com/browse/OSD-24126
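
As a rough illustration of the behavior described above, the capability check could gate the limited-support path as in the following sketch. Only the capability name `managed_critical_customer` comes from this PR; the `Organization` type, `IsCriticalCustomer`, and `NextStep` are hypothetical names for illustration and are not the code in this change.

```go
package main

import "fmt"

// criticalCustomerCapability is the capability name from the PR description.
const criticalCustomerCapability = "managed_critical_customer"

// Organization is a hypothetical stand-in for the organization data
// returned by OCM; the real type is not shown in this conversation.
type Organization struct {
	Name         string
	Capabilities map[string]bool
}

// IsCriticalCustomer reports whether the organization carries the capability.
// Reading from a nil map is safe in Go and yields false.
func IsCriticalCustomer(org Organization) bool {
	return org.Capabilities[criticalCustomerCapability]
}

// NextStep describes what the investigation would do for a cluster
// owned by the given organization.
func NextStep(org Organization) string {
	if IsCriticalCustomer(org) {
		return "escalate to SRE: open a proactive case and silence the alert"
	}
	return "post limited support reason"
}

func main() {
	critical := Organization{
		Name:         "example-critical",
		Capabilities: map[string]bool{criticalCustomerCapability: true},
	}
	regular := Organization{Name: "example-regular"}

	fmt.Println(critical.Name, "->", NextStep(critical))
	fmt.Println(regular.Name, "->", NextStep(regular))
}
```

Keeping the check in one small predicate would let both the ccam and chgm investigations share it, assuming they have access to the same organization data.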

@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jul 11, 2024
Contributor

openshift-ci bot commented Jul 11, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: RaphaelBut

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jul 11, 2024
@RaphaelBut
Contributor Author

/hold for testing the ccam path

@openshift-ci openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jul 11, 2024
@codecov-commenter

codecov-commenter commented Jul 11, 2024

Codecov Report

Attention: Patch coverage is 0% with 37 lines in your changes missing coverage. Please review.

Project coverage is 37.87%. Comparing base (c358e70) to head (ed99739).

@@            Coverage Diff             @@
##             main     #301      +/-   ##
==========================================
- Coverage   38.51%   37.87%   -0.65%     
==========================================
  Files          20       20              
  Lines        1594     1621      +27     
==========================================
  Hits          614      614              
- Misses        940      967      +27     
  Partials       40       40              
Files                              Coverage Δ
pkg/investigations/ccam/ccam.go    43.47% <0.00%> (ø)
pkg/investigations/chgm/chgm.go    62.37% <0.00%> (-0.63%) ⬇️
pkg/ocm/mock/ocmmock.go            42.66% <0.00%> (ø)
pkg/ocm/ocm.go                     0.00% <0.00%> (ø)

pkg/investigations/ccam/ccam.go Outdated
if err != nil {
return err
notes.AppendWarning("NetworkVerifier found unreachable targets, deadmanssnitch is blocked! \nUnreachable: \n%s", failureReason)
Member

Suggestion: Clarify what this means for the SRE handling it. It might be obvious to us, but DMS being blocked might not automatically ring a bell to everyone.
Maybe something mentioning that DMS is blocked causing this alert and next steps to take?

Contributor Author

Nice catch, I copied this mindlessly.
I think it might be best to just remove the mention of DMS, as it's not relevant to the workflow here.
It would look like this, wdyt?

🤖 Automated CHGM pre-investigation 🤖

✅ Customer did not stop nodes.
⚠️ NetworkVerifier found unreachable targets!
Unreachable:
nosnch.in:443
⚠️ Cannot post limited support on a critical customer's cluster. Please open a proactive case and silence the alert.
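
For readers unfamiliar with how a note like the one above is assembled, here is a minimal sketch. The diff only shows a call to `notes.AppendWarning(format, args...)`; the `NoteWriter` type, its constructor, and `AppendSuccess` below are assumptions made for illustration and may differ from the project's actual notes helper.

```go
package main

import (
	"fmt"
	"strings"
)

// NoteWriter is a hypothetical helper that collects investigation findings
// into a single text note. It is not the project's real implementation.
type NoteWriter struct {
	sb strings.Builder
}

// NewNoteWriter starts a note with a header naming the investigation.
func NewNoteWriter(investigation string) *NoteWriter {
	n := &NoteWriter{}
	fmt.Fprintf(&n.sb, "🤖 Automated %s pre-investigation 🤖\n\n", investigation)
	return n
}

// AppendSuccess adds a line marking a check that passed.
func (n *NoteWriter) AppendSuccess(format string, args ...interface{}) {
	fmt.Fprintf(&n.sb, "✅ %s\n", fmt.Sprintf(format, args...))
}

// AppendWarning adds a line marking a finding that needs attention.
func (n *NoteWriter) AppendWarning(format string, args ...interface{}) {
	fmt.Fprintf(&n.sb, "⚠️ %s\n", fmt.Sprintf(format, args...))
}

// String returns the assembled note.
func (n *NoteWriter) String() string {
	return n.sb.String()
}

func main() {
	notes := NewNoteWriter("CHGM")
	notes.AppendSuccess("Customer did not stop nodes.")
	notes.AppendWarning("NetworkVerifier found unreachable targets!\nUnreachable:\n%s", "nosnch.in:443")
	notes.AppendWarning("Cannot post limited support on a critical customer's cluster. Please open a proactive case and silence the alert.")
	fmt.Print(notes.String())
}
```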

pkg/investigations/chgm/chgm.go Outdated
@@ -224,6 +233,33 @@ func (c *SdkClient) PostLimitedSupportReason(limitedSupportReason *LimitedSuppor
return nil
}

Member

I wonder if we can make this cleaner. An error is probably not the ideal way to convey an expected failure.

Contributor Author

Same :D Let me see...
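
One possible way to address the concern raised in this thread is to return an explicit outcome value alongside the error, so that "skipped because the customer is critical" is an expected result rather than a failure. The sketch below is only an assumption about how that could look; `PostOutcome`, `Cluster`, and `PostLimitedSupport` are hypothetical names and do not reflect the signature of `PostLimitedSupportReason` in this repository.

```go
package main

import (
	"errors"
	"fmt"
)

// PostOutcome distinguishes expected results from real failures.
type PostOutcome int

const (
	NotPosted               PostOutcome = iota // zero value, used when an error occurred
	Posted                                     // limited support reason was posted
	SkippedCriticalCustomer                    // expected skip, not an error
)

// Cluster is a hypothetical, simplified view of the target cluster.
type Cluster struct {
	ID                 string
	CriticalCustomer   bool
	SimulateAPIFailure bool // illustration only: forces the error path
}

var errAPIUnavailable = errors.New("limited support API unavailable")

// PostLimitedSupport reserves the error for unexpected failures and reports
// the critical-customer case through the outcome value instead.
func PostLimitedSupport(c Cluster) (PostOutcome, error) {
	if c.CriticalCustomer {
		return SkippedCriticalCustomer, nil
	}
	if c.SimulateAPIFailure {
		return NotPosted, fmt.Errorf("cluster %s: %w", c.ID, errAPIUnavailable)
	}
	return Posted, nil
}

func main() {
	outcome, err := PostLimitedSupport(Cluster{ID: "example", CriticalCustomer: true})
	if err != nil {
		fmt.Println("unexpected failure:", err)
		return
	}
	switch outcome {
	case SkippedCriticalCustomer:
		fmt.Println("critical customer: open a proactive case and silence the alert")
	case Posted:
		fmt.Println("limited support posted")
	}
}
```

With this shape the caller handles the critical-customer branch in a `switch` and keeps `if err != nil` for genuine failures. A sentinel error checked with `errors.Is` would be an alternative, but it would still route an expected case through the error path.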

RaphaelBut and others added 2 commits July 11, 2024 17:45
Co-authored-by: typeid <github@typeid.org>
Co-authored-by: typeid <github@typeid.org>
Contributor

openshift-ci bot commented Jul 11, 2024

@RaphaelBut: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name     Commit   Details  Required  Rerun command
ci/prow/lint  ed99739  link     true      /test lint

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@openshift-bot

Issues go stale after 90d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle stale

@openshift-ci openshift-ci bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 10, 2024
@openshift-merge-robot openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Oct 10, 2024
@openshift-merge-robot
Contributor

PR needs rebase.

@openshift-bot

Stale issues rot after 30d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle rotten
/remove-lifecycle stale

@openshift-ci openshift-ci bot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Nov 9, 2024
Labels
approved: Indicates a PR has been approved by an approver from all required OWNERS files.
do-not-merge/hold: Indicates that a PR should not merge because someone has issued a /hold command.
do-not-merge/work-in-progress: Indicates that a PR should not merge because it is a work in progress.
lifecycle/rotten: Denotes an issue or PR that has aged beyond stale and will be auto-closed.
needs-rebase: Indicates a PR cannot be merged because it has merge conflicts with HEAD.