Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

double memory to fix OOM #60137

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

qiliRedHat
Copy link
Contributor

@qiliRedHat qiliRedHat commented Dec 24, 2024

Test job failed because gather-must-gather and gather-extra steps OOM.
gather-must-gather step failed

  * could not run steps: step udn-density-l3-24nodes failed: "udn-density-l3-24nodes" post steps failed: ["udn-density-l3-24nodes" pod "udn-density-l3-24nodes-gather-must-gather" failed: could not watch pod: the pod ci-op-wwhp33j8/udn-density-l3-24nodes-gather-must-gather failed after 3m56s (failed containers: sidecar): ContainerFailed one or more containers exited
Container sidecar exited with code 137, reason OOMKilled

gather-extra step failed

Link to job on registry info site: https://steps.ci.openshift.org/job?org=openshift-qe&repo=XXXXXX-XXXXXXXXX-ci&branch=main&test=udn-density-l3-24nodes&variant=aws-4.18-nightly-x86, "udn-density-l3-24nodes" pod "udn-density-l3-24nodes-gather-extra" failed: could not watch pod: the pod ci-op-wwhp33j8/udn-density-l3-24nodes-gather-extra failed after 3m22s (failed containers: sidecar): ContainerFailed one or more containers exited
Container sidecar exited with code 137, reason OOMKilled

This pr double the memory to see if the issue can be fixed.

@openshift-ci openshift-ci bot requested review from neisw and xueqzhan December 24, 2024 04:17
Copy link
Contributor

openshift-ci bot commented Dec 24, 2024

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: qiliRedHat
Once this PR has been reviewed and has the lgtm label, please assign vrutkovs for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot
Copy link
Contributor

[REHEARSALNOTIFIER]
@qiliRedHat: the pj-rehearse plugin accommodates running rehearsal tests for the changes in this PR. Expand 'Interacting with pj-rehearse' for usage details. The following rehearsable tests have been affected by this change:

Test name Repo Type Reason
pull-ci-openshift-assisted-service-master-e2e-agent-compact-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.20-e2e-agent-compact-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.19-e2e-agent-compact-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.18-e2e-agent-compact-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.17-e2e-agent-compact-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.16-e2e-agent-compact-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.15-e2e-agent-compact-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.14-e2e-agent-compact-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.13-e2e-agent-compact-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.12-e2e-agent-compact-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-master-e2e-agent-ha-dualstack openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-master-e2e-agent-4control-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-master-e2e-agent-5control-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.20-e2e-agent-ha-dualstack openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.20-e2e-agent-4control-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.20-e2e-agent-5control-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.19-e2e-agent-ha-dualstack openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.19-e2e-agent-4control-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.19-e2e-agent-5control-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.18-e2e-agent-ha-dualstack openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.18-e2e-agent-4control-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.18-e2e-agent-5control-ipv4 openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.17-e2e-agent-ha-dualstack openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.16-e2e-agent-ha-dualstack openshift/assisted-service presubmit Registry content changed
pull-ci-openshift-assisted-service-release-4.15-e2e-agent-ha-dualstack openshift/assisted-service presubmit Registry content changed

A total of 27563 jobs have been affected by this change. The above listing is non-exhaustive and limited to 25 jobs.

A full list of affected jobs can be found here
Prior to this PR being merged, you will need to either run and acknowledge or opt to skip these rehearsals.

Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 5 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse {test-name}, with each test separated by a space, to run one or more specific rehearsals
Comment: /pj-rehearse more to run up to 10 rehearsals
Comment: /pj-rehearse max to run up to 25 rehearsals
Comment: /pj-rehearse auto-ack to run up to 5 rehearsals, and add the rehearsals-ack label on success
Comment: /pj-rehearse abort to abort all active rehearsals
Comment: /pj-rehearse network-access-allowed to allow rehearsals of tests that have the restrict_network_access field set to false. This must be executed by an openshift org member who is not the PR author

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals.
If you would like the rehearsals-ack label removed, comment: /pj-rehearse reject to re-block merging.

@qiliRedHat
Copy link
Contributor Author

/pj-rehearse periodic-ci-openshift-qe-ocp-qe-perfscale-ci-main-aws-4.18-nightly-x86-udn-density-l3-24nodes

@openshift-ci-robot
Copy link
Contributor

@qiliRedHat: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

Copy link
Contributor

openshift-ci bot commented Dec 24, 2024

@qiliRedHat: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/rehearse/periodic-ci-openshift-qe-ocp-qe-perfscale-ci-main-aws-4.18-nightly-x86-udn-density-l3-24nodes a1d6e83 link unknown /pj-rehearse periodic-ci-openshift-qe-ocp-qe-perfscale-ci-main-aws-4.18-nightly-x86-udn-density-l3-24nodes

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants