Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gardener-extension-provider-openstack PODs are in CrashLoopBackOff #878

Closed
yinkun111 opened this issue Oct 17, 2024 · 2 comments · Fixed by #879
Closed

gardener-extension-provider-openstack PODs are in CrashLoopBackOff #878

yinkun111 opened this issue Oct 17, 2024 · 2 comments · Fixed by #879
Assignees
Labels
kind/bug Bug platform/openstack OpenStack platform/infrastructure status/closed Issue is closed (either delivered or triaged)

Comments

@yinkun111
Copy link

yinkun111 commented Oct 17, 2024

How to categorize this issue?

/kind bug
/platform openstack

What happened:
It is found that gardener-extension-provider-openstack PODs are in CrashLoopBackOff status in Openstack environment

The error is as following:

panic: runtime error: invalid memory address or nil pointer dereference
[signal SIGSEGV: segmentation violation code=0x1 addr=0x10 pc=0x1dfc5fb]

goroutine 1171 [running]:
github.com/gardener/gardener-extension-provider-openstack/pkg/controller/infrastructure/infraflow.(*FlowContext).ensureConfiguredNetwork(0xc00261c2c0, {0x27fbb58?, 0xc001e70cf0?})
        /go/src/github.com/gardener/gardener-extension-provider-openstack/pkg/controller/infrastructure/infraflow/reconcile.go:209 +0xfb
github.com/gardener/gardener-extension-provider-openstack/pkg/controller/infrastructure/infraflow.(*FlowContext).ensureNetwork(0x3b785c0?, {0x27fbc00?, 0xc00031a0e0?})
        /go/src/github.com/gardener/gardener-extension-provider-openstack/pkg/controller/infrastructure/infraflow/reconcile.go:196 +0x28
github.com/gardener/gardener-extension-provider-openstack/pkg/controller/infrastructure/infraflow/shared.(*BasicFlowContext).AddTask.TaskFn.Timeout.func1({0x27fbb58?, 0xc001e70cf0?})
        /go/pkg/mod/github.com/gardener/gardener@v1.97.1/pkg/utils/flow/taskfn.go:35 +0x59
github.com/gardener/gardener-extension-provider-openstack/pkg/controller/infrastructure/infraflow/shared.(*BasicFlowContext).AddTask.(*BasicFlowContext).wrapTaskFn.func2({0x27fbb58, 0xc001e70cf0})
        /go/src/github.com/gardener/gardener-extension-provider-openstack/pkg/controller/infrastructure/infraflow/shared/basic_context.go:169 +0x3ba
github.com/gardener/gardener/pkg/utils/flow.(*execution).runNode.func2()
        /go/pkg/mod/github.com/gardener/gardener@v1.97.1/pkg/utils/flow/flow.go:226 +0x165
created by github.com/gardener/gardener/pkg/utils/flow.(*execution).runNode in goroutine 714
        /go/pkg/mod/github.com/gardener/gardener@v1.97.1/pkg/utils/flow/flow.go:222 +0x4f0

The shoot cluster spawn a new machine and this machine can not get available IP from Openstack neutron network,

In this scenario, when it calls the function

https://github.com/gardener/gardener-extension-provider-openstack/blob/master/pkg/controller/infrastructure/infraflow/reconcile.go#L201-L209

then cause gardener-extension-provider-openstack stuck in "CrashLoopBackOff" status.

please investigate this issue.

please check the the full log at attachment
Full_log.log

What you expected to happen:
gardener-extension-provider-openstack works normally

How to reproduce it (as minimally and precisely as possible):
within Openstack setting, if the shoot can not get available IP can cause this issue.

Anything else we need to know?:
no

Environment:
Openstack

@gardener-robot gardener-robot added kind/bug Bug platform/openstack OpenStack platform/infrastructure labels Oct 17, 2024
@kon-angelo kon-angelo self-assigned this Oct 17, 2024
@kon-angelo
Copy link
Contributor

Hi @yinkun111,

The shoot cluster spawn a new machine and this machine can not get available IP from Openstack neutron network,

The error log here is only relevant for the infrastructure reconciliation. I do not see the connection with spawning a new machine. That being said, we should address the panic.

Can you post the infrastructureConfig of the shoot ? Was the shoot ever succesfully reconciled in the past ?

@kon-angelo
Copy link
Contributor

I think @AndreasBurger already identified the issue and will proceed with a fix.

@gardener-robot gardener-robot added the status/closed Issue is closed (either delivered or triaged) label Oct 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Bug platform/openstack OpenStack platform/infrastructure status/closed Issue is closed (either delivered or triaged)
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants