fix: terminate goroutines gracefully #21

starbops · 2024-02-04T13:10:07Z

IMPORTANT: Please do not create a Pull Request without creating an issue first.

Problem:

The context is not propagated appropriately. Goroutines do not work together correctly if some of them encounter failure.

Solution:

Passing down context and grouping goroutines to make them clean up resources after receiving the interrupt/terminate signal.

Related Issue:

harvester/harvester#5072

Test plan:

Create an IPPool object (please adjust the content according to your environment setup).

cat <<EOF | kubectl apply -f -
apiVersion: network.harvesterhci.io/v1alpha1
kind: IPPool
metadata:
  name: net-48
  namespace: default
spec:
  ipv4Config:
    serverIP: 192.168.48.77
    cidr: 192.168.48.0/24
    pool:
      start: 192.168.48.81
      end: 192.168.48.90
    router: 192.168.48.1
  networkName: default/net-48
EOF

Wait a few seconds for the agent Pod to become ready. Monitor the log (keep it opened and start a new shell for the next step):

$ kubectl -n harvester-system logs default-net-48-agent -f
Defaulted container "agent" out of: agent, ip-setter (init)
time="2024-02-04T13:35:12Z" level=info msg="Starting VM DHCP Agent: default-net-48-agent"
time="2024-02-04T13:35:12Z" level=info msg="Starting HTTP server"
time="2024-02-04T13:35:12Z" level=info msg="Listening on port: 8080"
time="2024-02-04T13:35:12Z" level=info msg="monitor ippool default/net-48"
time="2024-02-04T13:35:12Z" level=info msg="(dhcp.Run) starting DHCP service on nic eth1"
time="2024-02-04T13:35:12Z" level=info msg="(eventhandler.EventListener) starting IPPool event listener"
time="2024-02-04T13:35:12Z" level=info msg="(controller.Run) starting IPPool controller"
time="2024-02-04T13:35:33Z" level=info msg="(controller.sync) UPDATE default/net-48"

Remove the IPPool object to trigger agent Pod teardown.

kubectl delete ippools.network.harvesterhci.io net-48

Go back to the log monitor. The teardown logs should look like the following:

$ kubectl -n harvester-system logs default-net-48-agent -f
...
time="2024-02-05T08:29:04Z" level=info msg="(controller.sync) UPDATE default/net-48"
time="2024-02-05T08:29:04Z" level=info msg="Stopping HTTP server"
time="2024-02-05T08:29:04Z" level=info msg="(eventhandler.Stop) stopping IPPool event listener"
time="2024-02-05T08:29:04Z" level=info msg="(controller.Stop) stopping IPPool controller"
time="2024-02-05T08:29:04Z" level=info msg="(eventhandler.Run) IPPool event listener terminated"
time="2024-02-05T08:29:04Z" level=info msg="(dhcp.Stop) stopping DHCP service on nic eth1"
http: Server closed

Signed-off-by: Zespre Chang <zespre.chang@suse.com>

cmd/agent/run.go

Signed-off-by: Zespre Chang <zespre.chang@suse.com>

cmd/controller/run.go

cmd/agent/run.go

Yu-Jack

If my thought is correct, it might be a issue here, please take a look my comments in sequence.

pkg/agent/agent.go

pkg/agent/ippool/event.go

pkg/agent/ippool/controller.go

Yu-Jack

Just a NIT, I will approve it if you're not planning to change.

pkg/agent/ippool/event.go

Signed-off-by: Zespre Chang <zespre.chang@suse.com> Co-authored-by: Jack Yu <jack.yu@suse.com>

Yu-Jack

LGTM, thanks for that.

w13915984028

LGTM, thanks.

fix: terminate goroutines gracefully

40b34d8

Signed-off-by: Zespre Chang <zespre.chang@suse.com>

starbops marked this pull request as ready for review February 5, 2024 02:44

starbops requested review from bk201, Yu-Jack and w13915984028 February 5, 2024 02:46

Yu-Jack reviewed Feb 5, 2024

View reviewed changes

cmd/agent/run.go Outdated Show resolved Hide resolved

cmd/agent/run.go Outdated Show resolved Hide resolved

fix: leverage signals.SetupSignalContext()

2292a53

Signed-off-by: Zespre Chang <zespre.chang@suse.com>

Yu-Jack reviewed Feb 5, 2024

View reviewed changes

cmd/controller/run.go Outdated Show resolved Hide resolved

starbops force-pushed the fix-context branch from 3b2e500 to 86589d4 Compare February 5, 2024 14:07

starbops requested a review from Yu-Jack February 5, 2024 14:08

Yu-Jack reviewed Feb 6, 2024

View reviewed changes

cmd/agent/run.go Outdated Show resolved Hide resolved

starbops force-pushed the fix-context branch from 86589d4 to d4fbb4a Compare February 6, 2024 04:08

Yu-Jack reviewed Feb 6, 2024

View reviewed changes

pkg/agent/agent.go Outdated Show resolved Hide resolved

pkg/agent/ippool/event.go Outdated Show resolved Hide resolved

pkg/agent/ippool/controller.go Outdated Show resolved Hide resolved

Yu-Jack reviewed Feb 6, 2024

View reviewed changes

pkg/agent/ippool/event.go Outdated Show resolved Hide resolved

starbops force-pushed the fix-context branch from d4fbb4a to 1303be3 Compare February 6, 2024 10:22

Yu-Jack reviewed Feb 6, 2024

View reviewed changes

pkg/agent/ippool/event.go Outdated Show resolved Hide resolved

fix: handle normal teardown procedure

e3f380d

Signed-off-by: Zespre Chang <zespre.chang@suse.com> Co-authored-by: Jack Yu <jack.yu@suse.com>

starbops force-pushed the fix-context branch from 1303be3 to e3f380d Compare February 7, 2024 00:34

Yu-Jack approved these changes Feb 7, 2024

View reviewed changes

w13915984028 approved these changes Feb 7, 2024

View reviewed changes

starbops merged commit 88e708b into harvester:main Feb 7, 2024
5 checks passed

starbops mentioned this pull request Feb 19, 2024

[BUG] Potential bugs on vm-dhcp-controller harvester/harvester#5072

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: terminate goroutines gracefully #21

fix: terminate goroutines gracefully #21

starbops commented Feb 4, 2024 •

edited

Loading

Yu-Jack left a comment •

edited

Loading

Yu-Jack left a comment •

edited

Loading

Yu-Jack left a comment

w13915984028 left a comment

fix: terminate goroutines gracefully #21

fix: terminate goroutines gracefully #21

Conversation

starbops commented Feb 4, 2024 • edited Loading

Yu-Jack left a comment • edited Loading

Choose a reason for hiding this comment

Yu-Jack left a comment • edited Loading

Choose a reason for hiding this comment

Yu-Jack left a comment

Choose a reason for hiding this comment

w13915984028 left a comment

Choose a reason for hiding this comment

starbops commented Feb 4, 2024 •

edited

Loading

Yu-Jack left a comment •

edited

Loading

Yu-Jack left a comment •

edited

Loading