Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pods fails to terminate on Windows nodes on AKS #2538

Closed
andbess opened this issue Sep 8, 2021 · 6 comments
Closed

Pods fails to terminate on Windows nodes on AKS #2538

andbess opened this issue Sep 8, 2021 · 6 comments
Assignees
Labels
Feedback General feedback stale Stale issue

Comments

@andbess
Copy link

andbess commented Sep 8, 2021

What happened:
Sporadically some of the pods fails to de terminated on Windows nodes

What you expected to happen:
All the pods are terminated automatically

How to reproduce it:
Since the issue happens sporadically, it's hard to reproduce

Additional information:
We experience the same issue in our AKS cluster as was reported in microsoft/Windows-Containers microsoft/Windows-Containers#106, except we're not using fluentbit or fluentd logging agents (we only use Azure Monitor with combination of Grafana). Sporadically some the pods are not automatically terminated and we have to manually run "kubectl delete --force" to force terminate the pods.
Could it be that Microsoft Monitoring Agent also sometime holds the container log and that's the reason that it cannot be removed?

The error message is always the same:
Message: unable to remove filesystem for 5bbd9142e090352ac34b5c118a69f0b76efd1f79c35e50bb264aad0c3731664f: CreateFile C:\ProgramData\docker\containers\5bbd9142e090352ac34b5c118a69f0b76efd1f79c35e50bb264aad0c3731664f\5bbd9142e090352ac34b5c118a69f0b76efd1f79c35e50bb264aad0c3731664f-json.log: Access is denied.

Environment:

  • Kernel version: 10.0.17763.2114
  • OS Image: Windows Server 2019 Datacenter
  • Container runtime version: docker://20.10.6
  • Kubernetes version: 1.21.2
  • Size of cluster: 8 working Windows nodes
  • The version of Microsoft Monitoring Agent we're using on our Windows Node: 10.20.18053.0
@ghost ghost added the triage label Sep 8, 2021
@ghost
Copy link

ghost commented Sep 8, 2021

Hi andbess, AKS bot here 👋
Thank you for posting on the AKS Repo, I'll do my best to get a kind human from the AKS team to assist you.

I might be just a bot, but I'm told my suggestions are normally quite good, as such:

  1. If this case is urgent, please open a Support Request so that our 24/7 support team may help you faster.
  2. Please abide by the AKS repo Guidelines and Code of Conduct.
  3. If you're having an issue, could it be described on the AKS Troubleshooting guides or AKS Diagnostics?
  4. Make sure your subscribed to the AKS Release Notes to keep up to date with all that's new on AKS.
  5. Make sure there isn't a duplicate of this issue already reported. If there is, feel free to close this one and '+1' the existing issue.
  6. If you have a question, do take a look at our AKS FAQ. We place the most common ones there!

@ghost ghost added the action-required label Sep 10, 2021
@ghost
Copy link

ghost commented Sep 10, 2021

Triage required from @Azure/aks-pm

@ghost
Copy link

ghost commented Sep 15, 2021

Action required from @Azure/aks-pm

@ghost ghost added the Needs Attention 👋 Issues needs attention/assignee/owner label Sep 15, 2021
@ghost
Copy link

ghost commented Oct 1, 2021

Issue needing attention of @Azure/aks-leads

@EPinci EPinci added the Feedback General feedback label Oct 27, 2021
@ghost ghost removed triage action-required Needs Attention 👋 Issues needs attention/assignee/owner labels Oct 27, 2021
@ghost ghost added the stale Stale issue label Dec 26, 2021
@ghost
Copy link

ghost commented Dec 26, 2021

This issue has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs within 15 days of this comment.

@ghost ghost closed this as completed Jan 3, 2022
@ghost
Copy link

ghost commented Jan 3, 2022

This issue will now be closed because it hasn't had any activity for 7 days after stale. andbess feel free to comment again on the next 7 days to reopen or open a new issue after that time if you still have a question/issue or suggestion.

@ghost ghost locked as resolved and limited conversation to collaborators Feb 2, 2022
This issue was closed.
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Feedback General feedback stale Stale issue
Projects
None yet
Development

No branches or pull requests

3 participants