Skip to content
This repository has been archived by the owner on Jan 11, 2023. It is now read-only.

Sync up health monitoring #3817

Merged
merged 4 commits into from
Sep 7, 2018

Conversation

mboersma
Copy link
Member

@mboersma mboersma commented Sep 7, 2018

What this PR does / why we need it:

Enables kubelet monitoring and restarting with Kubernetes' health-monitor.sh script. Syncs up the acs-engine implementation with that being used on AKS infrastructure.

We saw some CSE 30 (Kubernetes can't start) errors in CI for this PR, so let's disable the kubelet-monitor again and leave the other changes intact while we do more testing.

cc: @juan-lee @jwilder

Special notes for your reviewer:

I tested this with a new acs-engine cluster to make sure units landed in the right places and everything starts up correctly.

If applicable:

  • documentation
  • unit tests
  • tested backward compatibility (ie. deploy with previous version, upgrade with this branch)

Release note:

Kubernetes: add kubelet health monitoring

@mboersma mboersma self-assigned this Sep 7, 2018
@ghost ghost added the in progress label Sep 7, 2018
@jackfrancis
Copy link
Member

/lgtm

@juan-lee @seanknox could you kindly review?

@codecov
Copy link

codecov bot commented Sep 7, 2018

Codecov Report

❗ No coverage uploaded for pull request base (master@849e5fd). Click here to learn what that means.
The diff coverage is n/a.

@@            Coverage Diff            @@
##             master    #3817   +/-   ##
=========================================
  Coverage          ?   55.67%           
=========================================
  Files             ?      109           
  Lines             ?    16246           
  Branches          ?        0           
=========================================
  Hits              ?     9045           
  Misses            ?     6411           
  Partials          ?      790

@acs-bot
Copy link

acs-bot commented Sep 7, 2018

New changes are detected. LGTM label has been removed.

@acs-bot acs-bot removed the lgtm label Sep 7, 2018
@mboersma mboersma changed the title Enable kubelet health monitor Sync up health monitoring Sep 7, 2018
@jackfrancis
Copy link
Member

/lgtm pending E2E

@acs-bot
Copy link

acs-bot commented Sep 7, 2018

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jackfrancis, mboersma

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [jackfrancis,mboersma]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@jackfrancis jackfrancis merged commit 69b6614 into Azure:master Sep 7, 2018
@ghost ghost removed the in progress label Sep 7, 2018
Copy link

@jwilder jwilder left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

@mboersma mboersma deleted the enable-kubelet-health-monitor branch September 10, 2018 15:44
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants