helm: logging improvements #4343

derekperkins · 2018-11-06T05:02:45Z

This PR adds log rotation via logrotate to the helm chart. By default, logrotate runs from an hourly cron job and rotates any log that has reached 100M.

It also uses the same debian:stretch-slim image to tail the error and slow logs, and adds a vitess user/group to the k8s images.
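
For illustration, a size-based rule like the one described could be generated roughly as follows. This is a sketch, not the config shipped in the PR: the log path glob, the `rotate 3` retention count, and the use of `copytruncate` are assumptions, and the function names are hypothetical. Only the 100M threshold and the hourly schedule come from the description above.

```shell
#!/bin/bash
# Sketch only. Assumptions: the log glob, `rotate 3`, and `copytruncate`.
# From the PR description: the 100M size threshold and the hourly schedule.

# Write a minimal logrotate config to the path given as $1.
write_logrotate_conf() {
  cat > "$1" <<'EOF'
/vt/vtdataroot/*/*.log {
    size 100M
    rotate 3
    copytruncate
    missingok
    notifempty
}
EOF
}

# Print a crontab line that runs logrotate at minute 0 of every hour.
# `size` is only evaluated when logrotate runs, so a file can exceed
# 100M for up to an hour before it is rotated.
hourly_cron_entry() {
  printf '0 * * * * logrotate %s\n' "$1"
}
```

Calling `write_logrotate_conf /tmp/logrotate.conf` and then `hourly_cron_entry /tmp/logrotate.conf` would produce the config file and the matching cron line. `copytruncate` is chosen here on the assumption that the process writing the log keeps its file handle open.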

@sougou - Can you create a new Docker repo for logrotate and logtail and have them auto-build like the others?


docker/k8s/Dockerfile

dkhenry

LGTM

derekperkins · 2018-11-06T11:14:26Z

There are two things still bothering me about logging:

1. I don't like that the logrotate container is running as root.
2. I don't think that the tailing containers (and maybe the logrotate container) are listening for SIGTERM, meaning that upgrades/restarts wait the entire termination deadline before executing.


derekperkins · 2018-11-07T14:12:06Z

Ok, I solved those two problems.

1. I changed the logrotate container to use a simple sleep loop instead of cron, so it can now run as the vitess user. It also traps signals, so logrotate doesn't block the pod from restarting.
2. I created a separate logtail container that does the tailing inside a bash script that traps signals, and I can confirm that this solves the issue where the pod waits out the full termination deadline before restarting. After receiving a signal, but before exiting, the logging containers run mysqladmin ping in a loop. That makes sure we don't miss any logs, especially error logs if something goes wrong during shutdown.
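
The two loops described above might look roughly like the sketch below. It is not the script from the PR: the variable names, default paths, and function names are hypothetical, and the `mysqladmin ping` connection flags are deliberately left out. The part it tries to show is the pattern itself: put the long-running child in the background and `wait` on it, because the shell only runs a trap promptly while it is blocked in `wait`, not while a foreground `sleep` or `tail` is running.

```shell
#!/bin/bash
# Sketch only: names, paths, and defaults are assumptions, not the PR's scripts.

ROTATE_CMD="${ROTATE_CMD:-logrotate /vt/logrotate.conf}"    # hypothetical path
ROTATE_INTERVAL="${ROTATE_INTERVAL:-3600}"                  # seconds between runs
PING_CMD="${PING_CMD:-mysqladmin ping}"                     # connection flags omitted
TAIL_FILEPATH="${TAIL_FILEPATH:-/vt/vtdataroot/error.log}"  # hypothetical path

stop_requested=0
on_signal() { stop_requested=1; }

# After a signal, keep the container alive until mysqld stops answering
# pings, so that log lines written during shutdown are not missed.
wait_for_mysqld_exit() {
  while $PING_CMD >/dev/null 2>&1; do
    sleep 1
  done
}

# logrotate container: rotate, then sleep until the next run or a signal.
rotate_loop() {
  trap on_signal TERM INT
  while [ "$stop_requested" -eq 0 ]; do
    $ROTATE_CMD || echo "rotation failed" >&2
    # `wait` on a background sleep returns as soon as a trapped signal arrives.
    sleep "$ROTATE_INTERVAL" &
    sleep_pid=$!
    wait "$sleep_pid" || true
    kill "$sleep_pid" 2>/dev/null || true
  done
  wait_for_mysqld_exit
}

# logtail container: stream the file to stdout until signalled.
tail_loop() {
  trap on_signal TERM INT
  tail -n +1 -F "$TAIL_FILEPATH" &
  tail_pid=$!
  # Stop looping on a signal, or if tail itself has died.
  while [ "$stop_requested" -eq 0 ] && kill -0 "$tail_pid" 2>/dev/null; do
    wait "$tail_pid" || true
  done
  wait_for_mysqld_exit
  kill "$tail_pid" 2>/dev/null || true
}
```

A container entrypoint would end with a call to `rotate_loop` or `tail_loop`. Neither loop needs root, since nothing here depends on a cron daemon.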

This has been bugging me for a year and I feel pretty good about these changes. @dkhenry - if you could take another look, I made a lot of changes after your last LGTM.

cc @hmcgonig @leoxlin @acharis @msolters @trevex

trevex · 2018-11-07T14:17:35Z

> I created another logtail container that does the tailing, but inside of a bash script that traps signals, and I can confirm that that solves the issue where the pod takes the full deadline before restarting. I have the logging containers start running mysqladmin ping in a loop after receiving a signal but before exiting. That makes sure that we don't miss any logs - especially error logs in the event that something goes wrong during shutdown.

This is a great quality of life improvement 👍

derekperkins added 4 commits November 5, 2018 14:57

docker: add vitess:vitess to all k8s images

4437d63

group=2000 + user=1000 to match existing helm vttablet deployments

Signed-off-by: Derek Perkins <derek@derekperkins.com>

helm: use debian:stretch-slim instead of busybox

36845f2

Signed-off-by: Derek Perkins <derek@derekperkins.com>

docker: add new logrotate image + logrotate.conf

1ff0558

Signed-off-by: Derek Perkins <derek@derekperkins.com>

helm: add logrotate to vttablet StatefulSet

41a47f5

Signed-off-by: Derek Perkins <derek@derekperkins.com>

derekperkins requested review from dkhenry and enisoc November 6, 2018 05:03

dkhenry reviewed Nov 6, 2018

docker/k8s/Dockerfile

dkhenry approved these changes Nov 6, 2018


derekperkins added 5 commits November 6, 2018 19:22

docker: add logtail image

c7422f5

Signed-off-by: Derek Perkins <derek@derekperkins.com>

helm: use new logtail image for slow/error logs

3856f46

the previous method didn’t listen for SIGTERM and so would block restarting

Signed-off-by: Derek Perkins <derek@derekperkins.com>

docker: logrotate - use bash sleep instead of cron

5329fa3

this doesn’t require root privileges to run, and also listens for SIGTERM

Signed-off-by: Derek Perkins <derek@derekperkins.com>

helm: stop running logrotate as root

cef15a5

Signed-off-by: Derek Perkins <derek@derekperkins.com>

helm: use logtail on general.log

c15a90f

also rename slow.log to slow-query.log to match Vitess defaults

Signed-off-by: Derek Perkins <derek@derekperkins.com>

derekperkins force-pushed the k8s-logs branch from 97ca0be to c15a90f November 7, 2018 05:03

docker: fix logtail codeclimate issues

c9e5b22

Signed-off-by: Derek Perkins <derek@derekperkins.com>

derekperkins force-pushed the k8s-logs branch from 080d0e5 to c9e5b22 November 7, 2018 05:47

derekperkins merged commit b06f7c8 into vitessio:master Nov 9, 2018

derekperkins mentioned this pull request Nov 13, 2018

helm: general cleanup #4361

Merged

rafael mentioned this pull request Dec 3, 2018

Slack vitess 2018 12 3.r0 tinyspeck/vitess#118

Merged
