Skip to content
This repository has been archived by the owner on Apr 3, 2020. It is now read-only.

Concourse-Up deployments will stop working after 1 year #81

Closed
danyoung opened this issue Jan 14, 2019 · 2 comments
Closed

Concourse-Up deployments will stop working after 1 year #81

danyoung opened this issue Jan 14, 2019 · 2 comments
Labels

Comments

@danyoung
Copy link
Contributor

danyoung commented Jan 14, 2019

EngineerBetter's own Concourse deployment stopped working over the weekend as the NATS TLS certificate that was created when the Director was first deployed expired on the 3rd of January 2019.

This didn't present a problem until the next time a NATS client (ie the BOSH agents on VMs) established a connection, at which point they couldn't connect. As the Health Manager couldn't communicate with the agents on the VMs via NATS, it decided to recreate them. Unfortunately it would recreate them with the same expired certificate, causing them to time out after 10 minutes, be destroyed, and then rescheduled for creation.

Unfortunately this will happen to every Concourse-Up deployment after a year, since BOSH-generated certs are hardcoded to this. We're currently looking at the problem and thank you for your patience while we implement a solution.

In the meantime, please check the age of your deployment using the info command and take steps to rotate the certs as described here: https://bosh.io/docs/nats-ca-rotation/. Please join us on Concourse-Up Slack to discuss further.

@danyoung danyoung added the bug label Jan 14, 2019
@crsimmons
Copy link
Contributor

As of v0.18.1 there is now a built-in Concourse-Up command for renewing these NATS certificates.

concourse-up maintain --renew-nats-cert

Read more in the readme

@danyoung
Copy link
Contributor Author

Closing this for now, since there is a preventative action available. Full automation of NATs cert renewal may be a future story.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests

2 participants