-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deploy karm and purchase karm+1 #94
Comments
+1 sounds good to me. |
So a quick summary of my plan to move forward:
Assuming a pg_upgrade looks OK then:
If pg_upgrade doesn't look OK then we'll need to consider whether we want to take long enough downtime to restore into 9.5 on karm, or just stick with 9.1 for now. |
The restore into 9.5 took about 20h. Should we get karm up and running as a regular 9.1 replica and wait for "karm+1" so that we can benchmark on that machine? I thought the purpose of this ticket was to replace ramoth with karm, rather than do the full 9.5 upgrade. Hopefully a "karm+1" would allow us to benchmark and de-risk that while not having to worry about ramoth flaking out. |
Well it was always my intention to move to 9.5 as part of deploying karm if that was feasible. Give me a few days to see how well a pg_upgrade works. |
I restored to 9.1 and have just done a test upgrade to 9.5 with the following command:
Total time to upgrade was 25 seconds. I have also built the chef infrastructure to support building multiple versions of libpgosm.so and linking them into the postgres library directory and have changed the live database (as well as karm before the upgrade) to use |
That's great! And I guess it means the majority of the data is in a disk format which either didn't change or is backwards-compatible. Do you think we could fit this into a scheduled ~1h downtime in a couple of weeks time? |
It should only be the system tables |
karm is in production. Should this be closed or kept open for karm+1 discussion? |
No it isn't. |
Karm is now serving Read Only API traffic. |
I'm going to close this, and open a new ticket for purchasing karm+1 |
This is not complete yet. |
To explain, deployment of karm will not be complete until we make it master and do the postgres upgrade which I plan to do once the IC power outage on 7th January is out of the way. |
Now that the IC outage is out of the way, is there anything more blocking this other than scheduling a few hours downtime to do the upgrade? |
No. I mean I need to write myself a detailed list of steps to take but other than that it's just a case of scheduling the downtime. |
Okay. Do you have any preference for when? i.e: Sunday morning / Friday evening? Given that we expect this to be quite quick, perhaps we could schedule for some time around 11th March? I don't know whether Easter is a good time (people not mapping because it's a holiday) or a bad time (people mapping because it's a holiday), so probably best to avoid that if we can. |
Sunday morning is probably the best bet and Easter itself won't work for me as I'll either be on holiday or at my parents... Something like 12th March should work? |
Preliminary work for next weekend has hopefully been completed now. Both ramoth and karm have been switched to use postgresql.org packages and ramoth has been upgraded to Ubuntu 16.04 this weekend. All three machines have postgres 9.6 installed and the database functions have been built and installed to the postgres library directory. Some directory rearrangement has been done on ramoth and katla to make paths more consistent and setup a location ready for the 9.6 system tablespace. Checklist for the day, based on the last time we did a master flip, my upgrade experiments last year and the postgres documentation for upgrading slaves:
|
cross-checking with my notes, https://gist.github.com/Komzpa/994d5aaf340067ccec0e - |
@Komzpa yes it gives you a script to run as part of the upgrade which is why I didn't mention that separately |
@tomhughes is it misspelling, that inside the checklist you mentioned 'Stop postgres on karm' two times? |
@gardster it was yes - now fixed. |
Master switch and database upgrade is complete - the OS upgrade on katla is now underway. Note that we went to 9.5 not 9.6 in the end because we have indexes on |
Upgrade is complete and karm is now fully deployed so I'm going to close this - we have #131 tracking the purchase of karm+1. |
In light of the problems with Ramoth being unable to cope with load (see #91 and #92 ), and our long-running concerns with Ramoth's replication, I think it is urgent to move karm into production. I don't want to hijack the discussion on #78 so let's discuss it here.
My plan would be as follows:
Thoughts?
The text was updated successfully, but these errors were encountered: