Add an autoscaling group for the docs-rs-builder #243

rylev · 2023-02-16T15:53:12Z

This adds an autoscaling group for the docs-rs-builder.

Currently this works by grabbing the latest docs-rs-builder AMI, creating a launch template with that AMI, and then using that template to make an autoscaling group.

I'm not sure we want to actually deploy this way in the long term, but I think it's a good start for testing how the autoscaling group behaves in practice.

syphar · 2023-02-16T18:41:40Z

terragrunt/modules/docs-rs/builder.tf

+    device_name = "/dev/sda1"
+
+    ebs {
+      volume_size           = 64


Seeing the current filesystem usage (~100 GB) I would prefer something at least double this size.

( of course the current usage also includes the database & some web cache)

It would be nice if we didn't need so much storage. I believe a large part of that storage is only needed during a single crate's build and afterwards can be deleted, no? Would it be possible to add some clean up to the builder process so that the filesystem usage doesn't grow so large?

Hm.. we are cleaning up after the build.

Plus the cleanup tasks for docker images, which are in cron right now.
( btw, cc @Nemo157 @jyn514 , these cronjobs would need to be configured in our ansible images too, right? )

Only looking at the above I could totally imagine to just try with the current definition above, let the builder build, and watch how much space is used. ( assuming the big docker image is configured?)

But, we're also planning on adding some build artifact caching: rust-lang/docs.rs#1757
( of course we could increase storage only then, when that feature is finished)

Yep, we just have a daily cronjob (systemd-timer) running docker container prune --force && docker image prune --force (and cargo-sweep which shouldn't be necessary if we rebuild the image for a new version?).

Where is the cronjob currently configured? I can add this to the Ansible configuration (though I wouldn't block merging this).

It's in

/etc/systemd/system/prune-disk-space.service /etc/systemd/system/prune-disk-space.timer

syphar · 2023-02-16T18:43:03Z

two questions:

I assume the actual autoscaling is handled differently? I mean fetching the metric & scaling based on it?
also, metrics fetching from prometheus will also be added later?

rylev · 2023-02-20T09:34:17Z

I assume the actual autoscaling is handled differently?

Correct, the autoscaling currently is whatever the default is for ec2 instance health checks which I believe is super basic (e.g., if the instance goes into serviced mode). This is definitely not what we want, but I'd like to do that as a follow-up PR.

metrics fetching from prometheus will also be added later?

Correct

jyn514 · 2023-02-27T16:16:42Z

terragrunt/accounts/docs-rs-staging/docs-rs/terragrunt.hcl

+  min_num_builder_instances = 1
+  max_num_builder_instances = 1


I'm confused what the autoscaling does when you've pinned it to always be at one instance?

It assures that there's one healthy instance. So if one instance stops or gets terminated a new one boots.

rylev requested a review from Mark-Simulacrum February 16, 2023 15:53

syphar reviewed Feb 16, 2023

View reviewed changes

rylev requested a review from jdno February 20, 2023 10:23

rylev force-pushed the builder-autoscale branch from 68709ae to f4a394b Compare February 20, 2023 13:34

rylev added 2 commits February 21, 2023 10:03

Add an autoscaling group for the docs-rs-builder

69aabc4

Add instance tag

743d0e8

rylev force-pushed the builder-autoscale branch from f4a394b to 743d0e8 Compare February 21, 2023 09:03

jdno approved these changes Feb 27, 2023

View reviewed changes

jyn514 reviewed Feb 27, 2023

View reviewed changes

rylev merged commit e83cf54 into rust-lang:master Mar 7, 2023

rylev deleted the builder-autoscale branch March 7, 2023 16:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add an autoscaling group for the docs-rs-builder #243

Add an autoscaling group for the docs-rs-builder #243

rylev commented Feb 16, 2023

syphar Feb 16, 2023

rylev Feb 20, 2023

syphar Feb 20, 2023

Nemo157 Feb 20, 2023 •

edited

Loading

rylev Feb 21, 2023

Nemo157 Feb 21, 2023

syphar commented Feb 16, 2023

rylev commented Feb 20, 2023

jyn514 Feb 27, 2023

rylev Feb 27, 2023

Add an autoscaling group for the docs-rs-builder #243

Add an autoscaling group for the docs-rs-builder #243

Conversation

rylev commented Feb 16, 2023

syphar Feb 16, 2023

Choose a reason for hiding this comment

rylev Feb 20, 2023

Choose a reason for hiding this comment

syphar Feb 20, 2023

Choose a reason for hiding this comment

Nemo157 Feb 20, 2023 • edited Loading

Choose a reason for hiding this comment

rylev Feb 21, 2023

Choose a reason for hiding this comment

Nemo157 Feb 21, 2023

Choose a reason for hiding this comment

syphar commented Feb 16, 2023

rylev commented Feb 20, 2023

jyn514 Feb 27, 2023

Choose a reason for hiding this comment

rylev Feb 27, 2023

Choose a reason for hiding this comment

Nemo157 Feb 20, 2023 •

edited

Loading