docs: Add byoc request quota #5037

Merged 1 commit on Oct 24, 2024
31 changes: 31 additions & 0 deletions docs/source/bentocloud/byoc/aws.rst
@@ -4,6 +4,37 @@ BentoCloud BYOC Setup Guide for AWS

As part of our commitment to ensuring a smooth and efficient onboarding process, we have prepared this setup guide to help your DevOps team integrate BentoCloud into your AWS account.

Request quotas
--------------

To avoid delays during cluster setup, make sure your AWS account has sufficient service quotas before you start. If it does not, request quota increases in the region where you will deploy BentoCloud.

To request quotas:

1. Visit the `AWS Service Quotas console <https://console.aws.amazon.com/servicequotas/home/services/ec2/quotas>`_ and select the region where you will deploy BentoCloud.
2. `Request quotas <https://docs.aws.amazon.com/servicequotas/latest/userguide/request-quota-increase.html>`_ according to your deployment plan. See the table below for quota details:

.. list-table::
   :widths: 10 35 25 30
   :header-rows: 1

   * - Type
     - Quota name
     - Required quantity
     - Purpose
   * - CPU
     - ``Running On-Demand Standard (A, C, D, H, I, M, R, T, Z) instances``
     - 32 vCPUs
     - Run infrastructure workloads, image builder jobs, and CPU serving instances.
   * - GPU
     - Depending on needs:

       - T4/A10G GPU: ``Running On-Demand G and VT instances``
       - A100/H100 GPU: ``Running On-Demand P instances``

     - Based on needs
     - Run your workloads that require GPUs.
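
As a rough illustration of how the table above could be checked against an account, the sketch below compares current quota values with the required quantities and reports any shortfall. It assumes quota entries shaped like the ``Quotas`` items returned by the AWS Service Quotas ``ListServiceQuotas`` API (``QuotaName`` and ``Value`` keys). The helper name ``find_shortfalls``, the sample values, and the GPU target of 8 vCPUs are hypothetical and not part of BentoCloud. Note that EC2 On-Demand quotas, including the G/VT and P families, are counted in vCPUs rather than in GPUs.

```python
from typing import Any, Dict, Iterable, Mapping

# Required quantities in vCPUs, keyed by EC2 quota name.
# The CPU figure comes from the table above; the GPU figure is a
# made-up example, since the guide only says "based on needs".
REQUIRED_VCPUS: Dict[str, float] = {
    "Running On-Demand Standard (A, C, D, H, I, M, R, T, Z) instances": 32,
    "Running On-Demand G and VT instances": 8,  # hypothetical target
}


def find_shortfalls(
    quotas: Iterable[Mapping[str, Any]],
    required: Mapping[str, float] = REQUIRED_VCPUS,
) -> Dict[str, float]:
    """Return {quota name: missing vCPUs} for every quota below its target."""
    current = {str(q["QuotaName"]): float(q["Value"]) for q in quotas}
    shortfalls: Dict[str, float] = {}
    for name, needed in required.items():
        # A quota that is absent from the listing is treated as 0.
        missing = needed - current.get(name, 0.0)
        if missing > 0:
            shortfalls[name] = missing
    return shortfalls


# Sample data standing in for a real API response (values are invented).
sample_quotas = [
    {
        "QuotaName": "Running On-Demand Standard (A, C, D, H, I, M, R, T, Z) instances",
        "Value": 16.0,
    },
    {"QuotaName": "Running On-Demand G and VT instances", "Value": 8.0},
]

shortfalls = find_shortfalls(sample_quotas)
for quota_name, missing in shortfalls.items():
    print(f"Request at least {missing:g} more vCPUs for: {quota_name}")
```

In practice the input list would come from the Service Quotas API for the ``ec2`` service code in the target region, for example through an SDK such as boto3. Any shortfall reported here still has to be requested through the console or the API as described in step 2.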

Setup
-----

34 changes: 34 additions & 0 deletions docs/source/bentocloud/byoc/gcp.rst
@@ -4,6 +4,40 @@ BentoCloud BYOC Setup Guide for GCP

As part of our commitment to ensuring a smooth and efficient onboarding process, we have prepared this setup guide to help your DevOps team integrate BentoCloud into your GCP account.

Request quotas
--------------

To avoid delays during cluster setup, make sure your GCP account has sufficient service quotas before you start. If it does not, request quota increases in the project and region where you will deploy BentoCloud.

To request quotas:

1. Visit the `GCP Quotas page <https://console.cloud.google.com/iam-admin/quotas>`_.
2. `Request quotas <https://cloud.google.com/compute/resource-usage#vm_instance_quota>`_ in the correct project and region according to your deployment plan. See the table below for quota details:

.. list-table::
   :widths: 10 35 25 30
   :header-rows: 1

   * - Type
     - Quota name
     - Required quantity
     - Purpose
   * - CPU
     - ``CPUS``
     - 32
     - Run infrastructure workloads, image builder Pods, and serving instances.
   * - GPU
     - Depending on needs:

       - T4: ``NVIDIA_T4_GPUS``
       - L4: ``NVIDIA_L4_GPUS``
       - A100 40GB: ``NVIDIA_A100_GPUS``
       - A100 80GB: ``NVIDIA_A100_80GB_GPUS``
       - H100 (Committed usage only): ``COMMITTED_NVIDIA_H100_GPUS``

     - Based on needs
     - Run your workloads that require GPUs.
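
To make the table above concrete, here is a minimal sketch that works out how much additional quota to request in a region. It assumes quota entries shaped like the ``quotas`` list that the Compute Engine API returns for a region (``metric``, ``limit``, and ``usage`` keys), and it treats each required quantity as free capacity, that is, limit minus current usage. That interpretation, the helper name ``quota_gaps``, the sample values, and the L4 target of 2 GPUs are all assumptions made for illustration, not something the guide specifies.

```python
from typing import Any, Dict, Iterable, Mapping

# Required free capacity per regional quota metric.
# CPUS comes from the table above; the GPU entry is a hypothetical
# example, since the guide only says "based on needs".
REQUIRED: Dict[str, float] = {
    "CPUS": 32,
    "NVIDIA_L4_GPUS": 2,  # hypothetical target
}


def quota_gaps(
    quotas: Iterable[Mapping[str, Any]],
    required: Mapping[str, float] = REQUIRED,
) -> Dict[str, float]:
    """Return {metric: extra limit to request} for under-provisioned metrics.

    Assumption: the requirement applies to unused capacity (limit - usage),
    so resources already consumed in the region do not count toward it.
    """
    available = {
        str(q["metric"]): float(q["limit"]) - float(q.get("usage", 0.0))
        for q in quotas
    }
    gaps: Dict[str, float] = {}
    for metric, needed in required.items():
        # A metric missing from the listing is treated as having no capacity.
        gap = needed - available.get(metric, 0.0)
        if gap > 0:
            gaps[metric] = gap
    return gaps


# Sample data standing in for a real regional quota listing (values invented).
sample_region_quotas = [
    {"metric": "CPUS", "limit": 24.0, "usage": 8.0},
    {"metric": "NVIDIA_L4_GPUS", "limit": 1.0, "usage": 0.0},
    {"metric": "NVIDIA_T4_GPUS", "limit": 4.0, "usage": 0.0},
]

gaps = quota_gaps(sample_region_quotas)
for metric, gap in gaps.items():
    print(f"Raise {metric} by at least {gap:g}")
```

If the required quantities are meant as total limits rather than free capacity, drop the ``usage`` subtraction. Either way, the increase itself is still requested from the Quotas page as described in step 2.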

Setup
-----
