Skip to content

Commit

Permalink
docs: Add byoc request quota (#5037)
Browse files Browse the repository at this point in the history
Add byoc request quota doc

Signed-off-by: Sherlock113 <sherlockxu07@gmail.com>
  • Loading branch information
Sherlock113 authored Oct 24, 2024
1 parent c59f84c commit 2b65f06
Show file tree
Hide file tree
Showing 2 changed files with 65 additions and 0 deletions.
31 changes: 31 additions & 0 deletions docs/source/bentocloud/byoc/aws.rst
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,37 @@ BentoCloud BYOC Setup Guide for AWS

As part of our commitment to ensuring a smooth and efficient onboarding process, we have prepared this setup guide to help your DevOps team integrate BentoCloud into your AWS account.

Request quotas
--------------

To ensure there are no delays in your cluster setup, please make sure you have enough service quotas in your AWS account prior to starting the setup. If not, please request increased quotas in the region where you will deploy BentoCloud.

To request quotas:

1. Visit the `AWS Service Quotas console <https://console.aws.amazon.com/servicequotas/home/services/ec2/quotas>`_ for your region.
2. `Request quotas <https://docs.aws.amazon.com/servicequotas/latest/userguide/request-quota-increase.html>`_ as per your deployment plan. See the table below for quota details:

.. list-table::
:widths: 10 35 25 30
:header-rows: 1

* - Type
- Quota name
- Required quantity
- Purpose
* - CPU
- ``Running On-Demand Standard (A, C, D, H, I, M, R, T, Z) instances``
- 32 vCPUs
- Run infrastructure workloads, image builder jobs, and CPU serving instances.
* - GPU
- Depending on needs:

- T4/A10G GPU: ``Running On-Demand G and VT instances``
- A100/H100 GPU: ``Running On-Demand P instances``

- Based on needs
- Run your workloads that require GPUs.

Setup
-----

Expand Down
34 changes: 34 additions & 0 deletions docs/source/bentocloud/byoc/gcp.rst
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,40 @@ BentoCloud BYOC Setup Guide for GCP

As part of our commitment to ensuring a smooth and efficient onboarding process, we have prepared this setup guide to help your DevOps team integrate BentoCloud into your GCP account.

Request quotas
--------------

To ensure there are no delays in your cluster setup, please make sure you have enough service quotas in your GCP account prior to starting the setup. If not, please request increased quotas in the project and region where you will deploy BentoCloud.

To request quotas:

1. Visit the `GCP Quotas page <https://console.cloud.google.com/iam-admin/quotas>`_.
2. `Request quotas <https://cloud.google.com/compute/resource-usage#vm_instance_quota>`_ in the correct project and region as per your deployment plan. See the table below for quota details:

.. list-table::
:widths: 10 35 25 30
:header-rows: 1

* - Type
- Quota name
- Required quantity
- Purpose
* - CPU
- ``CPUS``
- 32
- Run infrastructure workloads, image builder Pods, and serving instances.
* - GPU
- Depending on needs:

- T4: ``NVIDIA_T4_GPUS``
- L4: ``NVIDIA_L4_GPUS``
- A100 40GB: ``NVIDIA_A100_GPUS``
- A100 80GB: ``NVIDIA_A100_80GB_GPUS``
- H100 (Committed usage only): ``COMMITTED_NVIDIA_H100_GPUS``

- Based on needs
- Run your workloads that require GPUs.

Setup
-----

Expand Down

0 comments on commit 2b65f06

Please sign in to comment.