diff --git a/.github/actions/spelling/allow.txt b/.github/actions/spelling/allow.txt
index 90a57d81..b760ed40 100644
--- a/.github/actions/spelling/allow.txt
+++ b/.github/actions/spelling/allow.txt
@@ -4,6 +4,7 @@ AMD
 Alpstein
 Balfrin
 Besard
+Besso
 Broyden
 CFLAGS
 CHARMM
@@ -120,6 +121,7 @@ artifactory
 autodetection
 aws
 baremetal
+besso
 biomolecular
 blaspp
 blt
diff --git a/docs/clusters/besso.md b/docs/clusters/besso.md
new file mode 100644
index 00000000..7dc2818d
--- /dev/null
+++ b/docs/clusters/besso.md
@@ -0,0 +1,82 @@
+[](){#ref-cluster-besso}
+# Besso
+
+Besso is a small Alps cluster that provides development resources for porting software for selected customers.
+It is provided as is, without the same level of support as the main platform clusters.
+
+### Storage and file systems
+
+Besso uses the [HPCP filesystems and storage policies][ref-hpcp-storage].
+
+## Getting started
+
+### Logging into Besso
+
+To connect to Besso via SSH, first refer to the [ssh guide][ref-ssh].
+
+!!! example "`~/.ssh/config`"
+    Add the following to your [SSH configuration][ref-ssh-config] to enable you to directly connect to besso using `ssh besso`.
+    ```
+    Host besso
+        HostName besso.vc.cscs.ch
+        ProxyJump ela
+        User cscsusername
+        IdentityFile ~/.ssh/cscs-key
+        IdentitiesOnly yes
+    ```
+
+### Software
+
+[](){#ref-cluster-besso-uenv}
+#### uenv
+
+Besso is a development and testing system, for which CSCS does not provide supported applications.
+
+Instead, the [prgenv-gnu][ref-uenv-prgenv-gnu] programming environment is provided for both the [a100][ref-alps-a100-node] and [mi200][ref-alps-mi200-node] node types.
+
+[](){#ref-cluster-besso-containers}
+#### Containers
+
+Besso supports container workloads using the [Container Engine][ref-container-engine].
+
+To build images, see the [guide to building container images on Alps][ref-build-containers].
+
+#### Cray Modules
+
+!!! warning
+    The Cray Programming Environment (CPE), loaded using `module load cray`, is no longer supported by CSCS.
+
+    CSCS will continue to support and update uenv and the Container Engine, and users are encouraged to update their workflows to use these methods at the first opportunity.
+
+    The CPE is still installed on Besso, however it will receive no support or updates, and will be [replaced with a container][ref-cpe] in a future update.
+
+## Running jobs on Besso
+
+### Slurm
+
+Besso uses [Slurm][ref-slurm] as the workload manager, which is used to launch and monitor workloads on compute nodes.
+
+There are multiple [Slurm partitions][ref-slurm-partitions] on the system:
+
+* the `a100` partition contains [NVIDIA A100 GPU][ref-alps-a100-node] nodes
+* the `mi200` partition contains [AMD MI250X GPU][ref-alps-mi200-node] nodes
+* the `normal` partition contains all of the nodes in the system.
+
+| name | max nodes per job | time limit |
+| -- | -- | -- |
+| `a100` | 2 | 24 hours |
+| `mi200` | 2 | 24 hours |
+| `normal` | 4 | 24 hours |
+
+See the Slurm documentation for instructions on how to [run jobs][ref-slurm].
+
+### FirecREST
+
+!!! todo
+    add the correct API endpoint
+
+Besso can also be accessed using [FirecREST][ref-firecrest] at the `https://api.cscs.ch/hpc/firecrest/v2` API endpoint.
+
+## Maintenance and status
+
+There is no regular scheduled maintenance for this system.
diff --git a/docs/alps/clusters.md b/docs/clusters/index.md
similarity index 85%
rename from docs/alps/clusters.md
rename to docs/clusters/index.md
index defbad64..dc6de5f6 100644
--- a/docs/alps/clusters.md
+++ b/docs/clusters/index.md
@@ -43,4 +43,14 @@ The following clusters are part of the platforms that are fully operated by CSCS
 [:octicons-arrow-right-24: Santis][ref-cluster-santis]
 
+## Other systems
+