diff --git a/.github/actions/spelling/allow.txt b/.github/actions/spelling/allow.txt index 90a57d81..b760ed40 100644 --- a/.github/actions/spelling/allow.txt +++ b/.github/actions/spelling/allow.txt @@ -4,6 +4,7 @@ AMD Alpstein Balfrin Besard +Besso Broyden CFLAGS CHARMM @@ -120,6 +121,7 @@ artifactory autodetection aws baremetal +besso biomolecular blaspp blt diff --git a/docs/clusters/besso.md b/docs/clusters/besso.md new file mode 100644 index 00000000..7dc2818d --- /dev/null +++ b/docs/clusters/besso.md @@ -0,0 +1,82 @@ +[](){#ref-cluster-besso} +# Besso + +Besso is a small Alps cluster that provides development resources for porting software for selected customers. +It is provided as is, without the same level of support as the main platform clusters. + +### Storage and file systems + +Besso uses the [HPCP filesystems and storage policies][ref-hpcp-storage]. + +## Getting started + +### Logging into Besso + +To connect to Besso via SSH, first refer to the [ssh guide][ref-ssh]. + +!!! example "`~/.ssh/config`" + Add the following to your [SSH configuration][ref-ssh-config] to enable you to directly connect to besso using `ssh besso`. + ``` + Host besso + HostName besso.vc.cscs.ch + ProxyJump ela + User cscsusername + IdentityFile ~/.ssh/cscs-key + IdentitiesOnly yes + ``` + +### Software + +[](){#ref-cluster-besso-uenv} +#### uenv + +Besso is a development and testing system, for which CSCS does not provide supported applications. + +Instead, the [prgenv-gnu][ref-uenv-prgenv-gnu] programming environment is provided for the both the [a100][ref-alps-a100-node] and [mi200][ref-alps-mi200-node] node types. + +[](){#ref-cluster-besso-containers} +#### Containers + +Besso supports container workloads using the [Container Engine][ref-container-engine]. + +To build images, see the [guide to building container images on Alps][ref-build-containers]. + +#### Cray Modules + +!!! warning + The Cray Programming Environment (CPE), loaded using `module load cray`, is no longer supported by CSCS. + + CSCS will continue to support and update uenv and the Container Engine, and users are encouraged to update their workflows to use these methods at the first opportunity. + + The CPE is still installed on Besso, however it will receive no support or updates, and will be [replaced with a container][ref-cpe] in a future update. + +## Running jobs on Besso + +### Slurm + +Besso uses [Slurm][ref-slurm] as the workload manager, which is used to launch and monitor workloads on compute nodes. + +There are multiple [Slurm partitions][ref-slurm-partitions] on the system: + +* the `a100` partition contains [NVIDIA A100 GPU][ref-alps-a100-node] nodes +* the `mi200` partition contains [AMD Mi250x GPU][ref-alps-mi200-node] nodes +* the `normal` partition contains all of the nodes in the system. + +| name | max nodes per job | time limit | +| -- | -- | -- | +| `a100` | 2 | 24 hours | +| `mi200` | 2 | 24 hours | +| `normal` | 4 | 24 hours | + +See the Slurm documentation for instructions on how to [run jobs][ref-slurm]. + +### FirecREST + +!!! todo + add the correct API endpoint + +Besso can also be accessed using [FirecREST][ref-firecrest] at the `https://api.cscs.ch/hpc/firecrest/v2` API endpoint. + +## Maintenance and status + +There is no regular scheduled maintenance for this system. diff --git a/docs/alps/clusters.md b/docs/clusters/index.md similarity index 85% rename from docs/alps/clusters.md rename to docs/clusters/index.md index defbad64..dc6de5f6 100644 --- a/docs/alps/clusters.md +++ b/docs/clusters/index.md @@ -43,4 +43,14 @@ The following clusters are part of the platforms that are fully operated by CSCS [:octicons-arrow-right-24: Santis][ref-cluster-santis] +## Other systems + +
+- :fontawesome-solid-mountain: __Porting and Development__ + + Besso is a small system used by some partners for development and porting with AMD and NVIDIA GPUs. + + [:octicons-arrow-right-24: Besso][ref-cluster-besso] +
+ diff --git a/docs/index.md b/docs/index.md index b11cbff8..ff3b2bb5 100644 --- a/docs/index.md +++ b/docs/index.md @@ -30,15 +30,15 @@ Find out more about Alps... Learn more about the Alps research infrastructure - [:octicons-arrow-right-24: Alps Overview](alps/index.md) + [:octicons-arrow-right-24: Alps Overview][ref-alps] Get detailed information about the main components of the infrastructure - [:octicons-arrow-right-24: Alps Clusters](alps/clusters.md) + [:octicons-arrow-right-24: Alps Clusters][ref-alps-clusters] - [:octicons-arrow-right-24: Alps Hardware](alps/hardware.md) + [:octicons-arrow-right-24: Alps Hardware][ref-alps-hardware] - [:octicons-arrow-right-24: Alps Storage](alps/storage.md) + [:octicons-arrow-right-24: Alps Storage][ref-alps-storage] - :fontawesome-solid-key: __Logging In__ diff --git a/mkdocs.yml b/mkdocs.yml index e240bbee..e07b1300 100644 --- a/mkdocs.yml +++ b/mkdocs.yml @@ -17,7 +17,14 @@ nav: - 'Alps': - alps/index.md - 'Platforms': alps/platforms.md - - 'Clusters': alps/clusters.md + - 'Clusters': + - clusters/index.md + - 'Besso': clusters/besso.md + - 'Bristen': clusters/bristen.md + - 'Clariden': clusters/clariden.md + - 'Daint': clusters/daint.md + - 'Eiger': clusters/eiger.md + - 'Santis': clusters/santis.md - 'Hardware': alps/hardware.md - 'Storage': alps/storage.md - 'Machine Learning Platform':