Note
This policy defines a core set of environment variables that together form a resource definition at each HPCBIOS-compatible site participating in the Baseline Configuration (BC) Initiative. These variables shall be predefined in all login scripts, so that they are automatically available to users at each site whenever the relevant functionality is in place.
- BC Policy: HPCBIOS_2012-98
- Date of Policy: 2012-12-15
The current list of variables is as follows:
- BC_CORES_PER_NODE (MUST)
- This variable contains the number of cores per node for the default compute node type to which a job is being submitted.
- BC_MEM_PER_NODE (MUST)
- This variable contains the approximate maximum memory per node available to an end-user program (as an integer number of MB) for the default compute node type to which a job is being submitted. Users are advised not to consume more than 95% of the stated limit and to consider carefully any implications when node sharing is in place (this aspect MAY be queue-specific).
- BC_MPI_TASKS_ALLOC (SHOULD)
- This variable, intended to be referenced from inside a job script, shall contain the number of MPI tasks that are allocated for a particular job.
- BC_NODE_ALLOC (SHOULD)
- This variable, intended to be referenced from inside a job script, shall contain the number of nodes allocated for a particular job.
- HOME (MUST)
- This is defined automatically in a Unix environment and MUST correspond to a location where users can define their initialization scripts. This area is backed up at a frequency and under a policy defined by each site. Users are strongly advised to increase the redundancy of critical items by keeping their own copy; using a version control system (git, hg, svn, cvs) is highly RECOMMENDED for custom software codes, which may represent many hours of work.
- JAVA_HOME (SHOULD)
- This variable contains the path to the base directory of the default installation of Java on a particular compute platform. If the platform does not have Java installed, this variable should not be defined. It can also be provided dynamically via the modules mechanism (i.e., by loading a modulefile).
- SAMPLES_HOME (SHOULD)
- This variable contains the path to the base directory for the sample codes and scripts installed by a site's staff under BC policy Sample Code Repository.
- SOFT_HOME (MUST)
- This variable contains the path to the base directory for the application codes installed on a system.
- SCRATCH (SHOULD)
- This is a shared parallel filesystem optimized for large-file, high-bandwidth access. This file system is not backed up, and files on it MAY be purged as soon as a job finishes. Users SHOULD use this filesystem for jobs that need high aggregate bandwidth. A given system MAY omit it or restrict its usage to certain users or groups.
- TMPDIR (MUST)
- This is a directory on a local temporary file system (i.e., local high-speed disk) that SHOULD be available on all HPCBIOS-compatible systems. TMPDIR is intended to be used by executing programs to perform file I/O that is local to that system, in order to avoid slower file I/O across a network-mounted file system, such as a user’s HOME or ARCHIVE directories. It is not intended to be used as a file storage area by users, but rather as a working area within tasks. Accordingly, this file system SHALL NOT be backed up or exported to any other system. In the event of file or directory structure deletion or a catastrophic disk failure, such files and directory structures are lost. Thus, it is the user’s responsibility to transfer files that need to be saved to a location that allows for longer-term file storage, such as the user’s $HOME, $WORK or $ARCHIVE directory (if these are made permanent via a backup policy), or to another external system.
- WORK (MUST)
- This is an individual user’s directory for stage-in and stage-out of running jobs. It is also the correct location to store checkpointing state. This file system is not guaranteed to be backed up or exported to any other system. In the event of file or directory structure deletion or a catastrophic storage failure, such files and directory structures MAY be lost. Thus, it is the user’s responsibility to increase redundancy and to transfer files that need to be saved to a location that allows for longer-term file storage, such as the $HOME or $ARCHIVE directory (if these are made permanent via a backup policy), or to another external system. Judging whether the filesystem type is fit for a given workload is the user’s responsibility.
- WORKDIR (SHOULD)
- Same as WORK; see above.
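As a rough illustration of how a user or site administrator might check a session against the list above, the following Python sketch reports which variables are unset. The function name `check_bc_environment` and the two tuples are hypothetical helpers, not part of this policy; the variable names and requirement levels are taken from the list. Note that an unset SHOULD variable is not necessarily a problem: BC_MPI_TASKS_ALLOC and BC_NODE_ALLOC are only expected inside a job, and JAVA_HOME is expected to be absent when Java is not installed.

```python
import os

# Variable names and requirement levels as given in the list above.
MUST_VARS = ("BC_CORES_PER_NODE", "BC_MEM_PER_NODE", "HOME",
             "SOFT_HOME", "TMPDIR", "WORK")
SHOULD_VARS = ("BC_MPI_TASKS_ALLOC", "BC_NODE_ALLOC", "JAVA_HOME",
               "SAMPLES_HOME", "SCRATCH", "WORKDIR")


def check_bc_environment(env=None):
    """Return (missing_must, missing_should) for an environment mapping.

    A variable counts as missing when it is unset or set to an empty string.
    """
    if env is None:
        env = os.environ
    missing_must = [name for name in MUST_VARS if not env.get(name)]
    missing_should = [name for name in SHOULD_VARS if not env.get(name)]
    return missing_must, missing_should


if __name__ == "__main__":
    must, should = check_bc_environment()
    print("Missing MUST variables:  ", ", ".join(must) or "none")
    print("Missing SHOULD variables:", ", ".join(should) or "none")
```

Passing a plain dictionary instead of the live environment makes the check easy to try out without changing the current session.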
This list of environment variables is subject to change, and should be reviewed on a periodic basis for potential additions, modifications or deletions.
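To show how BC_MEM_PER_NODE and BC_CORES_PER_NODE might be combined in practice, here is a minimal Python sketch that derives an approximate per-task memory budget while staying under the 95% guidance given for BC_MEM_PER_NODE. The function name `mem_budget_per_task_mb` is hypothetical, and the calculation assumes one task per core and exclusive use of the node; neither assumption comes from this policy, and with node sharing the real budget can be lower.

```python
import os


def mem_budget_per_task_mb(env=None, percent=95):
    """Approximate per-task memory budget in whole MB.

    Assumes one task per core and no node sharing (illustrative only).
    """
    if env is None:
        env = os.environ
    mem_per_node_mb = int(env["BC_MEM_PER_NODE"])   # integer MB per the policy
    cores_per_node = int(env["BC_CORES_PER_NODE"])
    # Integer arithmetic avoids floating-point rounding and keeps whole MB.
    usable_mb = mem_per_node_mb * percent // 100
    return usable_mb // cores_per_node
```

For example, with BC_MEM_PER_NODE=64000 and BC_CORES_PER_NODE=16, the function returns 3800, i.e. 60800 MB of usable memory split across 16 tasks.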