Skip to content

Commit

Permalink
Split MPMD stdout into tasks on slurm
Browse files Browse the repository at this point in the history
It can be difficult to debug MPMD jobs because their logs are all written
concurrently to a single file. While the use of tags to designate which
task via the preamble and PS4 can help identify which line is from which
task, it is still difficult to follow a single task through the log,
particularly for larger MPMD jobs with dozens of tasks.

Individual stdout files are now created by using the `srun` `--output`
option. These files are written to the working directory (in `$DATA`).

Fixes: NOAA-EMC#1468
  • Loading branch information
WalterKolczynski-NOAA committed Apr 15, 2023
1 parent d47f33f commit 9a24d4a
Show file tree
Hide file tree
Showing 4 changed files with 4 additions and 4 deletions.
2 changes: 1 addition & 1 deletion env/HERA.env
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@ step=$1

export npe_node_max=40
export launcher="srun -l --export=ALL"
export mpmd_opt="--multi-prog"
export mpmd_opt="--multi-prog --output=${step}.%J.%t.out"

# Configure MPI environment
#export I_MPI_ADJUST_ALLREDUCE=5
Expand Down
2 changes: 1 addition & 1 deletion env/JET.env
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ elif [[ "${PARTITION_BATCH}" = "vjet" ]]; then
export npe_node_max=16
fi
export launcher="srun -l --export=ALL"
export mpmd_opt="--multi-prog"
export mpmd_opt="--multi-prog --output=${step}.%J.%t.out"

# Configure STACK
export OMP_STACKSIZE=2048000
Expand Down
2 changes: 1 addition & 1 deletion env/ORION.env
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@ step=$1

export npe_node_max=40
export launcher="srun -l --export=ALL"
export mpmd_opt="--multi-prog"
export mpmd_opt="--multi-prog --output=${step}.%J.%t.out"

# Configure MPI environment
export MPI_BUFS_PER_PROC=2048
Expand Down
2 changes: 1 addition & 1 deletion env/S4.env
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,7 @@ elif [[ ${PARTITION_BATCH} = "ivy" ]]; then
export npe_node_max=20
fi
export launcher="srun -l --export=ALL"
export mpmd_opt="--multi-prog"
export mpmd_opt="--multi-prog --output=${step}.%J.%t.out"

# Configure MPI environment
export OMP_STACKSIZE=2048000
Expand Down

0 comments on commit 9a24d4a

Please sign in to comment.