Skip to content

Commit

Permalink
Added required changes for CPU autodetection
Browse files Browse the repository at this point in the history
  • Loading branch information
Caspar van Leeuwen committed Sep 19, 2023
1 parent 2f411e9 commit 44b815b
Show file tree
Hide file tree
Showing 3 changed files with 29 additions and 18 deletions.
19 changes: 9 additions & 10 deletions config/aws_citc.py
Original file line number Diff line number Diff line change
@@ -1,15 +1,14 @@
# This is an example configuration file
# WARNING: for CPU autodetect to work correctly you need to
# 1. Either use ReFrame >= 4.3.3 or temporarily change the 'launcher' for each partition to srun
# 2. Either use ReFrame >= 4.3.3 or run from a clone of the ReFrame repository

# Note that CPU autodetect currently does not work with this configuration file on AWS.
# This is because there is no system mpirun, and the CPU autodetection doesn't load any modules
# that would make an mpirun command available (as normal multiprocessing tests would).
# In order to do CPU autodetection, you'll need to change the launcer to srun:
# 'launcher = srun'
# You can run the CPU autodetect by listing all tests (reframe -l ...)
# and then, once all CPUs are autodetected, change the launcher back to mpirun for a 'real' run (reframe -r ...)
# Without this, the autodetect job fails because
# 1. A missing mpirun command
# 2. An incorrect directory structure is assumed when preparing the stagedir for the autodetect job

# Another known issue is that CPU autodetection fails if run from an actual installation of ReFrame.
# It only works if run from a clone of their Github Repo. See https://github.com/reframe-hpc/reframe/issues/2914
# Related issues
# 1. https://github.com/reframe-hpc/reframe/issues/2926
# 2. https://github.com/reframe-hpc/reframe/issues/2914

import os

Expand Down
14 changes: 10 additions & 4 deletions config/izum_vega.py
Original file line number Diff line number Diff line change
@@ -1,11 +1,17 @@
# WARNING: for CPU autodetect to work correctly you need to
# 1. Either use ReFrame >= 4.3.3 _or_ temporarily change the 'launcher' for each partition to srun
# 2. Temporarily change the 'access' field for the GPU partition to
# 1. Either use ReFrame >= 4.3.3 or temporarily change the 'launcher' for each partition to srun
# 2. Either use ReFrame >= 4.3.3 or run from a clone of the ReFrame repository
# 3. Temporarily change the 'access' field for the GPU partition to
# 'access': ['-p gpu', '--export=None', '--gres=gpu:1'],

# Without this, the autodetect job fails because
# a missing mpirun command (change #1)
# Vega doesn't allow submission to the GPU partition without requesting at least one GPU (change #2)
# 1. A missing mpirun command
# 2. An incorrect directory structure is assumed when preparing the stagedir for the autodetect job
# 3. Vega doesn't allow submission to the GPU partition without requesting at least one GPU (change #2)

# Related issues
# 1. https://github.com/reframe-hpc/reframe/issues/2926
# 2. https://github.com/reframe-hpc/reframe/issues/2914

import os

Expand Down
14 changes: 10 additions & 4 deletions config/surf_snellius.py
Original file line number Diff line number Diff line change
@@ -1,11 +1,17 @@
# WARNING: for CPU autodetect to work correctly you need to
# 1. Either use ReFrame >= 4.3.3 _or_ temporarily change the 'launcher' for each partition to srun
# 2. Temporarily change the 'access' field for the GPU partition to
# 1. Either use ReFrame >= 4.3.3 or temporarily change the 'launcher' for each partition to srun
# 2. Either use ReFrame >= 4.3.3 or run from a clone of the ReFrame repository
# 3. Temporarily change the 'access' field for the GPU partition to
# 'access': ['-p gpu', '--export=None', '--exclusive'],

# Without this, the autodetect job fails because
# a missing mpirun command (change #1)
# Snellius doesn't allow submission to the GPU partition without requesting at least one GPU (change #2)
# 1. A missing mpirun command
# 2. An incorrect directory structure is assumed when preparing the stagedir for the autodetect job
# 3. Snellius doesn't allow submission to the GPU partition without requesting at least one GPU

# Related issues
# 1. https://github.com/reframe-hpc/reframe/issues/2926
# 2. https://github.com/reframe-hpc/reframe/issues/2914

import os

Expand Down

0 comments on commit 44b815b

Please sign in to comment.