SLURM

From wiki.hpc.mk
Revision as of 12:15, 27 August 2021 by Boris (talk | contribs)

Initiate and manage SLURM tasks

Most used parameters:

  1. !/bin/bash
  1. SBATCH --ntasks-per-node=2 # Number of tasks per phisical CPU core
  2. SBATCH --time=1:00:00 # Script duration (days-hrs:min:sec)
  3. SBATCH --job-name=test_job # Job name
  4. SBATCH --mem=1G # Ram memory for rendering (e.g. 1G, 2G, 4G)
  5. SBATCH --error=testerror_%j.error # Print the errors that occur when executing the job
  6. SBATCH --cpus-per-task=1 # Number of processors required for a single task
  7. SBATCH --output=testoutput_%j.out # Print the results from scripts and the values it returns
  8. SBATCH --gres=gpu:2 # Number of cards per one nod allocated for the job
  9. SBATCH --nodelist=cuda4 # Executing on specific nodes, e.g. cuda4 is for executing only on cuda4 host