SLURM
From wiki.hpc.mk
Initiate and manage SLURM tasks
Most used parameters:
- !/bin/bash
- SBATCH --ntasks-per-node=2 # Number of tasks per phisical CPU core
- SBATCH --time=1:00:00 # Script duration (days-hrs:min:sec)
- SBATCH --job-name=test_job # Job name
- SBATCH --mem=1G # Ram memory for rendering (e.g. 1G, 2G, 4G)
- SBATCH --error=testerror_%j.error # Print the errors that occur when executing the job
- SBATCH --cpus-per-task=1 # Number of processors required for a single task
- SBATCH --output=testoutput_%j.out # Print the results from scripts and the values it returns
- SBATCH --gres=gpu:2 # Number of cards per one nod allocated for the job
- SBATCH --nodelist=cuda4 # Executing on specific nodes, e.g. cuda4 is for executing only on cuda4 host