Difference between revisions of "Slurm"
From Systems Group
Revision as of 19:04, 4 August 2022
Slurm is an open-source job scheduler for Linux and Unix-like kernels.
SRUN
srun submits a job for execution in real time. It is also used to create job steps.
srun example
srun --pty /bin/bash # interactive shell in a compute job; the default account is used when none is specified
srun -p slurm-general-01 --account=slurmgeneral --pty /bin/bash # interactive shell in a compute job, specifying the partition and account (applicable if the user is assigned multiple accounts)
SBATCH
sbatch is the command used to submit jobs to Slurm via batch scripts.
batch script example
#!/bin/bash -l
# the -l flag starts a login shell (required for lmod)
#SBATCH --job-name=testprogram # job name
#SBATCH --partition=slurm-general-01 # partition to run the job on
#SBATCH --account=slurmgeneral # only applicable if the user is assigned multiple accounts
#SBATCH --ntasks=1 # number of tasks (commands run in parallel)
#SBATCH --mem=1gb # request 1 GB of memory
#SBATCH --output=testprogram.log # output and error log
date
sleep 10
module use /mnt/lmod_modules/Linux/
module load miniconda3
someProgram.py
date
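sbatch only honors #SBATCH lines that appear before the first executable command in the script; later ones are treated as ordinary comments. The sketch below lists the directives a script would actually pass to Slurm. The helper name list_sbatch_directives and the temporary demo script are illustrative assumptions, not part of Slurm.

```shell
#!/bin/bash
# list_sbatch_directives is a hypothetical helper, not a Slurm command.
# It prints the #SBATCH lines that precede the first executable command.
list_sbatch_directives() {
    awk '
        /^#SBATCH[ \t]/     { print; next }  # directive line
        /^#/ || /^[ \t]*$/  { next }         # other comments and blank lines
                            { exit }         # first command: directives after this are ignored
    ' "$1"
}

# Demo on a throwaway script (contents are made up for illustration)
demo=$(mktemp)
cat > "$demo" <<'EOF'
#!/bin/bash -l
#SBATCH --job-name=testprogram
#SBATCH --mem=1gb
date
#SBATCH --ntasks=4
EOF
list_sbatch_directives "$demo"   # the --ntasks line is not listed: it follows "date"
rm -f "$demo"
```

Running the helper on a script before submitting it is a quick way to catch a directive that was placed too late to take effect.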
submitting a job using sbatch
sbatch myprogram.sh # queue job using a batch script
sbatch -p slurm-general-01 --account=slurmgeneral myprogram.sh # queue job, giving the partition and account on the command line when the script does not set them with #SBATCH directives
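When a submission is scripted, it helps to capture the job ID so the job can be checked afterwards. A minimal sketch, assuming sbatch's --parsable option (which prints the job ID, optionally followed by ";" and the cluster name). The function name submit_job is hypothetical.

```shell
#!/bin/bash
# submit_job is a hypothetical wrapper: submit_job SCRIPT [sbatch options...]
# It prints only the numeric job ID on success.
submit_job() {
    local script=$1
    shift
    local out
    # options must come before the script name on the sbatch command line
    out=$(sbatch --parsable "$@" "$script") || return 1
    # --parsable may print "jobid;cluster"; keep only the job ID
    printf '%s\n' "${out%%;*}"
}

# Submit only where sbatch exists and the script is present
if command -v sbatch >/dev/null 2>&1 && [ -f myprogram.sh ]; then
    jobid=$(submit_job myprogram.sh -p slurm-general-01 --account=slurmgeneral) &&
        squeue -j "$jobid"   # show the queue entry for the new job
fi
```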
Compute Resources
The ODU CS department HPC cluster consists of multiple partitions to which users can submit jobs. Each partition can be accessed only by users who are assigned to that partition's account, so not every user can access every partition.
Cluster | Partition | Account |
---|---|---|
slurm-cluster | slurm-general-01 | slurmgeneral |
slurm-cluster | slurm-general-02 | slurmgeneral |
slurm-cluster | haoresearch | shaoresearch |
slurm-cluster | lusiliresearch | lliresearch |
slurm-cluster | wangresearch | fwangresearch |
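Because each partition pairs with exactly one account in the table above, the account can be looked up from the partition name. The sketch below hard-codes the table. The function name account_for_partition is hypothetical, and the mapping would need updating whenever the table changes.

```shell
#!/bin/bash
# account_for_partition is a hypothetical helper that mirrors the table above.
account_for_partition() {
    case "$1" in
        slurm-general-01|slurm-general-02) echo slurmgeneral ;;
        haoresearch)    echo shaoresearch ;;
        lusiliresearch) echo lliresearch ;;
        wangresearch)   echo fwangresearch ;;
        *) echo "unknown partition: $1" >&2; return 1 ;;
    esac
}

# Print (rather than run) the matching sbatch command for a partition
partition=slurm-general-02
account=$(account_for_partition "$partition") &&
    echo "sbatch -p $partition --account=$account myprogram.sh"
```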
Troubleshooting
How to view assigned account
sacctmgr show association -p user=$username # replace $username with the login name to look up (e.g. $USER for your own)
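The -p flag makes sacctmgr print pipe-delimited rows with a header line, which is easy to filter. The sketch below pulls out only the Account and Partition columns by header name. The function name list_assoc is hypothetical, and the column names Account and Partition are assumed to appear in the header on this cluster.

```shell
#!/bin/bash
# list_assoc is a hypothetical filter: it reads "sacctmgr -p" style output on
# stdin and prints "account partition" per row ("-" when no partition is set).
list_assoc() {
    awk -F'|' '
        NR == 1 {
            # locate the columns by header name instead of fixed position
            for (i = 1; i <= NF; i++) {
                if ($i == "Account")   a = i
                if ($i == "Partition") p = i
            }
            if (!a || !p) exit 1
            next
        }
        { print $a, ($p == "" ? "-" : $p) }
    '
}

# Query the current user only where sacctmgr is installed
if command -v sacctmgr >/dev/null 2>&1; then
    sacctmgr show association -p user="$USER" | list_assoc
fi
```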