tech:slurm
This is an old revision of the document!
SLURM - Simple Linux Utility for Resource Management
Introduction
Slurm is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters.
It provides three key functions:
- allocating exclusive and/or non-exclusive access to resources (computer nodes) to users for some duration of time so they can perform work,
- providing a framework for starting, executing, and monitoring work (typically a parallel job such as MPI) on a set of allocated nodes, and
- arbitrating contention for resources by managing a queue of pending jobs.
/data/www/wiki.inf.unibz.it/data/attic/tech/slurm.1567761442.txt.gz · Last modified: 2019/09/06 11:17 by kohofer