User Tools

Site Tools


tech:slurm

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
tech:slurm [2019/09/06 14:59] – [Controller] kohofertech:slurm [2019/09/06 15:33] – [Controller] kohofer
Line 154: Line 154:
  
   root@controller# systemctl start slurmdbd   root@controller# systemctl start slurmdbd
 +
 +=== Authentication ===
 +
 +Copy /etc/munge.key to all compute nodes
 +
 +  scp /etc/munge/munge.key csadmin@10.7.20.97:/tmp/.
 +  
 +Run a job from slurm-ctrl
 +
 +  ssh csadmin@slurm-ctrl
 +  srun -N 1 hostname
 +  linux1
 +
 +
  
 === Test munge === === Test munge ===
Line 174: Line 188:
 {{:tech:slurm-hpc-cluster_compute-node.png?400|}} {{:tech:slurm-hpc-cluster_compute-node.png?400|}}
  
-=== Authentication ===+==== Installation ==== 
 + 
 +  ssh -l csadmin 10.7.20.102 
 +  sudo apt install slurm-wlm 
 +  
 +Generate ssh keys 
 +  ssh-keygen 
 + 
 +Copy ssh-keys to slurm-ctrl (using IP, because no DNS in place) 
 + 
 +  ssh-copy-id -i ~/.ssh/id_rsa.pub csadmin@10.7.20.97: 
 + 
 +Become root to do important things: 
 + 
 +  sudo -i 
 +  vi /etc/hosts 
 + 
 +Add those lines below to the /etc/hosts file 
 + 
 +<code> 
 +10.7.20.97      slurm-ctrl.inf.unibz.it slurm-ctrl 
 +10.7.20.98      linux1.inf.unibz.it     linux1 
 +</code>
  
-  ssh root@slurm-ctrl 
-  root@controller# for i in `seq 1 2`; do scp /etc/munge/munge.key linux-${i}:/etc/munge/munge.key; done 
  
-  root@compute-1# systemctl start munge 
  
-Run a job from slurm-ctrl 
  
-  ssh csadmin 
-  srun -N 1 hostname 
-  linux1 
  
 +===== Links =====
  
 +[[https://slurm.schedmd.com/overview.html|Slurm Workload Manager Overview]]
  
 +[[https://github.com/mknoxnv/ubuntu-slurm|Steps to create a small slurm cluster with GPU enabled nodes]]
  
 +[[https://implement.pt/2018/09/slurm-in-ubuntu-clusters-pt1/|Slurm in Ubuntu Clusters Part1]]
  
 +[[https://wiki.fysik.dtu.dk/niflheim/SLURM|Slurm batch queueing system]]
  
-https://slurm.schedmd.com/overview.html+[[https://doku.lrz.de/display/PUBLIC/SLURM+Workload+Manager|SLURM Workload Manager]]
/data/www/wiki.inf.unibz.it/data/pages/tech/slurm.txt · Last modified: 2022/11/24 16:17 by kohofer