User Tools

Site Tools


tech:slurm

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
tech:slurm [2019/09/06 14:30] kohofertech:slurm [2019/09/06 14:45] kohofer
Line 19: Line 19:
 Controller name: slurm-ctrl Controller name: slurm-ctrl
  
-  ssh csadmin@slurm-ctrl +  ssh csadmin@slurm-ctrl 
-  sudo apt install slurm-wlm slurm-wlm-doc mailutils sview mariadb-client mariadb-server libmariadb-dev python-dev python-mysqldb+  sudo apt install slurm-wlm slurm-wlm-doc mailutils sview mariadb-client mariadb-server libmariadb-dev python-dev python-mysqldb
  
 === Install Maria DB Server === === Install Maria DB Server ===
  
-  apt-get install mariadb-server +  apt-get install mariadb-server 
-  systemctl start mysql +  systemctl start mysql 
-  mysql -u root+  mysql -u root
   create database slurm_acct_db;   create database slurm_acct_db;
   create user 'slurm'@'localhost';   create user 'slurm'@'localhost';
Line 41: Line 41:
 === Configure munge === === Configure munge ===
  
-  ssh csadmin@linux1+  ssh csadmin@linux1
   scp slurm-ctrl:/etc/munge/munge.key /etc/munge/   scp slurm-ctrl:/etc/munge/munge.key /etc/munge/
  
Line 166: Line 166:
 === Test munge === === Test munge ===
  
-  munge -n | unmunge | grep STATUS+  munge -n | unmunge | grep STATUS
   STATUS:           Success (0)   STATUS:           Success (0)
-  munge -n | ssh slurm-ctrl unmunge | grep STATUS+  munge -n | ssh slurm-ctrl unmunge | grep STATUS
   STATUS:           Success (0)   STATUS:           Success (0)
  
 === Test Slurm === === Test Slurm ===
  
-  sinfo+  sinfo
   PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST   PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
   debug*       up   infinite      1   idle linux1   debug*       up   infinite      1   idle linux1
Line 181: Line 181:
 A compute node is a machine which will receive jobs to execute, sent from the Controller, it runs the slurmd service. A compute node is a machine which will receive jobs to execute, sent from the Controller, it runs the slurmd service.
  
-Zecihnung+{{:tech:slurm-hpc-cluster_compute-node.png?400|}}
  
 === Authentication === === Authentication ===
  
-  ssh root@slurm-ctrl+  ssh root@slurm-ctrl
   root@controller# for i in `seq 1 2`; do scp /etc/munge/munge.key linux-${i}:/etc/munge/munge.key; done   root@controller# for i in `seq 1 2`; do scp /etc/munge/munge.key linux-${i}:/etc/munge/munge.key; done
  
Line 192: Line 192:
 Run a job from slurm-ctrl Run a job from slurm-ctrl
  
-  ssh csadmin +  ssh csadmin 
-  srun -N 1 hostname+  srun -N 1 hostname
   linux1   linux1
  
/data/www/wiki.inf.unibz.it/data/pages/tech/slurm.txt · Last modified: 2022/11/24 16:17 by kohofer