tech:slurm
Differences
This shows you the differences between two versions of the page.
tech:slurm [2019/09/06 14:59] – [Controller] kohofer | tech:slurm [2019/09/06 17:15] – kohofer | ||
---|---|---|---|
Line 22: | Line 22: | ||
ssh slurm-ctrl | ssh slurm-ctrl | ||
- | apt install slurm-wlm slurm-wlm-doc mailutils | + | apt install slurm-wlm slurm-wlm-doc mailutils mariadb-client mariadb-server libmariadb-dev python-dev python-mysqldb |
=== Install MariaDB Server === | === Install MariaDB Server === | ||
Line 51: | Line 51: | ||
=== Central Controller === | === Central Controller === | ||
- | The main configuration file is / | + | The main configuration file is / |
- | vi / | + | vi / |
< | < | ||
Line 114: | Line 114: | ||
</ | </ | ||
- | root@controller# systemctl start slurmctld | + | root@slurm-ctrl# scp / |
+ | root@slurm-ctrl# systemctl start slurmctld | ||
=== Accounting Storage === | === Accounting Storage === | ||
Line 153: | Line 154: | ||
</ | </ | ||
- | root@controller# systemctl start slurmdbd | + | root@slurm-ctrl# systemctl start slurmdbd |
+ | |||
+ | === Authentication === | ||
+ | |||
+ | Copy / | ||
+ | |||
+ | scp / | ||
+ | |||
+ | Allow password-less access to slurm-ctrl | ||
+ | |||
+ | csadmin@slurm-ctrl: | ||
+ | |||
+ | Run a job from slurm-ctrl | ||
+ | |||
+ | ssh csadmin@slurm-ctrl | ||
+ | srun -N 1 hostname | ||
+ | linux1 | ||
+ | |||
=== Test munge === | === Test munge === | ||
Line 174: | Line 193: | ||
{{: | {{: | ||
- | === Authentication | + | === Installation |
- | ssh root@slurm-ctrl | + | ssh -l csadmin 10.7.20.102 |
- | | + | |
+ | |||
+ | Generate ssh keys | ||
- | | + | |
- | Run a job from slurm-ctrl | + | Copy ssh-keys to slurm-ctrl |
- | ssh csadmin | + | ssh-copy-id -i ~/ |
- | | + | |
- | linux1 | + | Become root for the remaining steps: |
+ | |||
+ | | ||
+ | | ||
+ | |||
+ | Add the following lines to the /etc/hosts file | ||
+ | |||
+ | < | ||
+ | 10.7.20.97 | ||
+ | 10.7.20.98 | ||
+ | </ | ||
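Adding the host entries by hand works for two nodes; for repeated runs an idempotent helper can avoid duplicate lines. The following is a minimal sketch, not part of the wiki revision: the function name `add_host_entry` is hypothetical, and the file path, IP address and host name are placeholders supplied by the caller.

```shell
#!/bin/sh
# add_host_entry FILE IP NAME  (hypothetical helper, not from the wiki page)
# Appends "IP<TAB>NAME" to FILE unless a line whose first field is IP exists.
add_host_entry() {
    hosts_file=$1
    ip=$2
    name=$3
    # awk compares the first field literally, so dots in the IP are not
    # treated as regex wildcards. Exit status 0 means "already present".
    if awk -v ip="$ip" '$1 == ip { found = 1 } END { exit !found }' \
        "$hosts_file" 2>/dev/null; then
        return 0
    fi
    printf '%s\t%s\n' "$ip" "$name" >> "$hosts_file"
}
```

Called as root with `/etc/hosts` as the first argument, it would append each compute node once; calling it again with the same IP leaves the file unchanged.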
+ | |||
+ | First copy the munge keys from slurm-ctrl to all compute nodes, then fix their location, | ||
+ | owner and permissions. | ||
+ | |||
+ | mv / | ||
+ | chown munge:munge / | ||
+ | chmod 400 / | ||
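After the `chmod 400` step it can be useful to confirm that the key file really ended up with the intended mode before starting any daemon. The sketch below assumes GNU coreutils `stat` (as shipped on Debian and Ubuntu systems); the function name `check_key_perms` is hypothetical and the key path is passed in by the caller rather than assumed here.

```shell
#!/bin/sh
# check_key_perms FILE EXPECTED_MODE  (hypothetical helper)
# Returns 0 if FILE exists and its octal mode equals EXPECTED_MODE,
# 1 on a mode mismatch, 2 if FILE is missing.
check_key_perms() {
    key_file=$1
    expected_mode=$2
    if [ ! -f "$key_file" ]; then
        echo "missing: $key_file" >&2
        return 2
    fi
    # GNU stat: %a prints the access rights in octal, e.g. 400
    actual_mode=$(stat -c '%a' "$key_file")
    if [ "$actual_mode" != "$expected_mode" ]; then
        echo "$key_file has mode $actual_mode, expected $expected_mode" >&2
        return 1
    fi
}
```

A call such as `check_key_perms "$KEY" 400`, with `KEY` set to the key file moved in the step above, would report a mismatch on stderr and return a non-zero status.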
+ | |||
+ | Place / | ||
+ | |||
+ | mv / | ||
+ | chown root: / | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | ===== Links ===== | ||
+ | |||
+ | [[https:// | ||
+ | [[https:// | ||
+ | [[https:// | ||
+ | [[https:// | ||
+ | [[https:// | ||
+ | [[https:// | ||
- | https://slurm.schedmd.com/ | + | {{ :tech:9-slurm.pdf |Linux Clusters Institute: Scheduling and Resource Management 2017}} |
/data/www/wiki.inf.unibz.it/data/pages/tech/slurm.txt · Last modified: 2022/11/24 16:17 by kohofer