tech:slurm
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionNext revisionBoth sides next revision | ||
tech:slurm [2020/04/24 11:45] – [GCC] kohofer | tech:slurm [2020/05/27 10:57] – kohofer | ||
---|---|---|---|
Line 241: | Line 241: | ||
debug* | debug* | ||
- | If computer node is down | + | If computer node is **<color #ed1c24>down</ |
< | < | ||
Line 247: | Line 247: | ||
PARTITION AVAIL TIMELIMIT | PARTITION AVAIL TIMELIMIT | ||
debug* | debug* | ||
+ | |||
+ | sinfo | ||
+ | PARTITION AVAIL TIMELIMIT | ||
+ | gpu* | ||
+ | gpu* | ||
+ | |||
</ | </ | ||
Line 356: | Line 362: | ||
====== Modules ====== | ====== Modules ====== | ||
+ | |||
+ | ===== Python ===== | ||
+ | |||
+ | ==== Python 3.7.7 ==== | ||
+ | |||
+ | |||
+ | cd / | ||
+ | mkdir / | ||
+ | wget https:// | ||
+ | tar xfJ Python-3.7.7.tar.xz | ||
+ | cd Python-3.7.7/ | ||
+ | ./configure --prefix=/ | ||
+ | make | ||
+ | make install | ||
+ | | ||
+ | |||
+ | ==== Python 2.7.18 ==== | ||
+ | |||
+ | |||
+ | cd / | ||
+ | mkdir / | ||
+ | wget https:// | ||
+ | cd Python-2.7.18 | ||
+ | ./configure --prefix=/ | ||
+ | make | ||
+ | make install | ||
+ | |||
+ | ==== Create modules file ==== | ||
+ | |||
+ | |||
+ | cd / | ||
+ | vi python-2.7.18 | ||
+ | |||
+ | < | ||
+ | #%Module1.0 | ||
+ | proc ModulesHelp { } { | ||
+ | global dotversion | ||
+ | |||
+ | puts stderr " | ||
+ | } | ||
+ | |||
+ | module-whatis " | ||
+ | prepend-path PATH / | ||
+ | |||
+ | </ | ||
+ | | ||
+ | |||
+ | |||
===== GCC ===== | ===== GCC ===== | ||
- | This take a long time! | + | This takes a long time! |
Commands to run to compile gcc-6.1.0 | Commands to run to compile gcc-6.1.0 | ||
Line 394: | Line 448: | ||
make | make | ||
+ | |||
+ | Next error: | ||
+ | |||
+ | < | ||
+ | ../ | ||
+ | | ||
+ | |||
+ | </ | ||
+ | |||
+ | To fix see: [[https:// | ||
+ | or [[https:// | ||
+ | |||
+ | Amend the files according to solution above! | ||
+ | |||
+ | Next error: | ||
+ | |||
+ | < | ||
+ | ... | ||
+ | checking for unzip... unzip | ||
+ | configure: error: cannot find neither zip nor jar, cannot continue | ||
+ | Makefile: | ||
+ | ... | ||
+ | ... | ||
+ | </ | ||
+ | |||
+ | apt install unzip zip | ||
+ | |||
+ | and run make again! | ||
+ | |||
+ | make | ||
+ | |||
+ | Next error: | ||
+ | |||
+ | < | ||
+ | ... | ||
+ | In file included from ../ | ||
+ | ../ | ||
+ | ./ | ||
+ | | ||
+ | ... | ||
+ | </ | ||
+ | |||
+ | Edit the file: / | ||
+ | |||
+ | vi / | ||
+ | |||
+ | <note warning> | ||
+ | |||
+ | < | ||
+ | // kh | ||
+ | ucontext_t *_uc = (ucontext_t *); \ | ||
+ | //struct ucontext *_uc = (struct ucontext *)_p; \ | ||
+ | // kh | ||
+ | |||
+ | </ | ||
+ | |||
+ | Next error: | ||
+ | |||
+ | <code php> | ||
+ | ... | ||
+ | In file included from ../ | ||
+ | ./ | ||
+ | // | ||
+ | | ||
+ | ../ | ||
+ | ./ | ||
+ | | ||
+ | | ||
+ | ../ | ||
+ | | ||
+ | | ||
+ | ../ | ||
+ | | ||
+ | | ||
+ | ../ | ||
+ | ../ | ||
+ | | ||
+ | | ||
+ | ../ | ||
+ | ../ | ||
+ | ../ | ||
+ | | ||
+ | ... | ||
+ | </ | ||
+ | |||
+ | ===== Example ===== | ||
+ | |||
+ | An simple example to use nvidia GPU! | ||
+ | |||
+ | < | ||
+ | #!/bin/bash | ||
+ | |||
+ | #SBATCH --job-name=mnist | ||
+ | #SBATCH --output=mnist.out | ||
+ | #SBATCH --error=mnist.err | ||
+ | |||
+ | #SBATCH --partition gpu | ||
+ | #SBATCH --gres=gpu | ||
+ | #SBATCH --mem-per-cpu=4gb | ||
+ | #SBATCH --nodes 2 | ||
+ | #SBATCH --time=00: | ||
+ | |||
+ | #SBATCH --ntasks=10 | ||
+ | |||
+ | #SBATCH --mail-type=ALL | ||
+ | #SBATCH --mail-user=< | ||
+ | </ | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | ml load miniconda3 | ||
+ | |||
+ | python3 main.py | ||
===== Links ===== | ===== Links ===== | ||
+ | |||
+ | https:// | ||
+ | |||
+ | https:// | ||
+ | |||
+ | https:// | ||
http:// | http:// | ||
https:// | https:// | ||
+ |
/data/www/wiki.inf.unibz.it/data/pages/tech/slurm.txt · Last modified: 2022/11/24 16:17 by kohofer