JeanZayTools

Content:

Information and Sample code for using Jean ZAY supercomputer through Melody resource allocation

Online tutorials: http://www.idris.fr/jean-zay/

Login to JeanZay from IMT Atlantique

Filling and submitting registration forms for JeanZay (contact [email protected])
Connection to IMT Atlantique ssh server: ssh ssh.telecom-bretagne.eu
Connection to JeanZay using JeanZay login: ssh [email protected]

Using Jupyter Notebooks on JeanZay

See http://www.idris.fr/jean-zay/pre-post/jean-zay-jupyter-notebook.html

Considered/tested DL environments

pytorch-gpu/py3/1.4.0, including pytorch and scikit-learn (contact: [email protected])

Interactive execution mode

See http://www.idris.fr/jean-zay/gpu/jean-zay-gpu-exec_interactif.html
An example below for the execution on a single GPU

module load pytorch-gpu/py3/1.4.0 (see above for the environment to be loaded)
salloc --ntasks=1 -A yrf@gpu --cpus-per-task=10 --gres=gpu:1 --hint=nomultithread --partition=gpu_p1
conda activate pytorch-gpu-1.4.0
./python my_main

Batch mode

See http://www.idris.fr/jean-zay/gpu/jean-zay-gpu-exec_mono_batch.html

An example below, when using a submission script:

Write a a submission script, denoted here as mono_gpu.slurm:

#!/bin/bash
#SBATCH -A yrf@gpu                  # account name
#SBATCH --job-name=gpu_mono         # job name
#SBATCH --ntasks=1                  # number of tasks (a unique process here)
#SBATCH --gres=gpu:1                # number of gpu (a unique GPU here)
#SBATCH --cpus-per-task=40          # number fo cores (1/4 of the node)
#SBATCH --partition=gpu_p1          # change partition (preprost: test; gpu_p1: runs)
#SBATCH --hint=nomultithread        # physical cores here (not logical)
#SBATCH --time=04:00:00             # execution time required (HH:MM:SS)
#SBATCH --output=gpu_mono%j.out     # output file
#SBATCH --error=gpu_mono%j.err      # error file

# cleaning the modules
module purge
# loading some modules
module load geos/3.7.3
module load tensorflow-gpu/py3/1.14-openmpi
# add required informations from your path
export PYTHONPATH=${HOME}/DINAE:${PYTHONPATH}
# run the python code launch.py with 3 input arguments
python launch.py ${1} ${2} ${3}

Submit the script:

sbatch mono_gpu.slurm arg1 arg2 arg3

Use this command line to monitor your job

squeue -u UserID

And this one to kill it, if you figure a problem/bug in your application:

scancel JobID

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JeanZayTools

Content:

Online tutorials: http://www.idris.fr/jean-zay/

Login to JeanZay from IMT Atlantique

Using Jupyter Notebooks on JeanZay

Considered/tested DL environments

Interactive execution mode

Batch mode

About

Releases

Packages

maxbeauchamp/JeanZayTools

Folders and files

Latest commit

History

Repository files navigation

JeanZayTools

Content:

Online tutorials: http://www.idris.fr/jean-zay/

Login to JeanZay from IMT Atlantique

Using Jupyter Notebooks on JeanZay

Considered/tested DL environments

Interactive execution mode

Batch mode

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages