Schedule¶
DAY 1 - Tuesday 27/05 | |
09:00 CEST
10:00 EEST |
Welcome and Introduction
Presenters: Jørn Dietze and Gregor Decristoforo (LUST) |
09:15 CEST
10:15 EEST |
Introduction to LUMI
Presenter: Jørn Dietze (LUST) |
09:45 CEST
10:45 EEST |
Using the LUMI web interface
Presenters: Mats Sjöberg (CSC) and Oskar Taubert (CSC) |
10:05 CEST
11:05 EEST |
Hands-on: Run a simple PyTorch example notebook |
10:35 CEST
11:35 EEST |
Break (15 minutes) |
10:50 CEST
11:50 EEST |
Your first AI training job on LUMI
Presenters: Mats Sjöberg (CSC) and Oskar Taubert (CSC) |
11:20 CEST
12:20 EEST |
Hands-on: Run a simple single-GPU PyTorch AI training job |
12:05 CEST
13:05 EEST |
Lunch break (60 minutes) |
13:05 CEST
14:05 EEST |
Understanding GPU activity & checking jobs
Presenter: Samuel Antao (AMD) |
13:25 CEST
14:25 EEST |
Hands-on: Checking GPU usage interactively using rocm-smi |
13:45 CEST
14:45 EEST |
Running containers on LUMI Presenter: Gregor Decristoforo (LUST) |
14:05 CEST
15:05 EEST |
Hands-on: Pull and run a container |
14:25 CEST
15:25 EEST |
Break (15 minutes) |
14:40 CEST
15:40 EEST |
Building containers from conda/pip environments Presenter: Jørn Dietze (LUST) |
15:00 CEST
16:00 EEST |
Hands-on: Creating a conda environment file and building a container using cotainr |
15:20 CEST
16:20 EEST |
Extending containers with virtual environments for faster testing Presenter: Gregor Decristoforo (LUST) |
15:40 CEST
16:40 EEST |
Hands-on: Getting started with your own project
Bring your own AI code, you want to run on LUMI, and spent some time applying what you have learned during the workshop - with on-site support from LUST/AMD. |
17:00 CEST
18:00 EEST |
End of the course day |
DAY 2 - Wednesday 28/05 | |
09:00 CEST
10:00 EEST |
Scaling AI training to multiple GPUs
Presenters: Mats Sjöberg (CSC) and Oskar Taubert (CSC) |
09:40 CEST
10:40 EEST |
Hands-on: Converting the PyTorch single GPU AI training job to use all GPUs in a single node via DDP |
10:20 CEST
11:20 EEST |
Break (15 minutes) |
10:35 CEST
11:35 EEST |
Extreme scale AI
Presenter: Samuel Antao (AMD) |
11:20 CEST
12:20 EEST |
Demo/Hands-on: Using multiple nodes |
11:35 CEST
12:35 EEST |
Loading training data on LUMI
Presenter: Harvey Richardson (HPE) |
12:00 CEST
13:00 EEST |
Lunch break (60 minutes) |
13:00 CEST
14:00 EEST |
Coupling machine learning with HPC simulation Presenter: Harvey Richardson (HPE) |
13:30 CEST
14:30 EEST |
Hands-on: Advancing your own project and Q&A
Bring your own AI code, you want to run on LUMI, and spent some time applying what you have learned during the workshop - with on-site support from LUST/AMD. |
16:30 CEST
17:30 EEST |
End of the course day |