26

Mar

Teaser image to LRZ Bootcamp with NIVIDA on Accelerated Distributed Computing

Bootcamp

LRZ Bootcamp With NIVIDA on Accelerated Distributed Computing

Train Multiple Systems and Learn About New Tools

   26.03.2025

   9:00 am - 2:15 pm

   Leibniz Rechenzentrum, Boltzmannstr. 1, 85748 Garching bei München

The LRZ (Leibniz Supercomputing Centre) BDAI Team, together with NVIDIA, is organizing a Bootcamp on Accelerated Distributed Computing - powered by NVIDIA on March 26, 2025, at 9 AM.

The key topics addressed are:
1. Training Parallelization - Various methods to split and optimize training across multiple systems, including data/model parallelism and advanced techniques like ZeRO and Mixture-of-Experts

2. Communication Systems - How components communicate within and between nodes using tools like NCCL

3. Implementation Tools - Frameworks like NeMo Megatron and Profiler

You can register yourself for the bootcamp under the following link, where you can also find a detailed agenda.


Related

Link to DataFest 2025

Hackathon  •  28.03.2025 - 30.03.2025  •  LMU Munich, Main Building

DataFest 2025

DataFest Germany at the LMU Munich on March, 28-30, 2025.