Linux-based parallel computer / HPC cluster Alex (Megware)

Model: GPU-Cluster 2022

Manufacturer: Megware (2022)

URL: https://hpc.fau.de/systems-services/systems-documentation-instructions/clusters/alex-cluster/

Location: Erlangen

Usage: Also available to external users

DFG Key: 7000 Data processing systems, central computing facilities

Description

FAU’s Alex cluster (system integrator: Megware) is a high-performance compute resource with Nvidia GPGPU accelerators, partly equipped with a high-speed interconnect. It is intended for single- and multi-GPGPU workloads, e.g. from molecular dynamics or machine learning. Alex serves both as part of FAU’s basic Tier 3 resources and as an NHR project resource.

  • 2 front-end nodes, each with two AMD EPYC 7713 “Milan” processors (64 cores per chip) running at 2.0 GHz with 256 MB shared L3 cache per chip, 512 GB of RAM, and a 100 GbE connection to RRZE’s network backbone, but no GPGPUs.
  • 8 GPGPU nodes, each with two AMD EPYC 7662 “Rome” processors (64 cores per chip) running at 2.0 GHz with 256 MB shared L3 cache per chip, 512 GB of DDR4-RAM, four Nvidia A100 (each 40 GB HBM2 @ 1,555 GB/s; DGX board with NVLink; 9.7 TFlop/s in FP64 or 19.5 TFlop/s in FP32), one HDR200 InfiniBand HCA, 25 GbE, and 6 TB on local NVMe SSDs. (During 2021 and early 2022, these nodes were part of TinyGPU.)
  • 20 GPGPU nodes, each with two AMD EPYC 7713 “Milan” processors (64 cores per chip) running at 2.0 GHz with 256 MB shared L3 cache per chip, 1,024 GB of DDR4-RAM, eight Nvidia A100 (each 40 GB HBM2 @ 1,555 GB/s; HGX board with NVLink; 9.7 TFlop/s in FP64 or 19.5 TFlop/s in FP32), two HDR200 InfiniBand HCAs, 25 GbE, and 14 TB on local NVMe SSDs.
  • 12 GPGPU nodes, each with two AMD EPYC 7713 “Milan” processors (64 cores per chip) running at 2.0 GHz with 256 MB shared L3 cache per chip, 2,048 GB of DDR4-RAM, eight Nvidia A100 (each 80 GB HBM2e @ 2,039 GB/s; HGX board with NVLink; 9.7 TFlop/s in FP64 or 19.5 TFlop/s in FP32), two HDR200 InfiniBand HCAs, 25 GbE, and 14 TB on local NVMe SSDs.
  • 38 GPGPU nodes, each with two AMD EPYC 7713 “Milan” processors (64 cores per chip) running at 2.0 GHz with 256 MB shared L3 cache per chip, 512 GB of DDR4-RAM, eight Nvidia A40 (each with 48 GB GDDR6 @ 696 GB/s; 37.42 TFlop/s in FP32), 25 GbE, and 7 TB on local NVMe SSDs.
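As a quick sanity check, the per-node figures above can be aggregated into cluster-wide GPU counts and theoretical peak throughput. This is only a sketch using the vendor per-GPU peak numbers quoted in the list, not measured performance:

```python
# Aggregate GPU counts and theoretical peaks from the node list above.
# (node count, GPUs per node) pairs as given in the hardware description.
a100_nodes = [(8, 4), (20, 8), (12, 8)]  # A100 nodes (DGX/HGX boards)
a40_nodes = [(38, 8)]                    # A40 nodes

n_a100 = sum(nodes * gpus for nodes, gpus in a100_nodes)
n_a40 = sum(nodes * gpus for nodes, gpus in a40_nodes)

# Vendor per-GPU peaks quoted above: A100 9.7 TFlop/s FP64, A40 37.42 TFlop/s FP32.
peak_fp64_a100 = n_a100 * 9.7   # aggregate A100 FP64 peak, TFlop/s
peak_fp32_a40 = n_a40 * 37.42   # aggregate A40 FP32 peak, TFlop/s

print(n_a100, n_a40)             # 288 304
print(round(peak_fp64_a100, 1))  # 2793.6
print(round(peak_fp32_a40, 1))   # 11375.7
```

So the full system provides 288 A100 and 304 A40 GPUs, for roughly 2.8 PFlop/s of aggregate FP64 peak on the A100 partition alone.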

Involved Person(s)

Organisation(s)

Research Areas

Funding Sources