1.3. DGX H100 System Topology
Here is an image of the DGX H100 system topology.
1.4. DGX OS Software
The DGX H100 system comes pre-installed with a DGX software stack incorporating the following
components:
▶ An Ubuntu server distribution with supporting packages.
▶ The following system management and monitoring software:
  ▶ NVIDIA System Management (NVSM)
    Provides active health monitoring and system alerts for NVIDIA DGX nodes in a data center. It also provides simple commands for checking the health of the DGX H100 system from the command line.
  ▶ Data Center GPU Management (DCGM)
    This software enables node-wide administration of GPUs and can be used for cluster and data-center level management.
▶ DGX H100 system support packages.
▶ The NVIDIA GPU driver
▶ Docker Engine
▶ NVIDIA Container Toolkit
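
As a quick sanity check of the stack listed above, the following minimal sketch reports which components expose a command-line tool on the current PATH. It is an illustration rather than an NVIDIA-provided utility: the mapping of components to executable names (`nvsm`, `dcgmi`, `nvidia-smi`, `docker`, `nvidia-ctk`) is an assumption based on the commonly used names of these tools, and the helper names `find_components` and `format_report` are hypothetical.

```python
import shutil

# Assumed mapping from each stack component to the executable it usually
# installs. These names are not taken from this section of the guide.
COMPONENT_COMMANDS = {
    "NVIDIA System Management (NVSM)": "nvsm",
    "Data Center GPU Management (DCGM)": "dcgmi",
    "NVIDIA GPU driver": "nvidia-smi",
    "Docker Engine": "docker",
    "NVIDIA Container Toolkit": "nvidia-ctk",
}


def find_components(which=shutil.which):
    """Return {component: executable path, or None if not on PATH}.

    `which` is injectable so the lookup can be exercised off-system.
    """
    return {name: which(cmd) for name, cmd in COMPONENT_COMMANDS.items()}


def format_report(found):
    """Render one 'component: path' line per component."""
    lines = []
    for name, path in found.items():
        status = path if path else "not found on PATH"
        lines.append(f"{name}: {status}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(format_report(find_components()))
```

A missing entry only means the executable was not found on the PATH of the current user; it does not by itself indicate that the component is absent. Once `nvsm` is located, a health summary can typically be requested from the command line with `sudo nvsm show health`.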
NVIDIA DGX H100 User Guide