1.2.
Recommended Tools
List of recommended tools needed to service the NVIDIA DGX A100.
‣
Laptop
‣
USB key with tools and drivers
‣
USB key imaged with the DGX Server OS ISO
‣
Screwdrivers (Phillips #1 and #2, small flat head)
‣
KVM Crash Cart
‣
Anti-static wrist strap
‣
Masking tape or label maker
‣
Tie wraps or velcro for cable management
‣
Box cutter
‣
Black Permanent Marker
‣
Packing materials
1.3.
Customer Support
Contact NVIDIA Enterprise Support for assistance in reporting, troubleshooting, or diagnosing
problems with your DGX A100 system. Also contact NVIDIA Enterprise Support for assistance
in installing or moving the DGX A100 system.
For details on how to obtain support, visit the NVIDIA Enterprise Support web site
www.nvidia.com/en-us/support/enterprise/
1.4.
Running the Pre-flight Test
Instructions for running the DGX stress test.
NVIDIA recommends running the pre-flight stress test before putting a system into a
production environment or after servicing. You can specify running the test on the GPUs, CPU,
memory, and storage, and also specify the duration of the tests.
To run the tests, use NVSM.
Syntax:
sudo nvsm stress-test [--usage] [--force] [--no-prompt] [<test>...] [DURATION]
$
For help on running the test, issue the following.
sudo nvsm stress-test --usage
~$
NVIDIA DGX A100 System
).
Introduction
(https://
DU-10044-001 _v01 | 2