Setting Filesystem Quotas; Switching Between RAID 0 and RAID 5 - NVIDIA DGX A100 User Manual

Use the server IP address or the server name provided by your network administrator.
3. Mount the NFS export.
$ sudo mount /mnt
/mnt is an example mount point.
4. Verify caching is enabled.
$ cat /proc/fs/nfsfs/volumes
In the output, look for FSC=yes.
The NFS export is then mounted and cached automatically on the DGX A100 system on every
subsequent reboot.
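
The check in step 4 can also be scripted. The following is a minimal sketch, not part of the
DGX software: the function name nfs_cache_enabled is hypothetical, and it assumes that the
volumes listing reports caching either as a bare "yes" value under an FSC column or as a
literal "FSC=yes" field. Verify the format on your own system before relying on it.

```shell
# Hypothetical helper: succeed if any NFS volume in the listing has caching enabled.
# Assumption: the listing marks cached volumes with a field that is either
# "yes" (under an FSC column) or "FSC=yes".
nfs_cache_enabled() {
    volumes_file="${1:-/proc/fs/nfsfs/volumes}"
    awk '{
             for (i = 1; i <= NF; i++)
                 if ($i == "yes" || $i == "FSC=yes") found = 1
         }
         END { exit(found ? 0 : 1) }' "$volumes_file"
}
```

Called without an argument, the function reads /proc/fs/nfsfs/volumes. It returns 0 when a
cached volume is listed and a nonzero status otherwise, so it can be used directly in an
if statement.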

8.1. Setting Filesystem Quotas

When running NGC containers, you might need to limit the amount of disk space that is
used on a filesystem to avoid filling up the partition. Refer to
https://www.digitalocean.com/community/tutorials/how-to-set-filesystem-quotas-on-ubuntu-18-04
for information about how to set filesystem quotas on Ubuntu 18.04 and later.

8.2. Switching Between RAID 0 and RAID 5

As supplied from the factory, the RAID level of the DGX A100 RAID array is RAID 0, which
provides the maximum storage capacity but does not provide any redundancy.
If one SSD in the array fails, all data stored on the array is lost. If you are willing to accept
reduced total storage capacity in return for some level of protection against failure of an
SSD, you can change the level of the RAID array to RAID 5.
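
To estimate how much capacity the change costs, the standard rule for arrays of equal-sized
drives is that RAID 0 exposes the capacity of all drives, while RAID 5 gives up the
equivalent of one drive to parity. The following is a minimal sketch of that arithmetic. The
function name raid_usable_capacity is hypothetical, and the drive count and size in the
usage note are illustrative figures, not values from this manual.

```shell
# Hypothetical helper: usable capacity of an array of equal-sized drives.
# Usage: raid_usable_capacity LEVEL DRIVE_COUNT DRIVE_SIZE
# The result is in the same unit as DRIVE_SIZE and ignores filesystem overhead.
raid_usable_capacity() {
    level="$1"
    drives="$2"
    size="$3"
    case "$level" in
        0)
            # RAID 0 stripes across every drive with no redundancy.
            echo $(( drives * size ))
            ;;
        5)
            # RAID 5 spends the equivalent of one drive on parity.
            if [ "$drives" -lt 3 ]; then
                echo "RAID 5 needs at least 3 drives" >&2
                return 1
            fi
            echo $(( (drives - 1) * size ))
            ;;
        *)
            echo "unsupported RAID level: $level" >&2
            return 1
            ;;
    esac
}
```

For example, with an illustrative array of 8 drives of 3840 GB each,
raid_usable_capacity 0 8 3840 prints 30720 and raid_usable_capacity 5 8 3840 prints 26880.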
Before you change the RAID level of the DGX A100 RAID array, back up all data on the array
that you want to preserve. Changing the RAID level of the DGX A100 RAID array erases all data
stored on the array.
The DGX A100 software includes the configure_raid_array.py custom script, which you
can use to change the level of the RAID array without unmounting the RAID volume.
To change the RAID level to RAID 5, run the following command:
$ sudo configure_raid_array.py -m raid5
After you change the RAID level to RAID 5, the RAID array is rebuilt. A RAID array that is
being rebuilt is online and ready to be used, but a check on the health of the DGX system
reports the status of the RAID volume as unhealthy.
The time required to rebuild the RAID array depends on the workload on the system. On an
idle system, the rebuild will take about 30 minutes to complete.
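
Rebuild progress can be followed from the command line. The following is a minimal sketch
that assumes the RAID array is a Linux software RAID (md) device whose status appears in
/proc/mdstat; this manual does not state that, so confirm it on your system first. The
function name raid_rebuild_progress is hypothetical.

```shell
# Hypothetical helper: print the rebuild percentage from an mdstat-style listing.
# Assumption: the array is a Linux md device, so a rebuild shows up as a line such as
#   [==>..................]  recovery = 12.6% (...) finish=27.5min speed=...
# Returns nonzero and prints nothing when no rebuild is in progress.
raid_rebuild_progress() {
    mdstat_file="${1:-/proc/mdstat}"
    awk '/(recovery|resync) *=/ {
             for (i = 1; i <= NF; i++)
                 if ($i ~ /^[0-9.]+%$/) { print $i; found = 1; exit }
         }
         END { exit(found ? 0 : 1) }' "$mdstat_file"
}
```

Called without an argument, the function reads /proc/mdstat. Running it periodically, for
example under watch, shows how far the rebuild has progressed.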
To change the RAID level to RAID 0, run the following command:
