Monitoring; Thermal Sensors - Nvidia ConnectX-6 Manual

InfiniBand/Ethernet adapter cards for OCP Spec 3.0

Monitoring

Thermal Sensors

The adapter card incorporates the ConnectX IC, which operates at temperatures between 0°C and 105°C.
Three thermal thresholds affect the overall system operation state:
Warning – 105°C: On managed systems only. When the device crosses the 105°C threshold, the management software issues a Warning Threshold message, indicating to the system administrator that the card has crossed the warning threshold. This threshold does not require or lead to any hardware action (such as adapter card shutdown).
Critical – 115°C: When the device crosses this temperature, the firmware automatically shuts down the device.
Emergency – 130°C: If the firmware fails to shut down the device upon crossing the critical threshold, the device automatically shuts down upon crossing the emergency (130°C) threshold.
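
The three thresholds can be expressed as a small classification routine, for example in a monitoring script. The following is a minimal sketch, not part of the adapter's firmware or management software: the names `ThermalState` and `classify_temperature` are hypothetical, and it assumes a threshold counts as crossed once the reading is at or above it.

```python
from enum import Enum

# Threshold values taken from the thermal threshold definitions (degrees Celsius).
WARNING_C = 105
CRITICAL_C = 115
EMERGENCY_C = 130


class ThermalState(Enum):
    """Hypothetical labels for the states implied by the thresholds."""
    NORMAL = "normal"
    WARNING = "warning"      # message issued on managed systems, no hardware action
    CRITICAL = "critical"    # firmware shuts down the device
    EMERGENCY = "emergency"  # device shuts itself down if firmware did not


def classify_temperature(temp_c: float) -> ThermalState:
    """Return the highest threshold that the reading has reached.

    Assumption: a threshold is treated as crossed when temp_c >= threshold.
    """
    if temp_c >= EMERGENCY_C:
        return ThermalState.EMERGENCY
    if temp_c >= CRITICAL_C:
        return ThermalState.CRITICAL
    if temp_c >= WARNING_C:
        return ThermalState.WARNING
    return ThermalState.NORMAL


if __name__ == "__main__":
    for reading in (70, 105, 118, 131):
        print(reading, classify_temperature(reading).value)
```

Checking the highest threshold first keeps the branches mutually exclusive, so a reading of 131°C is reported only as an emergency and not also as a warning.
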
The card's thermal sensors can be read through the system's SMBus. The user can read these sensors and adapt the system airflow according to the readouts and the thermal requirements of the above-mentioned IC.
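
As an illustration of adapting airflow to sensor readouts, the sketch below maps a temperature reading to a fan duty cycle with a linear ramp. Everything in it is an assumption made for the example: the function name `fan_duty_percent`, the ramp points (60°C and 95°C), and the duty limits are hypothetical and do not come from the manual. How the reading is obtained over SMBus is platform-specific and is not shown.

```python
def fan_duty_percent(
    temp_c: float,
    ramp_start_c: float = 60.0,  # assumed: below this, run at minimum duty
    ramp_end_c: float = 95.0,    # assumed: full airflow before the 105°C warning
    min_duty: float = 30.0,
    max_duty: float = 100.0,
) -> float:
    """Map a temperature reading to a fan duty cycle (percent).

    Linear interpolation between (ramp_start_c, min_duty) and
    (ramp_end_c, max_duty), clamped outside that range.
    """
    if ramp_end_c <= ramp_start_c:
        raise ValueError("ramp_end_c must be greater than ramp_start_c")
    if temp_c <= ramp_start_c:
        return min_duty
    if temp_c >= ramp_end_c:
        return max_duty
    fraction = (temp_c - ramp_start_c) / (ramp_end_c - ramp_start_c)
    return min_duty + fraction * (max_duty - min_duty)


if __name__ == "__main__":
    for reading in (50.0, 77.5, 95.0, 110.0):
        print(f"{reading:.1f} C -> {fan_duty_percent(reading):.1f}% duty")
```

Placing the end of the ramp below the warning threshold is a design choice in this sketch, so that the fans reach full speed before the card reports a warning.
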