Huawei TaiShan Troubleshooting Manual page 68

TaiShan Servers Troubleshooting
5 Diagnosing and Rectifying Faults
Issue 12 (2022-08-30)
Copyright © Huawei Technologies Co., Ltd.

Fault Symptom
A communication error occurs on a network port.

Handling Procedure
1. Check whether the network cable is connected properly to the network port.
2. Use the Computing Product Compatibility Checker to check whether the NIC type is compatible with the server board. Contact Huawei technical support to check whether the NIC firmware and driver versions match the OS version. If they do not match, upgrade the NIC firmware and driver first.
3. Run the ifconfig ethN up command in Linux to bring the network ports up (the command may vary in different OSs), then run the ethtool ethN command to check that the link is detected on the required network ports. To check whether IP addresses are set for those ports, run the ifconfig ethN command.
4. Run the ethtool -p ethN command in Linux (the command may vary in other OSs) to check whether the information in the network port configuration file of the server is consistent with the actual physical network ports, and check whether the network port status indicators are on and whether the network ports on the switch are up.
   NOTE
   The ethtool -p ethN command applies only to PCIe cards.
5. Check the settings of IP addresses, gateway addresses, VLANs, bondings, and uplink switch network ports.
6. Collect OS logs. For details, see 4.2 Collecting OS Logs.

Quick Recovery Method
1. Use the ping command to check whether the server or other servers on the network have network faults.
   ● If the fault occurs on more than one server, check whether the external switching network is normal.
   ● If the fault occurs on only one server, go to 2.
2. Check the network port status (whether the status indicator is steady on). If the network port status is link down (the status indicator is off), exchange the module, cable, and uplink switch port corresponding to the abnormal network port with those corresponding to the normal network port to check whether the network port is normal. Replace or adjust the component based on the site requirements.
3. If the NIC is causing the fault, restart the server when the interruption will not affect services, and check whether the communication is normal. If the fault persists, power the server off and on. If the fault still persists, replace the NIC.
4. If the fault persists, contact Huawei technical support.
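The link check in step 3 of the handling procedure can be scripted when several ports need to be inspected. The following is a minimal sketch, not part of the original manual. It assumes a Linux host where ethtool prints a "Link detected: yes" or "Link detected: no" line for each port. The function names link_state and check_ports are hypothetical helpers introduced here for illustration, and eth0 and eth1 in the usage line stand in for the actual port names.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not from the manual): reads `ethtool ethN` output on
# stdin and prints "up", "down", or "unknown" based on the "Link detected" line.
link_state() {
    local line
    while IFS= read -r line || [ -n "$line" ]; do
        case "$line" in
            *"Link detected: yes"*) echo "up";   return 0 ;;
            *"Link detected: no"*)  echo "down"; return 0 ;;
        esac
    done
    # No "Link detected" line was found (for example, ethtool is missing or
    # the interface does not exist).
    echo "unknown"
}

# Hypothetical helper: prints one "name: state" line per interface given.
check_ports() {
    local ifname
    for ifname in "$@"; do
        printf '%s: %s\n' "$ifname" "$(ethtool "$ifname" 2>/dev/null | link_state)"
    done
}
```

For example, check_ports eth0 eth1 prints one line per port. A port reported as "down" or "unknown" is a candidate for the cable, module, and switch port checks described in the procedure.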
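Step 1 of the quick recovery method, which uses ping to decide whether the fault affects one server or several, can be sketched as a small script. This is an illustration under stated assumptions rather than a procedure from the manual. It assumes the Linux iputils ping, where -c 1 sends a single echo request and -W 1 waits at most one second for a reply (the options may vary in other OSs). The names probe, fault_scope, and the PING_CMD override are hypothetical and exist only to keep the sketch testable.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not from the manual): returns success if the host
# answers a single ping. PING_CMD is an assumed override hook for testing;
# by default the Linux iputils ping is used.
probe() {
    ${PING_CMD:-ping -c 1 -W 1} "$1" >/dev/null 2>&1
}

# Hypothetical helper: counts the unreachable hosts among the arguments and
# prints which branch of step 1 applies.
fault_scope() {
    local host
    local failed=0
    for host in "$@"; do
        probe "$host" || failed=$((failed + 1))
    done
    if [ "$failed" -gt 1 ]; then
        echo "multiple servers unreachable: check the external switching network"
    elif [ "$failed" -eq 1 ]; then
        echo "one server unreachable: go to step 2"
    else
        echo "all servers reachable"
    fi
}
```

Calling fault_scope with the addresses of the affected server and a few of its neighbors gives a quick first indication. A host that blocks ICMP will also appear unreachable, so the result should be confirmed against the port indicators and the switch before any component is replaced.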
