IBM Power AC922 8335-GTW Handbook page 22

Problem analysis, system parts, and locations

| If | Then |
|---|---|
| No: | Continue with the next step. |

2. To identify the correct service procedure to perform by using operating system log information,
complete the following steps:
a) Log in as the root user.
b) To display the operating system logs, type dmesg and press Enter.
3. Scan the operating system log entries from around the time that the problem started for the first
occurrence of keywords, such as fail, failure, or failed. When you find a keyword that accompanies one
or more of the resource names in the following table, a service action is required. Use the following
table to determine the service procedure to perform for your type of problem.

Table 1. Resource names, examples, and service procedures for different types of operating system logs.

| Resource name | Example of a log requiring a service action | Type of problem | Service procedure |
|---|---|---|---|
| eth1, eth2, eth3, enPxxxxx, where xxxxx indicates the network port. | Failed to re-initialize device; Link Down | Network | Go to "Resolving a network adapter problem" on page 9. |
| mlx5_core | health_care: handling bad device here | Network | Go to "Resolving a network adapter problem" on page 9. |
| tg3 | PCI I/O error detected.; Link is Down; aborting | Network | Go to "Resolving a network adapter problem" on page 9. |
| NVRM | RmInitAdapter failed! | Graphics | Go to "Resolving a graphics processing unit problem" on page 10. |
| nvidia-nvlink | IBMNPU: NPU FENCE detected, machine power cycle required | Graphics | Go to "Resolving a graphics processing unit problem" on page 10. |
| nvme | Failed status: ffffffff, reset controller | NVMe Flash adapter | Go to "Resolving an NVMe Flash adapter problem" on page 14. |
| sda, sdb, sdc | FAILED Result | Storage | Go to "Resolving a storage device problem" on page 13. |

8 Power Systems: Problem analysis, system parts, and locations for the 8335-GTC, 8335-GTG, 8335-GTH, 8335-GTW, and 8335-GTX
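The log scan in step 3 can be sketched as a small script. This is a minimal illustration, not an IBM tool. The names TABLE_1, KEYWORDS, read_dmesg, and first_service_action are hypothetical, and the regular expressions are one possible reading of the resource names in Table 1. It checks only the three keywords that step 3 names, so messages such as "Link Down" would need additional keywords to be caught.

```python
import re
import subprocess

# Rows of Table 1: (resource-name pattern, type of problem, service procedure).
# The regular expressions are this sketch's own reading of the resource names.
TABLE_1 = [
    (r"\beth[1-3]\b|\benP\w+", "Network", "Resolving a network adapter problem"),
    (r"\bmlx5_core\b", "Network", "Resolving a network adapter problem"),
    (r"\btg3\b", "Network", "Resolving a network adapter problem"),
    (r"\bNVRM\b", "Graphics", "Resolving a graphics processing unit problem"),
    (r"\bnvidia-nvlink\b", "Graphics", "Resolving a graphics processing unit problem"),
    (r"\bnvme", "NVMe Flash adapter", "Resolving an NVMe Flash adapter problem"),
    (r"\bsd[abc]\b", "Storage", "Resolving a storage device problem"),
]

# Keywords named in step 3 (fail, failure, failed); extend this as needed.
KEYWORDS = re.compile(r"fail(?:ed|ure)?", re.IGNORECASE)


def read_dmesg():
    """Return the dmesg output as a list of lines (typically requires root)."""
    result = subprocess.run(["dmesg"], capture_output=True, text=True, check=True)
    return result.stdout.splitlines()


def first_service_action(lines):
    """Return (line, type of problem, procedure) for the first log line that
    contains a keyword together with a Table 1 resource name, or None."""
    for line in lines:
        if not KEYWORDS.search(line):
            continue
        for pattern, problem_type, procedure in TABLE_1:
            if re.search(pattern, line):
                return line, problem_type, procedure
    return None
```

On a running system, first_service_action(read_dmesg()) returns the first matching log line with its type of problem and service procedure, or None when no keyword accompanies a listed resource name. The script only narrows the search. Confirm the result against the log entries from around the time that the problem started.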
