IBM Power 720 Overview page 178

Hide thumbs Also See for Power 720:

Overview (59 pages)

Installation manual (64 pages)

Table Of Contents

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

page of 206

/ 206
Contents
Table of Contents
Bookmarks

Table of Contents

system error log where it is analyzed by error log analysis (ELA) routines. Appropriate

actions are taken to report the boot-time error for subsequent service, if required.

Concurrent access to the service processors menus of the ASMI

This access allows nondisruptive abilities to change system default parameters,

interrogate service processor progress and error logs, and set and reset server indicators

(Guiding Light for midrange and high-end servers, Light Path for low-end servers),

accessing all service processor functions without having to power down the system to the

standby state. This allows the administrator or service representative to dynamically

access the menus from any web browser-enabled console that is attached to the Ethernet

service network, concurrently with normal system operation.

Managing the interfaces for connecting uninterruptible power source systems to the

POWER processor-based systems, performing timed power-on (TPO) sequences, and

interfacing with the power and cooling subsystem

Error checkers

IBM POWER processor-based systems contain specialized hardware detection circuitry that

is used to detect erroneous hardware operations. Error checking hardware ranges from parity

error detection coupled with processor instruction retry and bus retry, to ECC correction on

caches and system buses.

All IBM hardware error checkers have distinct attributes:

Continuous monitoring of system operations to detect potential calculation errors.

Attempts to isolate physical faults based on runtime detection of each unique failure.

Ability to initiate a wide variety of recovery mechanisms designed to correct the problem.

The POWER processor-based systems include extensive hardware and firmware

recovery logic.

Fault isolation registers

Error checker signals are captured and stored in hardware fault isolation registers (FIRs). The

associated logic circuitry is used to limit the domain of an error to the first checker that

encounters the error. In this way, runtime error diagnostics can be deterministic so that for

every check station, the unique error domain for that checker is defined and documented.

Ultimately, the error domain becomes the field-replaceable unit (FRU) call, and manual

interpretation of the data is not normally required.

First-failure data capture

First-failure data capture (FFDC) is an error isolation technique. It ensures that when a fault is

detected in a system, through error checkers or other types of detection methods, the root

cause of the fault will be captured without the need to re-create the problem or run an

extended tracing or diagnostics program.

For the vast majority of faults, a good FFDC design means that the root cause is detected

automatically without intervention by a service representative. Pertinent error data related to

the fault is captured and saved for analysis. In hardware, FFDC data is collected from the fault

isolation registers and from the associated logic. In firmware, this data consists of return

codes, function calls, and so forth.

check stations

FFDC

that potential errors can be quickly identified and accurately tracked to an FRU.

This proactive diagnostic strategy is a significant improvement over the classic, less accurate

reboot and diagnose

164

IBM Power 720 and 740 Technical Overview and Introduction

are carefully positioned within the server logic and data paths to ensure

service approaches.

Table of Contents

Show Quick Links

Hide quick links:

Table of Contents

This manual is also suitable for:

Power 740

IBM Power 720 Overview page 178

Hide quick links:

Related Manuals for IBM Power 720

Related Products for IBM Power 720

This manual is also suitable for:

Table of Contents