Detecting - IBM Power 720 Overview

Hide thumbs Also See for Power 720:

Overview (59 pages)

Installation manual (64 pages)

Table Of Contents

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

page of 206

/ 206
Contents
Table of Contents
Bookmarks

Table of Contents

By delivering on these goals, IBM Power Systems servers enable faster and more accurate

repair, and reduce the possibility of human error.

Client control of the service environment extends to firmware maintenance on all of the

POWER processor-based systems. This strategy contributes to higher systems availability

with reduced maintenance costs.

This section provides an overview of the progressive steps of error detection, analysis,

reporting, notifying and repairing found in all POWER processor-based systems.

4.3.1 Detecting

The first and most crucial component of a solid serviceability strategy is the ability to

accurately and effectively detect errors when they occur. Although not all errors are a

guaranteed threat to system availability, those that go undetected can cause problems

because the system does not have the opportunity to evaluate and act if necessary. Power

processor-based systems employ IBM System z® server-inspired error detection

mechanisms that extend from processor cores and memory to power supplies and hard

drives.

Service processor

The service processor is a microprocessor that is powered separately from the main

instruction processing complex. The service processor provides the capabilities for the

following items:

POWER Hypervisor (system firmware) and HMC connection surveillance

Several remote power control options

Reset and boot features

Environmental monitoring

The service processor monitors the servers built-in temperature sensors, sending instructions

to the system fans to increase rotational speed when the ambient temperature is above the

normal operating range. Using an operating system interface, the service processor notifies

the operating system of potential environmentally related problems so that the system

administrator can take appropriate corrective actions before a critical failure threshold is

reached.

The service processor can also post a warning and initiate an orderly system shutdown in the

following circumstances:

The operating temperature exceeds the critical level (for example, failure of air

conditioning or air circulation around the system).

The system fan speed is out of operational specification (for example, because of multiple

fan failures).

The server input voltages are out of operational specification.

The service processor can immediately shut down a system in the following circumstances:

Temperature exceeds the critical level or remains above the warning level for too long.

Internal component temperatures reach critical levels.

Non-redundant fan failures occur.

Chapter 4. Continuous availability and manageability

161

Table of Contents

Show Quick Links

Hide quick links:

Table of Contents

This manual is also suitable for:

Power 740

Detecting - IBM Power 720 Overview

4.3.1 Detecting

Hide quick links:

Related Manuals for IBM Power 720

Related Content for IBM Power 720

This manual is also suitable for:

Table of Contents