Sign In
Upload
Manuals
Brands
IBM Manuals
Server
Power Systems 775
IBM Power Systems 775 Manuals
Manuals and User Guides for IBM Power Systems 775. We have
1
IBM Power Systems 775 manual available for free PDF download: Manual
IBM Power Systems 775 Manual (358 pages)
for AIX and Linux HPC Solution
Brand:
IBM
| Category:
Server
| Size: 7.02 MB
Table of Contents
Front Cover
1
Table of Contents
5
Trademarks
10
Preface
11
The Team Who Wrote this Book
11
Now You Can Become a Published Author, too
13
Comments Welcome
14
Stay Connected to IBM Redbooks
14
Chapter 1. Understanding the IBM Power Systems 775 Cluster
15
Overview of the IBM Power System 775 Supercomputer
16
Advantages and New Features of the IBM Power 775
17
Hardware Information
18
POWER7 Chip
18
I/O Hub Chip
24
Collective Acceleration Unit (CAU)
27
Nest Memory Management Unit (NMMU)
29
Integrated Switch Router (ISR)
29
Supernova
31
Hub Module
32
Memory Subsystem
36
Quad Chip Module (QCM)
37
Octant
39
Interconnect Levels
41
Node
42
Supernodes
44
Power, Packaging and Cooling
54
Frame
54
Bulk Power and Control Assembly (BPCA)
56
Bulk Power Control and Communications Hub (BPCH)
58
Bulk Power Regulator (BPR)
58
Water Conditioning Unit (WCU)
59
Disk Enclosure (Rodrigo)
62
Overview
62
High Level Description
63
Configuration
65
Cluster Management
67
Hardware Management Console
67
Data Flow
70
HMC X
70
Lpars
71
Utility Nodes
72
GPFS I/O Nodes
74
Isnm
80
Db2
86
Extreme Cluster Administration Toolkit (Xcat)
86
Toolkit for Event Analysis and Logging (TEAL)
89
Gpfs
90
Reliable Scalable Cluster Technology (RSCT)
90
IBM Parallel Environment
102
Loadleveler
107
Parallel ESSL
108
Compilers
109
Logical View
111
Parallel Tools Platform (PTP)
112
Chapter 2. Application Integration
113
Power 775 Diskless Considerations
114
System Access
119
System Capabilities
120
Application Development
120
Advantage for PGAS Programming Model
121
Unified Parallel C (UPC)
122
ESSL/PESSL Optimized for Power 775 Clusters
130
Parallel Environment Optimizations for Power 775
132
Considerations for Data Striping with PE
137
Confirmation of HFI Status
140
Managing Jobs with Large Numbers of Tasks (up to 1024 K)
146
IBM Parallel Environment Developer Edition for AIX
150
Eclipse Parallel Tools Platform (PTP 5.0)
150
IBM High Performance Computing Toolkit (IBM HPC Toolkit)
150
Running Workloads Using IBM Loadleveler
158
Submitting Jobs
158
Querying and Managing Jobs
160
Chapter 3. Monitoring
171
Component Monitoring
172
Loadleveler
178
General Parallel File System (GPFS)
179
Xcat
191
Power Management
194
Db2
207
AIX and Linux Systems
208
Integrated Switch Network Manager (ISNM)
213
Reliable Scalable Cluster Technology (RSCT)
224
Compilers Environment (PE Runtime Edition, ESSL, Parallel ESSL)
227
Diskless Resources (NIM, Iscsi, NFS, TFTP)
228
TEAL Tool
232
Configuration (Loadleveler, GPFS, Service Focal Point, PNSD, ISNM)
232
Management
233
Quick Health Check (Full HPC Cluster System)
238
Component Analysis Location
238
Top to Bottom Checks Direction (Software to Hardware)
240
Bottom to Top Direction (Hardware to Software)
240
EMS Availability
241
Simplified Failover Procedure
242
Component Configuration Listing
246
Loadleveler
248
Xcat
248
Db2
252
AIX and Linux Systems
252
Integrated Switch Network Manager (ISNM)
253
Reliable Scalable Cluster Technology (RSCT)
254
Compilers Environment (PE Runtime Edition, ESSL, Parallel ESSL)
254
Diskless Resources (NIM, Iscsi, NFS, TFTP)
255
Component Monitoring Examples
255
Xcat (Power Management, Hardware Discovery and Connectivity)
255
Integrated Switch Network Manager (ISNM)
255
Troubleshooting Problems
257
Chapter 4 . Problem Determination
258
Xcat
258
Xcatdebug
258
Resolving Xcat Configuration Issues
258
Node Does Not Respond to Queries or Rpower Command
260
Node Fails to Install
261
Unable to Open a Remote Console
262
Time out Errors During Network Boot of Nodes
262
Isnm
263
Checking the Status and Recycling the Hardware Server and the CNM
263
Communication Issues between CNM and DB2
264
Adding Hardware Connections
267
Checking FSP Status, Resolving Configuration or Communication Issues
268
Verifying CNM to FSP Connections
269
Verify that a Multicast Tree Is Present and Correct
270
Correcting Inconsistent Topologies
271
Hfi
273
HFI Health Check
273
SMS Ping Test Fails over HFI
276
Netboot over HFI Fails
277
Other HFI Issues
277
Chapter 5. Maintenance and Serviceability
279
Managing Service Updates
280
Service Packs
280
System Firmware
280
Managing Multiple Operating System (OS) Images
282
Power 775 Xcat Startup/Shutdown Procedures
286
Startup Procedures
286
Shutdown Procedures
296
Managing Cluster Nodes
303
Node Types
303
Adding Nodes to the Cluster
309
Removing Nodes from a Cluster
310
Power 775 Availability Plus (A+)
311
Advantages of Availability Plus (A+)
311
Considerations for a
312
A+ QCM Move Example
320
Appendix A. Serviceable Event Analysis
327
Analyzing a Hardware Serviceable Event that Points to an A+ Action
328
Appendix B. Command Outputs
337
Db2
339
Related Publications
349
IBM Redbooks
349
Other Publications
349
Online Resources
349
Help from IBM
350
Index
351
Advertisement
Advertisement
Related Products
IBM Power 770 9117-MMB
IBM Power 770 9117-MMC
IBM Power 770 9117-MMD
IBM Power 775 9125-F2C
IBM Power 7 7314-G30
IBM 7779
IBM 7013 591
IBM 7104
IBM 79788BU
IBM 7947E2U
IBM Categories
Server
Desktop
Storage
Laptop
Monitor
More IBM Manuals
Login
Sign In
OR
Sign in with Facebook
Sign in with Google
Upload manual
Upload from disk
Upload from URL