Qlogic Fast Fabric User Manual page 147

Table of Contents

Advertisement

Q
Pallas has known scalability limitations, especially in its AllToAll phase. This phase
can simultaneously perform up to 4MB transfers to-and-from all nodes at once. The
downside is a system must have approx 10*NP MB of memory available per process
for Pallas data to run this benchmark. Therefore, for a small cluster (approximately
16 processors or less), it is modest at 160MB. However, for a larger cluster
(approximately 256 processors or greater), it is rather large at 2.5GB.
As such, it is recommended that Pallas be used for smaller runs (2-32 processes)
or that it be recognized that the benchmark is likely to fail (or swap Linux to death)
at larger process counts. Depending upon the amount of memory in the system and
the numbers of processes to run, the VIADEV_MEM_REG_MAX parameter in
/opt/iba/src/mpi_apps/mpi.param.pallas may need to be edited.
To run pallas:
1. cd /opt/iba/src/mpi_apps
2. ./run_pmb NP
where NP is the number of processes to run, such as:
./run_pmb 4
During this benchmark the /opt/iba/src/mpi_apps/mpi.param.pallas
config file is used.
D000006-000 Rev A
6 – MPI Sample Applications
6-5

Advertisement

Table of Contents
loading
Need help?

Need help?

Do you have a question about the Fast Fabric and is the answer not in the manual?

Subscribe to Our Youtube Channel

Table of Contents