Avoiding Hot-Spots on Two-Level Direct Networks
International Conference for High Performance Computing, Networking, Storage and Analysis (SC) 2011
Publication Type: Paper
Repository URL: 201104_BWTopologySC11
Abstract
A low-diameter, fast interconnection network is going to be a pre-requisite for
building exascale machines. A two-level direct network has been proposed by
several groups as a scalable design for future machines. IBM's PERCS topology
and the dragonfly network discussed in the DARPA exascale hardware study are
examples of this design. The presence of multiple levels in this design leads
to hot-spots on a few links for applications where grouping processes together
is optimal for minimizing total communication volume. This is especially true
for communication graphs with a small number of neighbors per task. Routing and
mapping choices can impact the communication behavior of parallel applications.
In this paper, we explore intelligent topology aware mappings for different
communication patterns on the physical topology to identify cases which
minimize link utilization. We also analyze the trade-offs between using direct
and indirect routing with different mappings. We use simulations to study
communication and overall performance of applications on two-level direct
networks since there are no installations of such networks yet. This study
raises interesting issues regarding choice of job scheduling, routing and
mapping for future machines.
TextRef
Abhinav Bhatele, Nikhil Jain, William D. Gropp and Laxmikant V. Kale, Avoiding hot-spots on two-level direct networks, International Conference for High Performance Computing, Networking, Storage and Analysis (Supercomputing) 2011
People
- Abhinav Bhatele
- Nikhil Jain
- William Gropp
- Laxmikant Kale
Research Areas