The International Conference for High Performance Computing, Networking, Storage and Analysis
The Heuristic Static Load-Balancing Algorithm Applied to CESM.
Authors: Yuri Alexeev (Argonne National Laboratory), Sheri Mickelson (Argonne National Laboratory), Sven Leyffer (Argonne National Laboratory), Robert Jacob (Argonne National Laboratory), Anthony Craig (National Center for Atmospheric Research)
Abstract: We propose to use the heuristic static load-balancing algorithm for solving load balancing problems in the Community Earth System Model (CESM), using fitted benchmark data, as an alternative to the current manual approach. The problem of allocating the optimal number of CPU cores to CESM components is formulated as a mixed-integer nonlinear optimization problem which is solved by using an optimization solver implemented in the MINLP package MINOTAUR. Our algorithm was tested for the 1° and 1/8° resolution simulations on 163,840 cores of IBM Blue Gene/P where we consistently achieved well load balanced results. This work is a part of a broader effort by NCAR and ANL scientists to eliminate the need for manual tuning of the code for each platform and simulation type, improve the performance and scalability of CESM, and develop automated tools to achieve these goals.