Characterization and Modeling of PIDX Parallel I/O for Performance Optimization



TIME: 3:30PM - 4:00PM

SESSION CHAIR: André Brinkmann

Parallel I/O library performance can vary greatly in response to user-tunable parameter values such as aggregator count, file count, and aggregation strategy. Unfortunately, manual selection of these values is time-consuming and depends on characteristics of the target machine, the underlying file system, and the dataset itself. Some characteristics, such as the amount of memory per core, can also impose hard constraints on the range of viable parameter values. In this work, we address these problems by using machine learning techniques to model the performance of the PIDX parallel I/O library and select appropriate tunable parameter values. We characterize both the network and I/O phases of PIDX on a Cray XE6 as well as an IBM Blue Gene/P system. We use the results of this study to develop a machine learning model for parameter space exploration and performance prediction.

Chair/Author Details:

André Brinkmann (Chair) - Johannes Gutenberg-University Mainz

Sidharth Kumar - University of Utah

Avishek Saha - University of Utah

Venkatram Vishwanath - Argonne National Laboratory

Philip Carns - Argonne National Laboratory

John A. Schmidt - University of Utah

Robert Latham - Argonne National Laboratory

Giorgio Scorzelli - University of Utah

Hemanth Kolla - Sandia National Laboratories

Robert Ross - Argonne National Laboratory

Jackie Chen - Sandia National Laboratories

Michael E. Papka - Argonne National Laboratory

Ray Grout - National Renewable Energy Laboratory

Valerio Pascucci - University of Utah

The full paper can be found in the ACM Digital Library.