SCHEDULE: NOV 16-22, 2013

Debugging MPI and Hybrid/Heterogeneous Applications at Scale

SESSION: Debugging MPI and Hybrid/Heterogeneous Applications at Scale

EVENT TYPE: Tutorials

TIME: 8:30AM - 5:00PM

Presenter(s): Ganesh Gopalakrishnan, David Lecomber, Matthias S. Mueller, Bronis R. de Supinski, Tobias Hilbrich


MPI programming is error-prone because of the complexity of MPI semantics and the inherent difficulties of parallel programming. Increasing heterogeneity (e.g., MPI plus OpenMP/CUDA), scale, non-determinism, and platform-dependent bugs exacerbate these difficulties. This tutorial covers the detection and correction of errors in MPI programs at small and large scale, as well as in heterogeneous/hybrid programs. We will first introduce our main tools: MUST, which detects MPI usage errors at runtime with a high degree of automation; ISP/DAMPI, which detects interleaving-dependent MPI deadlocks and assertion violations through application replay; and DDT, a highly scalable parallel debugger. Attendees will be encouraged to explore our tools early in the tutorial to better appreciate their strengths and limitations. We will present best practices and a cohesive workflow for comprehensive application debugging with all our tools. We dedicate the afternoon session to advanced use cases, tool deployment on leadership-scale systems, updates on new tool functionality, and the debugging of hybrid/heterogeneous programming models. The latter includes debugging approaches for MPI, OpenMP, and CUDA, and is especially crucial for systems such as Titan (ORNL) and Sequoia (LLNL). DDT's capabilities for CUDA/OpenMP debugging will be presented, along with a short introduction to GKLEE, a new symbolic verifier for CUDA applications.

Chair/Presenter Details:

Ganesh Gopalakrishnan - University of Utah

David Lecomber - Allinea Software

Matthias S. Mueller - RWTH Aachen University

Bronis R. de Supinski - Lawrence Livermore National Laboratory

Tobias Hilbrich - Dresden University of Technology
