TU Darmstadt / ULB / TUprints

Towards compiler-aided correctness checking of adjoint MPI applications

Hück, Alexander ; Protze, Joachim ; Lehr, Jan-Patrick ; Terboven, Christian ; Bischof, Christian ; Müller, Matthias S. (2021):
Towards compiler-aided correctness checking of adjoint MPI applications. (Preprint)
In: Proceedings of Correctness 2020: Fourth International Workshop on Software Correctness for HPC Applications, pp. 40-48,
Darmstadt, IEEE, 2020 IEEE/ACM 4th International Workshop on Software Correctness for HPC Applications (Correctness), 11.11.2020, ISBN 978-0-7381-1044-8,
DOI: 10.26083/tuprints-00018889,
[Conference or Workshop Item]

[img]
Preview
Text
typeart-must-adjoint-mpi-preprint.pdf
Available under only the rights of use according to UrhG.

Download (365kB) | Preview
Item Type: Conference or Workshop Item
Status: Preprint
Title: Towards compiler-aided correctness checking of adjoint MPI applications
Language: English
Abstract:

Algorithmic Differentiation (AD) is a set of techniques to calculate derivatives of a computer program. In C++, AD typically requires (i) a type change of the built-in double, and (ii) a replacement of all MPI calls with AD-specific implementations. This poses challenges on MPI correctness tools, such as MUST, a dynamic checker, and TypeART, its memory sanitizer extension. In particular, AD impacts (i) memory layouts of the whole code, (ii) requires more memory allocations tracking by TypeART, and (iii) approximately doubles the MPI type checks of MUST due to an AD-specific communication reversal. To address these challenges, we propose a new callback interface for MUST to reduce the number of intercepted MPI calls, and, also, improve the filtering capabilities of TypeART to reduce tracking of temporary allocations for the derivative computation. We evaluate our approach on an AD-enhanced version of CORAL LULESH. In particular, we reduce stack variable tracking from 32 million to 13 thousand. MUST with TypeART and the callback interface reduces the runtime overhead to that of vanilla MUST.

Title of Book: Proceedings of Correctness 2020: Fourth International Workshop on Software Correctness for HPC Applications
Place of Publication: Darmstadt
Publisher: IEEE
Classification DDC: 000 Allgemeines, Informatik, Informationswissenschaft > 004 Informatik
Divisions: 20 Department of Computer Science > Scientific Computing
Zentrale Einrichtungen > University IT-Service and Computing Centre (HRZ) > Hochleistungsrechner
Event Title: 2020 IEEE/ACM 4th International Workshop on Software Correctness for HPC Applications (Correctness)
Event Dates: 11.11.2020
Date Deposited: 12 Jul 2021 07:00
Last Modified: 12 Jul 2021 07:00
DOI: 10.26083/tuprints-00018889
Corresponding Links:
URN: urn:nbn:de:tuda-tuprints-188898
URI: https://tuprints.ulb.tu-darmstadt.de/id/eprint/18889
Export:
Actions (login required)
View Item View Item