April 23, 2026 · Open Access

Interim report on performance analysis of MAX software. Deliverable D3.1 of the HORIZON-EUROHPC-JU-2021-COE-01 project MaX (101093374)

Key Points

The report aims to assess the parallel performance of MaX flagship codes and identify bottlenecks for optimization.
Continuous assessment using code profiling and optimization tools.
Use of recent programming models.
Benchmarking and profiling activities organized around performance metrics.
Identification of code bottlenecks related to memory bandwidth, communication imbalance, and latency.
Preliminary results on performance enhancement metrics.
Deployment status of flagship applications on EuroHPC systems.

Abstract

WP3 is responsible for the continuous assessment and analysis of the parallel performance of the MaX flagship codes, pointing out the direction for development aimed at the effective exploitation of existing technology. To this end, we make use of tools for code profiling and optimisation and of the most recent programming models. WP3 activity supports many other tasks in different work packages (WP1, WP2, WP4, WP5), providing feedback on the progress obtained in terms of performance enhancement with respect to the relevant metrics and to parallel efficiency. In addition, an important output of this WP is to discover and monitor code bottlenecks, to identify the code or architecture feature responsible for them (memory bandwidth, communication imbalance, latency, bandwidth to GPU, etc.), and to propose dedicated solutions, including through the engineering of ad hoc proofs of concept. Solutions to these bottlenecks that require code refactoring or the replacement of an algorithm will be implemented within WP1 by the code developers. Our activity will therefore be in continuous synergy with the development teams of the MaX flagship codes. In the following we describe how we have organised the benchmarking and profiling activity, introducing the tools adopted (Section 3) and presenting some preliminary results (Sections 4, 5 and 6). The deployment status of flagship applications on EuroHPC systems is also reported (Section 7).
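As a rough illustration of the parallel efficiency mentioned in the abstract, the sketch below computes strong-scaling speedup and efficiency from wall-clock timings. It assumes the common textbook definitions (speedup relative to the smallest run, efficiency as speedup divided by the increase in process count). The report may define its metrics differently, the function name `scaling_metrics` is hypothetical, and the timings are invented for the example rather than taken from the deliverable.

```python
def scaling_metrics(timings):
    """Compute strong-scaling speedup and parallel efficiency.

    timings: dict mapping process count -> wall-clock time in seconds
             for the same fixed problem size.
    Returns a list of (procs, speedup, efficiency) tuples, measured
    relative to the run with the smallest process count.
    """
    base_procs = min(timings)
    base_time = timings[base_procs]
    rows = []
    for procs in sorted(timings):
        # Speedup: how much faster than the baseline run.
        speedup = base_time / timings[procs]
        # Efficiency: speedup divided by the factor of extra processes.
        efficiency = speedup * base_procs / procs
        rows.append((procs, speedup, efficiency))
    return rows


# Invented timings, for illustration only (not results from the report).
example = {1: 100.0, 2: 52.0, 4: 28.0, 8: 16.0}
for procs, speedup, eff in scaling_metrics(example):
    print(f"{procs:>3} procs  speedup {speedup:5.2f}  efficiency {eff:5.2f}")
```

An efficiency that drops well below 1 as the process count grows is the kind of signal that would prompt a closer look at the bottleneck classes listed above, such as communication imbalance or memory bandwidth.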
