You are here

Practical Parallel Performance Analysis on Salomon

Thu, 10/20/2016 - 9:30am to Fri, 10/21/2016 - 4:00pm
Registration deadline: 
Tue, 10/18/2016 - 11:45pm
VŠB - Technical University Ostrava, IT4Innovations building, room 207
Brian Wylie (Jülich Supercomputing Centre ), Ronny Brendel (TU Dresden)


Applications need to be effectively parallelised, using techniques such as MPI message-passing and OpenMP multithreading, to fully exploit the capabilities of modern systems for high-performance computing. The course introduces the open-source community-developed Score-P instrumentation and measurement infrastructure, and the Scalasca and Vampir tools, to provide a practical basis for portable performance analysis of parallel applications.

The course will be delivered as a series of presentations with associated hands-on practical exercises using the Salomon computer cluster.  It starts with basic application instrumentation and measurement to generate execution profiles, then improves measurement quality via customization capabilities, and progresses to interactive and automated analyses of execution traces.

While analysis of provided example codes will be used to guide the class through the relevant steps and familiarise with usage of the tools, coaching will also be available to assist participants to analyse their own parallel application codes and may suggest opportunities for improving their execution performance and scalability.

This is especially true for users interested in a deeper analysis and optimization of their codes, which will be offered on 14-15 December 2016 during a Performance Optimization and Productivity Tutorial organized at IT4Innovations jointly with POP (Performance Optimisation and Productivity),  a Centre of Exellence in Computing Applications. Programmers, developers, consult the POP web pages for details and do not miss this unique opportunity.

Purpose of the course (benefits for the attendees)

On completion participants should be familiar with common performance analysis techniques and how they can be employed in practice (on a range of HPC systems).  Those who prepared their own application test cases will have been coached in the tuning of their measurement and analysis, and provided optimization suggestions.

About the tutors

Brian Wylie is a research scientist at Jülich Supercomputing Centre of Forschungszentrum Jülich since 2004, where he is a developer of the Scalasca toolset for scalable performance analysis of large-scale parallel applications. Additional responsibilities for application engagement and training include the series of VI-HPS Tuning Workshops and hands-on tutorials. He has a further 13 years of experience researching and developing parallel performance tools, and as applications scientist working with a spectrum of academic and commercial applications. His 1990 PhD in computational physics was awarded by the University of Edinburgh.

Ronny Brendel received his Diploma in Computer Science at TU Dresden where he now works as a researcher. His work revolves around Software Performance Analysis. For over eight years Ronny has been researching and developing tools for software performance analysis, e.g., Vampir, Score-P, OTF and VampirTrace. Over the years he contributed to many levels of the performance analysis tool stack jointly developed by TU Dresden, Jülich Supercomputing Centre, RWTH Aachen, TU Munich, and TU Darmstadt.

Preliminary schedule

Thursday, October 20,  2016



Introduction to parallel performance engineering

Instrumentation and measurement of parallel applications with Score-P


lunch break


Profile analysis report exploration

Configuration and customisation of Score-P measurements


coffee break


Coaching to profile your own parallel application(s) on Salomon


coffee break

17:00-18:00Parallel execution performance analysis diagnosis and tuning 


Friday, October 21,  2016

Automated trace analysis with Scalasca


coffee break


Interactive trace analysis with Vampir


lunch break


Coaching to trace your own parallel application(s) on Salomon


Familiarity with MPI or OpenMP programming in Fortran, C or C++, and running parallel applications on Linux clusters. Ideally participants will have their own parallel application code which builds and runs successfully on the Salomon cluster.  Participants need to have their own notebook computer configured for access to Salomon and running X11-based graphical tools.


Obligatory registration - registration form here; deadline (extended) see above or exhausted course capacity

Capacity and Fees

30 participants. The event is provided free of charge for them.


  • See links below how to get to the campus of  VŠB - Technical University Ostrava and to the IT4Innovations building.
  • Documentation for IT4Innovations' computer systems is available at