Jste zde

What you need to know about performance analysis using Intel tools

Termín: 
St, 14.06.2017 9:00
Uzávěrka registrace: 
Po, 12.06.2017 23:00
Místo: 
VŠB - Technical University Ostrava, IT4Innovations building, room 207
Lektor: 
Georg Zitzlsberger (IT4Innovations)
Úroveň: 
advanced
Jazyk: 
English

Annotation

Collecting and understanding performance and efficiency characteristics of applications is key for High Performance Computing. During this one day course with hands-on you will be equipped with advanced information about performance and efficiency metrics and how to easily obtain them using selected Intel tools. We will pick the pearls of each tool and advise you how to extract information about your application quickly and how to explain the extracted information for your application and target architecture. Since such focus areas arise on a node level we will primarily focus on that but also address how to work on a system/cluster level. Intel tools covered will be Intel(R) VTune(TM) Amplifier XE (together with "perf"), Intel Software Development Emulator and Intel(R) Advisor.

Since advanced performance analysis is architecture related, we use the available systems on the Salomon cluster, which will be used by the participants for the hands-on exercises. Salomon comprises nodes with Intel(R) Xeon(R) processors v3 (Haswell) and first generation Intel(R) Xeon Phi(TM) coprocessor (KNC). For the hands-on part participants are encouraged to bring their own applications for analysis. Please refer to the Prerequisites section below for this.

Purpose of the course (benefits for the attendees)

At the end of this course participants will be able to extract key metrics from applications using the described tools and methodologies, and make decisions about further performance tuning.

About the tutor

Georg Zitzlsberger formerly worked for Intel Deutschland GmbH (Germany). He has been a Technical Consulting Engineer for Intel(R) Software Development tools for many years before he recently joined IT4Innovations. In his new role as researcher he offers consulting services for Intel Software Development tools and Intel architectures throughout the IT4Innovations network.                                           

Agenda

Wednesday 14 June 2017
9:00-9:30Registration
9:30-9:45Introduction & Agenda
9:45-10:45The architectures in a nutshell
 What is important to know for performance analysis?
 What are typical perfromance bottlenecks?
 System considered is Salomon: Intel(R) Xeon(R) processors v3 (Haswell) and first generation Intel(R) Xeon Phi(TM) coprocessor (KNC)
10:45-11:00Break
11:00-11:45Intel(R) VTune(TM) Amplifier XE:
 Obtaining key metrics from your application
 Black Belt Guide (understand the metrics)
11:45-12:30Apply Intel(R) VTune(TM) Amplifier XE to your application - Hands-On
12:30-13:30Time for lunch
13:30-14:15Intel(R) Advisor and Intel Software Development Emulator
 Intel Advisor (vector tool)
 Roofline-Model (arithmetic intensity and FLOPS)
 SIMD & memory analysis
 Simulate different micro-architectures
14:15-15:00Apply Intel(R) Advisor and Intel Software Development Emulator to your application - Hands-On
15:00-15:15Break
15:15-16:45How to use the tools at System/Cluster Level?
16:45-17:30Final Hands-On with Q&A

Prerequisites

Participants are welcomed and advised to bring their own applications for the best training experience. For participants with no own applications, the trainer will provide prepared examples for the hands-on parts.

 
Participants are welcomed and advised to bring their own applications for the hands-on sessions for best experience. For that, some preparation is required in order to execute own applications on the Salomon cluster. As introduction, we suggest to read the documentation to the Salomon cluster (https://docs.it4i.cz/; see section "Salomon Cluster"). In addition, source code of the applications is needed for analysis and eventual recompilation. It should be possible to recompile own applications quickly as hands-on time is limited. If the applications should be too complex (i.e. long build times) we suggest to extract kernels or subsets for the hands-on part of the course upfront. Such kernels or subsets need to be executable on the Salomon cluster and also come with test data for meaningful analysis.

For participants without own applications, the trainer will provide examples.

Registration

Obligatory registration - registration form here; deadline see above or exhausted course capacity.

Capacity and Fees

25 participants. The event is provided free of charge for them.

Practicalities

  • Documentation for IT4Innovations' computer systems is available at https://docs.it4i.cz/.
  • See links below how to get to the campus of  VŠB - Technical University Ostrava and to the IT4Innovations building.
Přílohy: