Online: Node Level Performance Optimization
Date: 18 - 20 May 2021
This course includes topics on code optimization for x86 platforms (Intel and AMD CPUs) and efficient code parallelization on node-level using OpenMP threading. Advanced aspects of threading and optimization, such as selected topics of new features in OpenMP 5.0 standard, will be covered during the course.Learning outcomeAwareness of modern features in x86 CPUs;Ability to vectorize computations;Ability to use advanced features of OpenMP;Ability to increase code performance using threading and x86 optimization.Prerequisites and content level Good knowledge of C/C++ or Fortran;Good knowledge of threading using OpenMP;Basic knowledge of modern CPU architectures.The content level of the course is broken down as: beginner's - 0%, intermediate - 20%, advanced - 80%, community-targeted content - 0%.AgendaDay 1, Tuesday 18 May 202109:00 - 12:30- Introduction to performance engineering- Overview of modern multicore CPUs12:30 - 13:30 Lunch break13:30 - 16:30- Intel performance analysis tools- AMD performance analysis toolsDay 2, Wednesday 19 May 202109:00 - 12:00- Vectorization12:00 - 13:00 Lunch break13:00 - 16:30- Optimizing memory access- Other single core optimization topicsDay 3, Thursday 20 May 202109:00 - 12:00- OpenMP tasks12:00 - 13:00 Lunch break13:00 - 16:30- Threading optimization- Future outlook on OpenMPLecturers: Jussi Enkovaara (CSC), Mikko Byckling (Intel), Michael Klemm (AMD)Language: EnglishPrice: Free of charge (3 training days)For further detailed information and registration please visit:https://events.prace-ri.eu/e/NodeLevelPerformanceOptimization_2021 REGISTRATION CLOSEDIf you have registered to this course and you are not able to attend, please CANCEL your registration in advance by sending an email to patc@csc.fi
Keywords: PRACE, intermediate, online, tess, training
Venue: Online event
Organizer: CSC - PRACE
Event types:
- Workshops and courses
Activity log