Tutorial: Analyzing an OpenMP* and MPI Application
Intel® Trace Analyzer and Collector
Application Performance Snapshot
Intel® VTune™
for Linux* OSProfiler
Discover how to use Intel® Parallel Studio to tune hybrid applications by reviewing MPI utilization inefficiencies and balancing thread load levels.
About This Tutorial | This tutorial uses the sample
heart_demo and guides you through basic steps required to analyze hybrid OpenMP* and MPI code for inefficiencies using Intel® VTune™
's Application Performance Snapshot, Intel® Trace Analyzer and Collector, and Profiler Intel VTune
.
Profiler The tutorial was last updated for the Intel Parallel Studio 2018 product release. The analysis was run on 8 cluster nodes with Intel® Xeon Phi™ processors (formerly code named Knights Landing), each with 256 logical CPUs.
|
Estimated Duration | Read tutorial: 10 minutes
Run through tutorial with sample application: 60+ minutes
|
Learning Objectives | After you complete this tutorial, you should be able to:
|
More Resources |
|