Analyzing the Performance of an OpenMP* and MPI Application

Use Intel® Parallel Studio XE Cluster Edition to understand the cause of ineffective code in a hybrid application by performing a series of steps in a workflow. This tutorial guides you through these workflow steps while using a sample OpenMP* and MPI application,
heart_demo
, which simulates electrophysiological heart activity.
  • Build the
    heart_demo
    sample application.
  • Test OpenMP thread and MPI process combinations.
  • Run Application Performance Snapshot.
  • Interpret result data.
  • Set up the Intel Trace Analyzer and Collector environment.
  • Run the application with Intel Trace Analyzer and Collector enabled.
  • Review the message profile chart.
  • Update the application code for MPI communication issues.
  • Run Application Performance Snapshot on updated application.
  • Test application performance.
  • Use
    -gtool
    to run
    Intel VTune
    Profiler
    HPC Performance Characterization analysis from a command prompt.
  • Review the analysis data to identify legacy instruction set usage.
  • Fix vector instruction set.
  • Test application performance.
  • Run the HPC Performance Characterization analysis on the application.
  • Identify functions that would benefit from parallelism at the threading level.
  • Update the application code to use parallelized functions.
  • Test application performance.

Product and Performance Information

1

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.