Webinar: Better Threaded Performance and Scalability With Intel® VTune™ Amplifier + OpenMP*

Published: 09/26/2017

By Anoop Madhusoodhanan Prabha

To watch the webinar, follow the link: Better Threaded Performance and Scalability with Intel® VTune™ Amplifier + OpenMP*.

You'll learn:

  • How OpenMP* 4.0, which adds capabilities for explicit SIMD vectorization alongside threading, can dramatically improve performance on modern processors when used with the Intel® C++ Compiler
  • How Intel® VTune™ Amplifier can help you uncover common performance and scalability issues and identify whether they are caused by load imbalance, lock contention, creation overhead, or scheduling overhead
  • How to use both to optimize performance on the newest Intel® hardware
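
As a rough illustration of the first point, the sketch below combines threading and explicit SIMD vectorization in a single OpenMP 4.0 construct. It is not taken from the webinar; the function name `dot` is hypothetical, and the code assumes OpenMP is enabled at compile time (for example `-qopenmp` with the Intel® C++ Compiler or `-fopenmp` with GCC/Clang). Without that flag the pragma is ignored and the loop runs serially with the same result.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical example: dot product of two equally sized vectors.
// "parallel for" splits the iterations across threads, "simd" asks the
// compiler to vectorize each thread's chunk, and "reduction(+:sum)" gives
// every thread and SIMD lane a private partial sum that is combined at the end.
double dot(const std::vector<double>& a, const std::vector<double>& b) {
    assert(a.size() == b.size());
    const double* pa = a.data();
    const double* pb = b.data();
    const long n = static_cast<long>(a.size());
    double sum = 0.0;
    #pragma omp parallel for simd reduction(+:sum)
    for (long i = 0; i < n; ++i) {
        sum += pa[i] * pb[i];
    }
    return sum;
}
```

Whether this particular loop benefits depends on the data size and hardware; a profiler such as Intel VTune Amplifier is the way to check rather than assume.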

Prerequisites:

  1. Intel® Parallel Studio Professional or Ultimate Edition installed on a Linux machine (provides the Intel® C++ Compiler, Intel® VTune™ Amplifier, and Intel® Advisor, all of which are used in this lab).
  2. Install the latest version of OpenCV:
    1. Download the source from GitHub (https://github.com/opencv/opencv) using the git clone command.
    2. Build the OpenCV libraries by following the instructions at http://docs.opencv.org/trunk/d7/d9f/tutorial_linux_install.html.
  3. Make sure that you have a copy of the lab source code, which includes the lab documentation.

Introduction:

This lab will help you understand how to use Intel® VTune™ Amplifier and Intel® Advisor to look for tuning opportunities, and how to tune the code by enabling threading (using OpenMP or Intel® Threading Building Blocks [Intel® TBB]) and vectorization (using OpenMP 4.0 SIMD constructs).
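
To make the threading-plus-vectorization idea concrete, here is a minimal sketch of the pattern on an image-like buffer: the outer loop over rows is threaded and the inner loop over pixels is marked for SIMD. This is not the lab's code. The function `brighten`, the row-major 8-bit grayscale layout, and the use of a plain `std::vector` instead of an OpenCV matrix are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: add `delta` to every pixel of a row-major 8-bit
// grayscale image in place, clamping the result to the range [0, 255].
void brighten(std::vector<std::uint8_t>& pixels, int rows, int cols, int delta) {
    assert(rows >= 0 && cols >= 0);
    assert(pixels.size() == static_cast<std::size_t>(rows) * cols);
    std::uint8_t* data = pixels.data();
    // Threading: each thread processes a subset of the rows.
    #pragma omp parallel for
    for (int r = 0; r < rows; ++r) {
        std::uint8_t* row = data + static_cast<std::size_t>(r) * cols;
        // Vectorization: pixels within a row are independent of each other.
        #pragma omp simd
        for (int c = 0; c < cols; ++c) {
            const int v = row[c] + delta;
            row[c] = static_cast<std::uint8_t>(v > 255 ? 255 : (v < 0 ? 0 : v));
        }
    }
}
```

If profiling showed that rows take unequal time, a `schedule` clause on the outer loop would be one knob to experiment with; for uniform rows like these the default schedule is usually adequate.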

The detailed lab document is available here.

Attachment            Size
lab-instructions.pdf  2.5 MB
code-sample.zip       17.8 MB
