Intel® VTune™ Profiler Tutorials

ID 673132
Updated 9/13/2022
Version Latest
Public

Current Tutorials

Use these tutorials as quick paths to start using Intel® VTune™ Profiler. Each tutorial demonstrates an end-to-end workflow that you can ultimately apply to your own applications.

Download Intel® VTune™ Profiler (as a standalone tool or as part of the Intel® oneAPI Base Toolkit).

Find current code samples in the library of Intel® oneAPI code samples used with VTune Profiler.

Take This Short Tutorial | Learn To Do This
Finding Common Performance Bottlenecks
Duration: 20-30 minutes
Last Updated: Intel VTune Profiler 2021

C++ Tutorial
Windows* OS: HTML | PDF
Linux* OS: HTML | PDF
Sample code: matrix
 

Understand where your application is spending time, identify the most time-consuming program units (hotspots), and detect hardware usage bottlenecks for a sample application.

Analyzing an OpenMP* and MPI Application
Duration: 60+ minutes
Last Updated: Intel VTune Amplifier 2018 Update 1

C++ Tutorial
Linux* OS: HTML
Sample code: https://github.com/CardiacDemo/Cardiac_demo
 

Identify issues in a hybrid OpenMP and MPI application using MPI Performance Snapshot, Intel Trace Analyzer and Collector, and Intel VTune Profiler.

TIP: For more end-to-end analysis use cases, explore the Intel VTune Profiler Cookbook.

NOTE: Starting with the Intel® oneAPI Base Toolkit, Intel® VTune™ Amplifier has been renamed to Intel® VTune™ Profiler. You can use the standalone version of VTune Profiler or download it with the Intel® oneAPI Base Toolkit.
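
To give a sense of the kind of hotspot the Finding Common Performance Bottlenecks tutorial looks for, here is a minimal sketch of a matrix multiplication written two ways. This is not the code of the `matrix` sample; the `Matrix` type and the function names `multiply_ijk` and `multiply_ikj` are hypothetical and chosen only for illustration. Both functions compute the same product, but the first walks one operand column by column in its inner loop, a memory access pattern that a hotspot and hardware usage analysis would typically flag.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical square matrix in row-major order:
// element (i, j) is stored at data[i * n + j].
struct Matrix {
    std::size_t n;
    std::vector<double> data;

    explicit Matrix(std::size_t size) : n(size), data(size * size, 0.0) {}

    double& at(std::size_t i, std::size_t j) { return data[i * n + j]; }
    double at(std::size_t i, std::size_t j) const { return data[i * n + j]; }
};

// i-j-k loop order: the inner loop reads b with a stride of n elements,
// so consecutive iterations touch different rows of b.
Matrix multiply_ijk(const Matrix& a, const Matrix& b) {
    assert(a.n == b.n);  // this sketch handles equally sized square matrices only
    Matrix c(a.n);
    for (std::size_t i = 0; i < a.n; ++i) {
        for (std::size_t j = 0; j < a.n; ++j) {
            double sum = 0.0;
            for (std::size_t k = 0; k < a.n; ++k) {
                sum += a.at(i, k) * b.at(k, j);
            }
            c.at(i, j) = sum;
        }
    }
    return c;
}

// i-k-j loop order: the inner loop reads b and writes c along a row,
// so consecutive iterations touch adjacent memory.
Matrix multiply_ikj(const Matrix& a, const Matrix& b) {
    assert(a.n == b.n);
    Matrix c(a.n);
    for (std::size_t i = 0; i < a.n; ++i) {
        for (std::size_t k = 0; k < a.n; ++k) {
            const double a_ik = a.at(i, k);
            for (std::size_t j = 0; j < a.n; ++j) {
                c.at(i, j) += a_ik * b.at(k, j);
            }
        }
    }
    return c;
}
```

Calling both functions on large matrices from a small driver program and profiling that program is one way to see how the time splits between the two variants. The actual difference depends on matrix size, compiler options, and hardware.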

Deprecated Tutorials

These tutorials are deprecated and have not been updated for the latest version of Intel VTune Profiler. They describe features of Intel VTune Amplifier, which may differ in the current version of its successor, Intel VTune Profiler.

Take This Short Tutorial | Learn To Do This
Finding Hotspots
Duration: 10-15 minutes
Last Updated: Intel VTune Amplifier 2017

Fortran Tutorial
Windows* OS: PDF
Linux* OS: PDF
Sample code: nqueens_fortran
 

Identify where your application is spending time, and detect the most time-consuming program units and how they were called.

Analyzing Locks and Waits
Duration: 10-15 minutes
Last Updated: Intel VTune Amplifier 2017

C++ Tutorial
Windows* OS: PDF
Linux* OS: PDF
Sample code: tachyon_vtune_amp_xe
 

Identify locks and waits that prevent parallelization.

Identifying Hardware Issues
Duration: 10-15 minutes
Last Updated: Intel VTune Amplifier 2017

C++ Tutorial
Windows* OS: PDF
Linux* OS: PDF
Sample code: matrix_vtune_amp_xe
 

Identify hardware-related issues in your application, such as data sharing, cache misses, and branch misprediction.

Analyzing Disk Input/Output Waits
Duration: 10-15 minutes
Last Updated: Intel VTune Amplifier 2018

C++ Tutorial
Linux* OS: PDF
Sample code: diskio
 

Analyze an I/O-bound application that uses the system file cache and a user buffer to work with the I/O device.

Identifying False Sharing
Duration: 10-15 minutes
Last Updated: Intel VTune Amplifier 2018

C Tutorial
Linux* OS: PDF
Sample code: linear_regression
 

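Identify false sharing, where threads on different cores repeatedly update separate variables that reside on the same cache line.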

Enabling Performance Collection on an Embedded Linux* System
Duration: 60+ minutes
Last Updated: Intel VTune Amplifier 2017

C++ Tutorial
Linux* OS: PDF
Sample code: tachyon_vtune_amp_xe
 

Configure a remote Linux embedded system built with the Yocto Project* 2.1 environment for application analysis with VTune Amplifier sampling drivers. Analyze where your application is spending time and identify the most time-consuming program units with Advanced Hotspots analysis.

Finding Hotspots on an Android* Platform
Duration: 10-15 minutes
Last Updated: Intel VTune Amplifier 2017

C++ Tutorial
Windows* OS: PDF
Linux* OS: PDF
Sample code: tachyon_vtune_amp_xe

Configure and run a remote Basic Hotspots analysis on an Android target system.

Analyzing Energy Usage on an Android* Platform
Duration: 10-15 minutes
Last Updated: Intel VTune Amplifier 2017

Tutorial
Linux* OS: PDF
Windows* OS: PDF

Run the Energy analysis with the Intel SoC Watch collector (available with Intel System Studio) directly on the target Android system, then view the collected data with VTune Amplifier installed on the host Windows* or Linux* system.

Analyzing Energy Usage on a Windows* Platform
Duration: 20-30 minutes
Last Updated: Intel VTune Amplifier 2017

Tutorial
Windows* OS: PDF
Sample code: Pi_Console.exe

Run the Energy analysis of an idle system and a sample application with the Intel SoC Watch collector (available with Intel System Studio) directly on the target Windows* system. Copy the results to the Windows host system and view the collected data with VTune Amplifier.
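
As an illustration of the pattern that the Identifying False Sharing tutorial listed above targets, the following minimal sketch contrasts two counter layouts. It is not taken from the `linear_regression` sample; the types `PackedCounters` and `PaddedCounters`, the function `count_in_parallel`, and the 64-byte cache line size are all assumptions made for this example. In the packed layout, two threads update different variables that will usually sit on the same cache line; in the padded layout, each counter is aligned to its own assumed cache line.

```cpp
#include <atomic>
#include <cstddef>
#include <functional>
#include <thread>

// Assumed cache line size. 64 bytes is common on x86 processors,
// but it is an assumption here, not a value queried from the hardware.
constexpr std::size_t kAssumedCacheLine = 64;

// Counters placed back to back: both will usually share one cache line,
// so updates from two threads can invalidate each other's cached copy.
struct PackedCounters {
    std::atomic<long> first{0};
    std::atomic<long> second{0};
};

// Each counter is aligned to its own assumed cache line.
struct PaddedCounters {
    alignas(kAssumedCacheLine) std::atomic<long> first{0};
    alignas(kAssumedCacheLine) std::atomic<long> second{0};
};

// Runs two threads, each incrementing only its own counter, and returns
// the combined total. The threads never touch the same variable.
template <typename Counters>
long count_in_parallel(long iterations) {
    Counters counters;
    auto bump = [iterations](std::atomic<long>& counter) {
        for (long i = 0; i < iterations; ++i) {
            counter.fetch_add(1, std::memory_order_relaxed);
        }
    };
    std::thread t1(bump, std::ref(counters.first));
    std::thread t2(bump, std::ref(counters.second));
    t1.join();
    t2.join();
    return counters.first.load() + counters.second.load();
}
```

Both layouts produce the same total. Any difference shows up only in run time and in memory-related metrics, and its size depends on the hardware and the iteration count.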