Intel® Advisor User Guide

ID 766448
Date 12/16/2022
Public

GPU Roofline Accuracy Presets

For each perspective, Intel® Advisor offers several levels of collection accuracy. Each accuracy level is a set of analyses and properties that control what data is collected and how detailed the collection is. The higher the accuracy level you choose, the more runtime overhead the collection adds.
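To make the overhead trade-off concrete, the following is a minimal sketch that estimates how long a collection might take at each accuracy level. It is not part of Intel® Advisor. The function names are hypothetical, and it assumes that the overhead figures in the table below (5 - 10x, 15 - 20x, 20 - 50x) can be read as multipliers on the application's unprofiled run time.

```python
# Overhead multipliers (lower bound, upper bound) copied from the
# accuracy-level table. Treating them as multipliers on the unprofiled
# run time is an assumption made for this sketch.
OVERHEAD_RANGES = {
    "low": (5, 10),
    "medium": (15, 20),
    "high": (20, 50),
}


def estimate_collection_time(baseline_seconds, level):
    """Return (best_case, worst_case) collection time in seconds.

    Hypothetical helper: baseline_seconds is the application's run time
    without profiling; level is "low", "medium", or "high".
    """
    lower, upper = OVERHEAD_RANGES[level.lower()]
    return baseline_seconds * lower, baseline_seconds * upper


def highest_level_within_budget(baseline_seconds, budget_seconds):
    """Return the most accurate level whose worst case fits the budget.

    Returns None when even the Low level may exceed the budget.
    """
    choice = None
    for level in ("low", "medium", "high"):
        _, worst_case = estimate_collection_time(baseline_seconds, level)
        if worst_case <= budget_seconds:
            choice = level
    return choice


if __name__ == "__main__":
    # Example: an application that normally runs for 60 seconds.
    for name in OVERHEAD_RANGES:
        best, worst = estimate_collection_time(60, name)
        print(f"{name}: {best} - {worst} seconds")
```

Under these assumptions, an application that normally runs for one minute could take between 15 and 20 minutes to profile at the Medium level. Actual overhead depends on the workload, so treat the result as a rough planning figure only.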

The following accuracy levels are available:

Comparison / Accuracy Level: Low, Medium, High

Overhead

Low: 5 - 10x

Medium: 15 - 20x

High: 20 - 50x

Goal

Low: Analyze kernels in your application running on GPU

Medium: Analyze kernels running on GPU and loops/functions running on CPU in more detail

High: Analyze kernels running on GPU and loops/functions running on CPU in more detail

Analyses

Low: Survey with GPU profiling + Characterization (FLOP)

Medium: Survey with GPU profiling + Characterization (FLOP, memory object analysis with light data transfer simulation between host and target device memory) + Performance Modeling for a baseline GPU

High: Survey with GPU profiling + Characterization (Trip Counts and FLOP with call stacks for CPU, CPU cache simulation, memory object analysis with medium data transfer simulation between host and target device memory) + Performance Modeling for a baseline GPU

Result for kernels on GPU

Memory-level GPU Roofline (for CARM, L3