User Guide

Contents

profile-gpu

Analyze OpenCL™ and oneAPI Level Zero programs running on Intel® Graphics.
GUI Equivalent
Analysis Workflow
Baseline Device
GPU

Syntax

--profile-gpu
--no-profile-gpu
Default
Off (
no-profile-gpu
)
Actions Modified
collect
=survey
collect
=tripcounts
collect
=roofline
collect
=projection
Usage
Prerequisite
: Set up system environment to enable GPU kernel profiling.
Use this option to analyze a GPU-enabled application that uses Data Parallel C++, OpenMP* target, or OpenCL™ programming model.
  • For the
    GPU Roofline Insights
    , use this option to analyze code regions running on a CPU and code regions running on a GPU. This option may affect the performance of your application on the CPU side.
  • For the Offload Modeling, use this option to analyze
    only
    code regions running on a GPU. This is a
    preview
    feature.
    Make sure to use this option with the Survey, Trip Counts, and Performance Modeling analyses.
GPU profiling is applicable only to Intel® Graphics.
Example
Run the Roofline analysis and enable GPU profiling to analyze OpenCL™ and oneAPI Level Zero programs running on Intel® Graphics.
$ advisor --collect=roofline --profile-gpu --project-dir=./advi_results -- ./myApplication
  1. Run the Survey analysis with the GPU kernel profiling enabled.
  2. Run the Trip Counts and FLOP analysis with the GPU kernel profiling enabled.
  3. Run the Performance Modeling with the GPU kernel profiling enabled.
$ advisor --collect=survey --static-instruction-mix --profile-gpu --project-dir=./advi_results -- ./myApplication
$ advisor --collect=tripcounts --flop --profile-gpu --project-dir=./advi_results -- ./myApplication
$ advisor --collect=projection --profile-gpu --project-dir=./advi_results -- ./myApplication

Product and Performance Information

1

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.