User Guide

Run GPU Roofline Insights Perspective from GUI

  1. Prerequisites:
    1. Configure your system to analyze GPU kernels.
    2. In the graphical user interface (GUI): Create a project and specify an analysis target and target options.
    To configure and run the GPU Roofline Insights perspective from the GUI:
  2. Configure the perspective and set analysis properties, depending on desired results:
    • Select a collection accuracy level with analysis properties preset for a specific result:
      • Low: Analyze performance of kernels executed on GPU and plot a GPU Roofline chart for all memory levels. Plot a basic CPU Roofline chart for loops/functions executed on CPU.
      • Medium: Analyze performance of kernels executed on GPU, plot a GPU Roofline chart for all memory levels, and model the application performance to get more optimization recommendations. Plot a basic CPU Roofline chart for loops/functions executed on CPU.
      • High: Analyze performance of kernels executed on GPU, plot a GPU Roofline chart for all memory levels, and model the application performance to get more optimization recommendations. Plot an extended CPU Roofline chart for loops/functions executed on CPU for all memory levels.
    • Select the analyses and properties manually to adjust the perspective flow to your needs. The accuracy level is set to Custom.
    The higher the accuracy level you choose, the more runtime overhead is added to your application. The Overhead indicator shows the overhead for the selected configuration. For the Custom accuracy, the overhead is calculated automatically for the selected analyses and properties.
    By default, accuracy is set to Low. For more information, see GPU Roofline Accuracy Presets.
    For the GPU Roofline Insights perspective, the High accuracy level controls the complexity of the CPU Roofline chart generated for loops/functions in your code executed on CPU. If you want to analyze only code regions executed on GPU, select the Low or Medium accuracy. This decreases analysis overhead.
  3. If you have multiple GPUs connected to your system, select the target GPU to collect data for from the Target GPU drop-down.
    The drop-down shows an adapter address and a name for each available GPU. The address is in the format <domain>:<bus>:<device-number>.<function-number>.
  4. Run the perspective: click the run button.
    While the perspective is running, you can do the following in the Analysis Workflow tab:
    • Control the perspective execution:
      • Stop data collection and see the already collected data: Click the button.
      • Pause data collection: Click the button.
      • Cancel data collection and discard the collected data: Click the button.
    • Expand an analysis to control its execution:
      • Pause the analysis: Click the button.
      • Stop the currently running analysis and start the next selected analysis: Click the button.
      • Interrupt execution of all selected analyses and see the already collected data: Click the button.
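The adapter address format shown in the Target GPU drop-down (step 3) can be checked and split with a few lines of shell. This is only a minimal sketch: the function names `is_gpu_address` and `split_gpu_address` are hypothetical helpers, and treating each field as hexadecimal digits is an assumption, since the guide gives only the field layout.

```shell
#!/bin/sh
# Sketch: validate and split an adapter address of the form
# <domain>:<bus>:<device-number>.<function-number>.

is_gpu_address() {
    # Assumption: every field is one or more hex digits.
    printf '%s\n' "$1" | grep -Eq '^[0-9A-Fa-f]+:[0-9A-Fa-f]+:[0-9A-Fa-f]+\.[0-9A-Fa-f]+$'
}

split_gpu_address() {
    is_gpu_address "$1" || { echo "invalid GPU address: $1" >&2; return 1; }
    domain=${1%%:*}        # text before the first colon
    rest=${1#*:}           # remaining <bus>:<device>.<function>
    bus=${rest%%:*}
    devfn=${rest#*:}       # remaining <device>.<function>
    device=${devfn%%.*}
    func=${devfn#*.}
    echo "domain=$domain bus=$bus device=$device function=$func"
}

# Example address chosen for illustration only.
split_gpu_address "0:0:2.0"
```

A check like this may be useful when copying an address from the drop-down into a script, so that a mistyped separator is caught before any collection starts.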
To run the GPU Roofline Insights perspective with the Low accuracy from the command line interface:
advisor --collect=roofline --profile-gpu --project-dir=./advi_results -- ./myApplication
To generate command lines for the selected perspective configuration, click the Command Line button.
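For scripted runs, the Low accuracy command above could be wrapped so that the project directory and application path are parameters. The following is a rough sketch, not a generated command line: `build_roofline_cmd`, `PROJECT_DIR`, `APP`, and `RUN` are hypothetical names introduced here, and only the options that appear in the command above are used.

```shell
#!/bin/sh
# Sketch: compose the Low accuracy GPU Roofline command and print it.
# The command is executed only when RUN=1 and advisor is found on PATH.

# Hypothetical defaults taken from the example command; override via environment.
PROJECT_DIR="${PROJECT_DIR:-./advi_results}"
APP="${APP:-./myApplication}"

build_roofline_cmd() {
    # $1 = project directory, $2 = application to profile
    printf 'advisor --collect=roofline --profile-gpu --project-dir=%s -- %s\n' "$1" "$2"
}

# Dry run: show what would be executed.
build_roofline_cmd "$PROJECT_DIR" "$APP"

if [ "${RUN:-0}" = "1" ] && command -v advisor >/dev/null 2>&1; then
    advisor --collect=roofline --profile-gpu --project-dir="$PROJECT_DIR" -- "$APP"
fi
```

Printing by default keeps the script safe to try on a machine without the tool installed. For configurations other than Low, the Command Line button remains the reliable source of the exact options.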
Once the GPU Roofline Insights perspective collects data, the report opens showing a Summary with performance metrics measured for CPU- and GPU-executed parts of your application and preview Roofline charts. Continue to examine GPU bottlenecks on the Roofline chart to investigate the results.

Product and Performance Information

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.