Run GPU Roofline Insights Perspective from GUI

Prerequisites:
- Configure your system to analyze GPU kernels.
- In the graphical user interface (GUI), create a project and specify an analysis target and target options.
To configure and run the GPU Roofline Insights perspective from the GUI:
- Configure the perspective and set analysis properties, depending on desired results:
  - Select a collection accuracy level with analysis properties preset for a specific result:
    - Low: Analyze performance of kernels executed on GPU and plot a GPU Roofline chart for all memory levels. Plot a basic CPU Roofline chart for loops/functions executed on CPU.
    - Medium: Analyze performance of kernels executed on GPU, plot a GPU Roofline chart for all memory levels, and model the application performance to get more optimization recommendations. Plot a basic CPU Roofline chart for loops/functions executed on CPU.
    - High: Analyze performance of kernels executed on GPU, plot a GPU Roofline chart for all memory levels, and model the application performance to get more optimization recommendations. Plot an extended CPU Roofline chart for loops/functions executed on CPU for all memory levels.
  - Select the analyses and properties manually to adjust the perspective flow to your needs. The accuracy level is set to Custom.
  By default, accuracy is set to Low. The higher the accuracy level you choose, the more runtime overhead is added to your application. The Overhead indicator shows the overhead for the selected configuration. For the Custom accuracy, the overhead is calculated automatically for the selected analyses and properties.
  For more information, see GPU Roofline Accuracy Presets.
  If you want to analyze only code regions executed on GPU, select the Low or Medium accuracy. This decreases analysis overhead. For the GPU Roofline Insights perspective, the High accuracy level controls the complexity of the CPU Roofline chart generated for loops/functions in your code executed on CPU.
- If you have multiple GPUs connected to your system, select a target GPU to collect data for from the Target GPU drop-down. The drop-down shows an adapter address and a name for each GPU available. The address is in the format <domain>:<bus>:<device-number>.<function-number>.
- Run the perspective: click the run button.
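To make the Target GPU address format concrete, the following minimal sketch splits an address of the form <domain>:<bus>:<device-number>.<function-number> into its four fields. The helper name parse_gpu_address and the sample address 0:0:2.0 are illustrative assumptions, not values taken from the tool.

```shell
#!/bin/sh
# Hypothetical helper (not part of the tool): split a GPU adapter address of
# the form <domain>:<bus>:<device-number>.<function-number> into its fields.
parse_gpu_address() {
    addr=$1
    # Reject anything that does not contain two ':' separators and a '.'.
    case $addr in
        *:*:*.*) ;;
        *) echo "unexpected address format: $addr" >&2; return 1 ;;
    esac
    domain=${addr%%:*}     # text before the first ':'
    rest=${addr#*:}        # text after the first ':'
    bus=${rest%%:*}        # text before the next ':'
    devfn=${rest#*:}       # "<device-number>.<function-number>"
    device=${devfn%%.*}    # text before the '.'
    func=${devfn#*.}       # text after the '.'
    echo "domain=$domain bus=$bus device=$device function=$func"
}

# Sample address chosen for illustration only.
parse_gpu_address "0:0:2.0"
```

For the sample address, the helper prints domain=0 bus=0 device=2 function=0.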
While the perspective is running, you can do the following in the Analysis Workflow tab:
- Control the perspective execution:
  - Stop data collection and see the already collected data: click the stop button.
  - Pause data collection: click the pause button.
  - Cancel data collection and discard the collected data: click the cancel button.
- Expand an analysis to control the analysis execution:
  - Pause the analysis: click the pause button.
  - Stop the currently running analysis and start the next selected analysis: click the corresponding button.
  - Interrupt execution of all selected analyses and see the already collected data: click the corresponding button.
To run the GPU Roofline Insights perspective with the Low accuracy from the command line interface:
advisor --collect=roofline --profile-gpu --project-dir=./advi_results -- ./myApplication
See Run GPU Roofline Insights from Command Line for details.
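If you run the Low accuracy command repeatedly, a small wrapper can make the project directory and application path configurable. The sketch below is one possible way to do that. The names roofline_cmd, PROJECT_DIR, APP, and RUN_COLLECTION are hypothetical, and only the advisor options already shown in the command above are used.

```shell
#!/bin/sh
# Hypothetical wrapper around the Low-accuracy command shown above.
# PROJECT_DIR and APP default to the values used in that command.
PROJECT_DIR=${PROJECT_DIR:-./advi_results}
APP=${APP:-./myApplication}

# Print the command line so it can be reviewed before running it.
roofline_cmd() {
    echo "advisor --collect=roofline --profile-gpu --project-dir=$1 -- $2"
}

roofline_cmd "$PROJECT_DIR" "$APP"

# Run the collection only when explicitly requested (RUN_COLLECTION=1)
# and only if the advisor executable is available on PATH.
if [ "${RUN_COLLECTION:-0}" = "1" ]; then
    if command -v advisor >/dev/null 2>&1; then
        advisor --collect=roofline --profile-gpu --project-dir="$PROJECT_DIR" -- "$APP"
    else
        echo "advisor not found on PATH" >&2
        exit 1
    fi
fi
```

By default the script only prints the command. Setting RUN_COLLECTION=1 in the environment starts the collection, which keeps an accidental invocation from adding profiling overhead to a run.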
Once the GPU Roofline Insights perspective collects data, the report opens showing a Summary with performance metrics measured for CPU- and GPU-executed parts of your application and preview Roofline charts. Continue to examine GPU bottlenecks on the Roofline chart to investigate the results.