Graphics Trace Analyzer Window: Platform View
Displays the opened trace file name. The file name consists of the application name, capture date and time in the following format: YYYY_MM_DD_HH_MM_SS.
Displays the opened trace data, notifications, opens the Intel® GPA User Guide, restores the default view, and changes the Intel® GPA color scheme.
Time Line Viewer toolbar
Filters tracks, shows event duration, zooms the selection in and out.
Time Line Viewer pane
Visualizes the opened trace content.
Trace Content pane
Lists the opened trace content.
Synchronization Flow Graph
The arrows reveal event dependencies on the time line. Identify inefficient events synchronization leading to incomplete GPU load. To see the graphs, click one of the objects. Red arrows demonstrate parent-child relationships between events. Green arrows connect synchronization objects in UMD Driver Queues (Device Context Pane) and CPU threads. They also show synchronization points between objects. If you want to see several dependencies at a time, press and hold CTRL and pick objects by clicking.
This feature is only available on Windows* hosts.
Selection Statistics pane
Shows detailed information about the selection in the
The pane is only visible when any event or group of events is selected, unless the
Aggregated selection summaryis enabled in the
Returns to the
Open Trace Capturewindow.
Provides the following statistical information for the analyzed trace:
Provides controls for trace display customization:
Displays messages and non-critical errors.
If more than one message is open, use the Up or Down arrows to navigate between messages.
Provides the following properties of the analyzed trace:
Opens a form to send feedback to the Intel® GPA technical support.
Displays information on navigation controls and opens the Intel® GPA online documentation.
Time Line Viewer Toolbar
- Type Filter Expressionfield searches and filters events based on lexicographical comparison.
- Match Whole Wordbutton enables searching and filtering events by whole names.
- Select Filtered Tasksbutton selects filtered events in theTime Line Viewerpane.
- Time line ruler correlates all the displayed events with the time line. The range of the time line corresponds to the length of the trace in the current zoom level. Depending on the zoom-in level, the time intervals are displayed in seconds, milliseconds, microseconds, on nanoseconds. The default trace length is 5s.To mark a specific timestamp, you can create a bookmark on the time line ruler by double-clicking the necessary point on the time ruler.
- Zoom to Selectionbutton zooms into the time range of selected events.
- Full Zoom Outbutton restores the default zoom level.
Time Line Viewer Pane
- Track Descriptionconsists of a track name and name of the process or platform, which the track belongs to, as well as the percentage or time when the device was utilized for the trace area visible in the current zoom level.
- Track Datais graphic representation of performance events and other data, such as metrics and queue packages.
- Track Toolbarcloses tracks and shows metric ranges.
- CPU tracksVisualize aggregated CPU activity reflecting thread execution on CPU cores. Use to analyze thread execution order, execution duration, and distribution between CPU cores. Blocks of the same color represent the same thread. If you zoom into the trace, you can see the process name the thread belongs to and thread execution duration.
- GPU queuesShow GPU queues for all active video adapters generating graphics content. GPU queues can be of different types depending on the application. The Time Line Viewer pane visualizes GPU utilization over time: execution of the DMA packages on GPU. DMA package color corresponds to the color of the thread, from which the DMA package was submitted. All DMA packages have names visible on mouse hover. Additionally, DMA packages, which are essential for analysis, are marked with different icons depending on their type:
Selecting any of these packages shows an arrow that reveals calls in CPU threads related to that package. For example, you can trace the origin of a Render package from the CPU thread, to the User-Mode Driver, and up to the hardware queue.
- Render package. Render package with a present call is crosshatched.
- Signal package
- Wait package
- Paging package
- Flip queuesShows flip queues for all active video adapters. Flip queue reflects the relationship between the application present calls, present packages of GPU/CPU queues, and Vertical Synchronization (VSync) event of the monitor. Flip queue package consists of two blocks: solid-color and crosshatched. Solid-color block shows the time when Desktop Window Manager (DWM) is generating the content that needs to be displayed. The crosshatched block shows the time during which the content waits for the moment to be displayed (VSync).For applications utilizing layered flip queues, multiple layers can be displayed as sub-tracks of the general Flip Queue track.
- CPU queuesShows CPU queues for all processes generating graphics content. CPU queue represents ordered command packages to be executed, but not yet submitted to GPU. CPU queues can be of different types depending on the application. CPU queue package color corresponds to the color of the thread, from which the CPU queue package was submitted. All CPU packages have names visible on mouse hover. Additionally, CPU packages, which are essential for analysis, are marked with different icons depending on their type:
CPU synchronization is represented asWaitForSingleObject,WaitForMultipleObjects, andSetEventfunction calls on the time line track of the threads. You can use these captured events to profile different synchronization issues.Click on any of these events to visualize dependencies between synchronization events in the form of arrows. For example, an arrow pointing from aSetEventcall in one thread to aWaitForSingleObjectcall in a different thread indicates that a thread was unblocked by aSetEventcall in another thread.On CPU queues tracks, areas where a thread was active are highlighted as green bars. Areas where a thread was idle are shown as gray bars.A thread is considered active and is highlighted accordingly if this thread was actively executed on a CPU core at this time.To see OpenCL™ API calls on CPU tracks, enable OpenCL domain in
- Render package. Render package with a present call is hatched for packages named Present Token and crosshatched for packages named Present Render.
- Signal package
- Wait package
- Paging package
- EventsShows events as markers that have a timestamp, but do not have duration. Markers scope can be global or process-defined. Markers are visualized as colored triangles. Global markers are placed on the Time Line ruler; process-defined markers are shown on executed threads.
- MetricsShows any GPU/CPU metrics, which you enabled in System Analyzer or System Analyzer HUD.
- RegionsShows regions, which are logical application execution blocks, defined with Instrumentation and Tracing Technology API (ITT API) or graphics debug API. By default, block color corresponds to the color of the time line track. Each block has a name and duration.
- ThreadstrackShows executed threads of profiled processes. Use this data to analyze your application performance and behavior based on ITT API and system events. Each colored block represents a logical block of application execution marked up by the user or generated by the system. Each block has a name and duration, and can have nested blocks. By default, block color corresponds to the color of the time line track.
- Parallel ExecutiontrackParallel Execution track visualizes how the driver parallelizes execution of submitted render events.
- OpenCL ExecutiontracksOpenCL Execution tracks visualize execution of OpenCL kernels on a GPU or a CPU. To see the dependency between the tasks of submitting and executing a particular kernel, click any OpenCL packet or OpenCL API call on the CPU Thread track.The data is useful to spot synchronization issues or understand whether there is a problem in OpenCL code if you use different APIs. For example, if OpenCL Execution track is fully loaded, you can detect a problematic kernel for detailed profiling with Intel® VTune™ Profiler analyzer.To see OpenCL Execution tracks, enable OpenCL domain in
Trace Content Pane
Type Filter Expressionfield
Filters trace file tracks by name or parameters. Saves previous search keywords.
Collapse All Groupsbutton
Closes all unfolded track groups.
Expand All Groupsbutton
Unfolds all track groups.
Includes aggregated information on trace file content in the following order:
Selection Statistics Pane
- Single SelectionLeft-click any object in the track.
- Group SelectionLeft-click any objects in tracks while keeping theCTRLkeyboard button pressed down.
- Rectangular SelectionPress the left mouse button and drag the mouse in any desired direction to select the required region.
- Range SelectionPress the right mouse button and drag the mouse horizontally to select events in all tracks within the selected range. You can change the selection by dragging the right or left boundary. The selected range includes all the intercepted tasks.
Aggregated Summary table
All the selected events are grouped by event types:
For all the selected events you can view the following aggregated information:
Itemized Summary table
For each selected item in the
Aggregated Summarytable you can view the list of events with the following information:
Performance Event Details table
Shows arguments for the event selected in the
Itemized Summarytable. Available for GPU queue packages, CPU queue packets, and VSync events.