Context Menus: Project Navigator
Context Menus: Sources Window Panes
Context Menus: Summary Window Panes
Dialog Box: Corresponding inspxe-cl Command Options
Dialog Box: Create a Project
Dialog Box: Create Suppression
Dialog Box: Custom Analysis
Dialog Box: Delete Suppressions
Dialog Box: Export Result
Dialog Box: Merge States
Dialog Box: Options-Debugger
Dialog Box: Options-General
Dialog Box: Options-Result Location
Dialog Box: Options-State Management
Dialog Box: Problem Report
Dialog Box: Project Properties-Binary/Symbol Search
Dialog Box: Project Properties-Source Search
Dialog Box: Project Properties-Suppressions
Dialog Box: Project Properties-Target
Dialog Box: Refine Source File Set
Dialog Box: Select Stack Frame(s)
Dialog Box: View Stack
Hot Keys
Pane: Analysis Type-Custom
Pane: Analysis Type-Memory Errors
Pane: Analysis Type-Threading Errors
Pane: Application Output
Pane: Code and Stack
Pane: Code Locations
Pane: Collection Log
Pane: Collector Messages
Pane: Compare Results
Pane: Filters
Pane: Import Result
Pane: Launch Application
Pane: Problems
Pane: Project Navigator
Pane: Timeline
Toolbar: Command
Toolbar: Intel Inspector
Toolbar: Navigation
Window: Collection Log
Window: Compare Results
Window: Import Result
Window: Sources
Window: Summary After Analysis Is Complete
Window: Summary During Analysis
Asynchronous Buffer
Possible Correction Strategies
Cross-thread Stack Access
Data Race
Deadlock
Host Pointer Used on Device
Incorrect memcpy Call
Invalid Deallocation
Invalid Kernel Argument
Invalid Kernel Argument Size
Invalid Memory Access
Invalid Partial Memory Access
Lock Hierarchy Violation
Memory Growth
Memory Leak
Memory Not Deallocated
Mismatched Allocation/Deallocation
Mismatched Queue
Missing Allocation
Non-Host Pointer
Pointer from Different Device
Thread Exit Information
Thread Start Information
Unhandled Application Exception
Uninitialized Memory Access
Uninitialized Partial Memory Access
appdebug
app-working-dir
archive-name
baseline-result
collect
collect-with
command
convert-suppression-file
create-breakpoints
create-suppression-file
csv-delimiter
debug-this
executable-of-interest
export
filter
finalize
format
help
include-snippets
include-sources
import
knob
knob-list
merge-states
module-filter
module-filter-mode
no-auto-finalize
no-summary
option-file
quiet
report
report-all
report-output
result-dir
return-app-exitcode
search-dir
sort-asc
sort-desc
suppression-file
user-data-dir
verbose
version
Asynchronous Buffer
Occurs when operations between a program executed on host and a kernel are not synchronized.
ID |
Code Location |
Description |
---|---|---|
1 |
Allocation site |
Represents a source location of passing data from a host program to a buffer without synchronization. |
2 |
Read |
Represents a source location of copying data from a buffer to host program, when kernel execution is not yet complete. |
If the operations of passing data to a buffer and copying the calculated data from this buffer back to the host are not synchronized, the program copies data from the buffer before the kernel completes execution. This results in getting initially passed data from the device kernel.
DPC++ Example
queue.submit([&](cl::sycl::handler &cgh) { cgh.parallel_for<class my_task>(cl::sycl::range<1> { N }, [=](cl::sycl::id<1> idx) { // We compute squares deviceData[idx] *= deviceData[idx]; }); }); // queue.wait(); for(int i=0; i<n; i++) std::cout << deviceData[i] << “ “;
Possible Correction Strategies
To copy the correct data to the buffer after a kernel execution, do the following:
- In the OpenCL™ kernel, use the following events to set kernel ordering:
- clGetEventInfo() enables you to get information about the current state of the kernel.
- clFinish() waits until the kernel execution ends and its state is completed.
- In the Data Parallel C++ (DPC++) program, use the queue.wait(); command to wait until the end of kernel execution before copying the calculated data to the host.
CAUTION:
Parent topic: Problem Type Reference