# **Intel® FPGA SDK for OpenCL<sup>™</sup> Pro Edition** #### **Version 19.2 Release Notes** Updated for Intel® Quartus® Prime Design Suite: 19.2 RN-OCL004 | 2019.07.01 Latest document on the web: PDF | HTML #### **Contents** | 1. Intel <sup>®</sup> FPGA SDK for OpenCL <sup>™</sup> Pro Edition Version 19.2 Release Notes | 3 | |-----------------------------------------------------------------------------------------------|---| | 1.1. New Features and Enhancements | | | 1.2. Operating System Support | | | 1.3. Changes to Software Behavior | | | 1.4. Known Issues and Workarounds. | | | 1.5. Software Issues Resolved | 6 | | 1.6. Software Patches Included in this Release | 6 | | 1.7. Intel FPGA SDK for OpenCL Pro Edition Release Notes Archives | 6 | | 1.8. Document Revision History of the Intel FPGA SDK for OpenCL Pro Edition Release | | | Notes | 7 | ## **1.** Intel<sup>®</sup> FPGA SDK for OpenCL<sup>™</sup> Pro Edition Version 19.2 Release Notes The $Intel^{\circledR}$ FPGA SDK for $OpenCL^{\intercal}$ Pro Edition Release Notes provides late-breaking information about the Intel FPGA Software Development Kit (SDK) for $OpenCL^{(1)(2)}$ Pro Edition and the Intel FPGA Runtime Environment (RTE) for OpenCL Pro Edition Version 19.2. #### 1.1. New Features and Enhancements The Intel FPGA SDK for OpenCL Pro Edition and the Intel FPGA RTE for OpenCL Pro Edition include the following new features: - By default, the Intel FPGA SDK for OpenCL Offline Compiler targets the Fast Emulator when you use the -march=emulator flag. For more information about emulating your OpenCL kernel, refer to Emulating and Debugging Your OpenCL Kernel in the Intel FPGA SDK for OpenCL Programming Guide. - The Legacy Emulator is still available and you can target it by using a combination of -march=emulator -legacy-emulator flags. For more information, refer to Using the Legacy Emulator in the Intel FPGA SDK for OpenCL Programming Guide. - The limit for maximum number of declared variables in the local memory per kernel is increased to 512. - Added disable\_loop\_pipelining pragma to direct the Intel FPGA SDK for OpenCL Offline Compiler to disable loop pipelining. - In the high-level design (report.html) report, the System Viewer, Block Viewer, and Cluster Viewer have been merged into a hierarchical Graph Viewer. #### 1.2. Operating System Support Information about OS support for the Intel FPGA SDK for OpenCL is available on the Operating System Support page of the Intel FPGA website. #### **Related Information** **Operating System Support** <sup>(1)</sup> OpenCL and the OpenCL logo are trademarks of Apple Inc. used by permission of the Khronos Group™. <sup>(2)</sup> The Intel FPGA SDK for OpenCL is based on a published Khronos Specification, and has passed the Khronos Conformance Testing Process. Current conformance status can be found at www.khronos.org/conformance. Intel Corporation. All rights reserved. Agilex, Altera, Arria, Cyclone, Enpirion, Intel, the Intel logo, MAX, Nios, Quartus and Stratix words and logos are trademarks of Intel Corporation or its subsidiaries in the U.S. and/or other countries. Intel warrants performance of its FPGA and semiconductor products to current specifications in accordance with Intel's standard warranty, but reserves the right to make changes to any products and services at any time without notice. Intel assumes no responsibility or liability arising out of the application or use of any information, product, or service described herein except as expressly agreed to in writing by Intel. Intel customers are advised to obtain the latest version of device specifications before relying on any published information and before placing orders for products or services. #### 1.3. Changes to Software Behavior Items listed in the following table represent cases in which the behaviors of the current release of the Intel FPGA SDK for OpenCL and the Intel FPGA RTE for OpenCL differ from the previous version. | Description | Required Actions | |--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | The OpenCL headers provided by the Intel FPGA SDK for OpenCL have been updated to the latest version distributed by Khronos. | Refer to the OpenCL 2.0 Headers in the Intel FPGA SDK for OpenCL Pro Edition: Programming Guide for more information. | | The cl.hpp header is now deprecated. | Update your OpenCL host program code to use the c12.hpp header that supports all versions of OpenCL. | | The OpenCL host runtime now returns CL_PROFILING_INFO_NOT_AVAILABLE when you call clGetEventProfilingInfo function on a cl_event associated with a cl_command_queue that was not created with the CL_QUEUE_PROFILING_ENABLE property. Previous versions of the Intel FPGA SDK for OpenCL did not enforce this requirement of the OpenCL Specification version 1.2. | Update your OpenCL host program to create the cl_command_queue with the CL_PROFILING_INFO_ENABLE property if you want to enable profiling. | | The formatting of printf command output of OpenCL vector types has been modified to be conformant with the OpenCL Specification version 1.2. | _ | | Mandatory requirement for all host programs that use Intel FPGA-specific APIs and enumerations. | If your host programs use Intel FPGA-specific APIs and enumerations, such as <code>clReadPipeIntelFPGA</code> , <code>clGetProfileInfoIntelFPGA</code> , and other APIs that end with <code>IntelFPGA</code> , you must explicitly include the <code>CL/cl_ext_intelfpga.h</code> header file in your source code. | | The host pipe feature requires the use of OpenCL 2.0 features. | Enable support for OpenCL 2.0 APIs in all of your host programs that use host pipes in their source code. Refer to the OpenCL 2.0 Headers in the Intel FPGA SDK for OpenCL Pro Edition: Programming Guide for more information. | | In previous versions of Intel FPGA SDK for OpenCL, the offline compiler automatically disabled the local memory replication for all memory systems if it estimated an M20K utilization of over 100%. The offline compiler no longer disables the local memory replication automatically. | To restore the behavior of disabling the local memory replication automatically, use the -no-local-mem-replication option in the aoc command. | | Automatic loop unrolling is now disabled. | If you want to enable it, use LLVM options -unroll- threshold and -unroll-partial-threshold in the aoc command. Recommended settings: • For optimization levels -00, -01, and -02, set - unroll-threshold=150 and -unroll- partial-threshold=150. • For optimization level -03, set -unroll- threshold=300 and -unroll-partial- threshold=150. | | Support for passing pipes or channels by reference is removed. | Update your code to pass pipes or channels by value. | | The restrict keyword is renamed torestrict. | Update your code to use therestrict keyword instead of the restrict keyword. | Items listed in the following table represent cases in which the behaviors of the current release of the Intel FPGA SDK for OpenCL Custom Platform Toolkit and Reference Platforms differ from the previous version. | Description | Workaround | |-------------|------------| | N/A | N/A | #### **Related Information** OpenCL 2.0 Headers #### 1.4. Known Issues and Workarounds This section provides information about known issues that affect the Intel FPGA SDK for OpenCL and the Intel FPGA RTE for OpenCL Version 19.2. | Description | Workaround | | |------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--| | In the fast emulator, the memory used for modeling channel depth scales differently than under the legacy emulator. This means that some designs (that can be compiled for the legacy emulator) fail to compile for the fast emulator and display the following warning and error message: *Internal compiler error* Unable to allocate | If possible, scale down the design or limit the channel depth. If everything else fails, use the legacy emulator. | | | section memory! | | | | The OpenCL emulator does not return CL_INVALID_GLOBAL_OFFSET when a kernel with the uses_global_work_offset(0) kernel attribute set is enqueued with a non-zero or non-NULL global_work_offset argument. | Set the global_work_offset argument to NULL or zero in all work group dimensions so that this issue does not change the kernel output between the emulator and hardware. | | | OpenCL kernels with names longer than 61 characters might fail in the Intel Quartus® Prime Pro Edition compiler with an error similar to the following error: | Reduce the size of the OpenCL kernel name. | | | Error (16045): Instance " <long_kernel_name>_cra_slave_inst" instantiates undefined entity "<long_kernel_name>_function_cra_slave" File: <filename> Line: <li>linenumber&gt;</li></filename></long_kernel_name></long_kernel_name> | | | | Valid OpenCL kernel pipes cannot be passed as arguments in some cases. The symptom is the runtime will receive a CL_INVALID_BUFFER_SIZE (-61) error when you enqueue your kernel. | Modify your design to use channels instead of pipes. | | | The emulator runtime results in an assertion error if a kernel is enqueued 16,000 times. | Do not enqueue a kernel more than 16,000 times. | | | When alternatively using sub-buffers and their parent buffers, changes written to one might not be reflected in the other. | Unmapping and mapping a buffer forces the sub-buffers and their parent buffers to be synced. Unmapping and mapping a buffer between buffer uses should prevent this issue. | | | In the OpenCL runtime, making more than one OpenCL context in a multithreaded environment might cause a segmentation fault. | _ | | This section provides information about known issues that affect the current release of the Intel FPGA SDK for OpenCL Custom Platform Toolkit and Reference Platforms. These issues might also affect Custom Platforms you create for use with the Intel FPGA SDK for OpenCL. | Description | Workaround | |-------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------| | Race conditions can occur between enqueue and dequeue buffer operations and host pipe operations. These conditions can results in incorrect data being read or written. | Manually make sure that no enqueue or dequeue buffer operations occur in parallel with host pipe API calls. | | | continued | | Description | Workaround | |------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | | A way to ensure that buffer operations do not occur in parallel with host pipe API calls is to do buffer operations as blocking calls before the first host pipe operation and after you are certain that the last host pipe operation has been completed (for example, all the data is read back). | | For Windows, when the host application queries the number of devices, calls to clGetDeviceIDs return 128 devices regardless of the actual number of devices present. Note: You can find the actual available devices at the beginning of the device list returned by clGetDeviceIDs. This issue affects the Intel Arria® 10 GX FPGA Development Kit Reference Platform and the Intel Stratix® 10 GX FPGA Development Kit Reference Platform. | Perform one of the following workarounds: Rewrite the host application to limit the query for clGetDeviceIDs to the actual number of devices. Rewrite the host application to use clGetDeviceInfo to query which devices are available. Calling clGetDeviceInfo with the CL_DEVICE_AVAILABLE flag correctly reports that extraneous devices are unavailable. Rewrite the host application to only call clCreateContext with the actual number of devices. Calling clCreateContext with extraneous devices will fail with the error CL_DEVICE_NOT_AVAILABLE. Set the environment variable CL_OVERRIDE_NUM_DEVICES_INTELFPGA to the correct number of devices. Doing so fixes the erroneous behavior of clGetDeviceIDs. | #### **Latest Known Intel FPGA SDK for OpenCL Software Issues** For additional known issue information for the current Intel FPGA SDK for OpenCL version and for previous versions, refer to the Knowledge Base web page. #### **Related Information** Knowledge Base #### 1.5. Software Issues Resolved No software issues were reported, corrected or otherwise resolved in the Intel FPGA SDK for OpenCL and the Intel FPGA RTE for OpenCL Version 19.2. #### 1.6. Software Patches Included in this Release The following software patch is included in this release: #### Table 2. Software Patches Included in the Intel FPGA SDK for OpenCL | Software Version | Patch | Customer Service Request<br>Number | |----------------------------------------|--------|------------------------------------| | Intel FPGA SDK for OpenCL Version 19.2 | 0.12cl | - | #### 1.7. Intel FPGA SDK for OpenCL Pro Edition Release Notes Archives If the table does not list a software version, the user guide for the previous software version applies. | Intel Quartus Prime<br>Version | User Guide | |--------------------------------|------------------------------------------------------------------| | 19.1 | Intel FPGA SDK for OpenCL Pro Edition Version 19.1 Release Notes | | 18.1 | Intel FPGA SDK for OpenCL Pro Edition Version 18.1 Release Notes | | | continued | ### 1. Intel® FPGA SDK for OpenCL™ Pro Edition Version 19.2 Release Notes RN-OCL004 | 2019.07.01 | Intel Quartus Prime<br>Version | User Guide | | |--------------------------------|-----------------------------------------------------|--| | 18.0 | Intel FPGA SDK for OpenCL Pro Edition Release Notes | | | 17.1 | Intel FPGA SDK for OpenCL Release Notes | | | 17.0 | Intel FPGA SDK for OpenCL Release Notes | | | 16.1 | Intel FPGA SDK for OpenCL Release Notes | | | 16.0 | Altera SDK for OpenCL Version 16.0 Release Notes | | | 15.1 | Altera SDK for OpenCL Version 15.1 Release Notes | | | 15.0 | Altera SDK for OpenCL Version 15.0 Release Notes | | ## 1.8. Document Revision History of the Intel FPGA SDK for OpenCL Pro Edition Release Notes | <b>Document Version</b> | Intel Quartus<br>Prime Version | Changes | |-------------------------|--------------------------------|------------------| | 2019.07.01 | 19.2 | Initial release. |