Intel® FPGA SDK for OpenCL™ Pro Edition: Programming Guide

ID 683846
Date 12/19/2022
Public
Document Table of Contents

7.18. Forcing Fewer Read Data Reorder Units to Reduce Area at the Expense of Read Throughput to Global Memory (-num-reorder)

To narrow the global memory interconnect in order to save area while reducing read-only throughput, use the -num-reorder=N option in your aoc command, where N is the number of bank's worth of read bandwidth you desire.

When the Intel® FPGA SDK for OpenCL™ Offline Compiler implements a ring topology for the global memory interconnect (either by automatic choice or by forcing the ring through -global-ring), it widens the interconnect by default to allow more reads to occur in parallel. This allows for the saturation of global memory throughput using read-only traffic. For example, if on a two-bank BSP you require only one bank's worth of read bandwidth, set -num-reorder=1.

Example: aoc -num-reorder=1 <your_kernel_filename>.cl