Intel Acceleration Stack for Intel® Xeon® CPU with FPGAs Core Cache Interface (CCI-P) Reference Manual

ID 683193
Date 11/04/2019
Public
Document Table of Contents

1.3.15. CCI-P Guidance

This section suggests techniques and settings that are useful when beginning to use the Integrated FPGA Platform or Intel® FPGA PAC with Intel® FPGA IP system.

The CCI-P interface provides several advanced features for fine grained control of FPGA caching states and virtual channels. When used correctly, optimal performance through the interface can be obtained; if used incorrectly, you may see significant degradation in performance.

The table below lists some suggested parameters for request fields.

Table 40.  Recommended Choices for Memory Requests
Field Recommended Option
vc_sel For producer-consumer type flows VA
For latency sensitive flows VL0
For data dependent flow Use any one of the VCs, except VA; or use MPF's VC map
cl_len For maximum bandwidth 4 CLs (256 bytes)
req_type Memory reads RdLine_I
Memory writes WrLine_M
Use the following guidance, when setting the size of the request buffers in the AFU:
  • Intel® FPGA PAC
    • 64 outstanding requests on VH0
    • VA and VH0 can share the same 64 outstanding request buffers
  • Integrated FPGA Platform
    • VH0 and VH1 can each have 64 outstanding requests.
    • VL0 requires at least 128 transactions in flight to reach full bandwidth, and no more than 256 outstanding requests are required to cover the long latency tail.
    • For VA, maximum performance can be achieved with a minimum of 256 transactions and a maximum of 384 transactions. Consider sharing VA buffers with other VCs, to save design area.