SEU Mitigation User Guide: Agilex™ 5 FPGAs and SoCs

ID 813649
Date 9/20/2024
Public

A newer version of this document is available. Customers should click here to go to the newest version.

Document Table of Contents

3.1. SDM ECC Error Message Bits

The SDM ECC error message bits store the error message when the Agilex™ 5 device detects an SDM ECC error.

The SDM ECC error message contains information about the sector address and the type of error. You can retrieve the contents of the error message from the generic_sdm_data_out signal of the Advanced SEU Detection Intel® FPGA IP.

Table 7.  SDM and Subsystem ECC Error Message Bits Description
Name Width Bit Description

Sector address

(Most significant 32-bit word in generic_sdm_data_out signal)

32 31:24 Reserved
23:16 Address of sector with error
15:8 Reserved
7:4 Error type:
  • 0000—SEU error
  • 0001—SDM and subsystem ECC error
  • 0010—miscellaneous CNT error 3
  • 0011—SmartVID error
  • 0100—miscellaneous SDM error
  • 0101—External Memory Interface
  • Remaining values—reserved
3:0 Reserved

Error data

(Least significant 32-bit word in generic_sdm_data_out signal)

32 31:29

SDM and Subsystem ECC error type:

  • 001—single bit error
  • 010—double bits error
  • 101—correctable single bit error in the transceiver
  • 110—uncorrectable multiple bits error in the transceiver
  • 111—other error classes
  • Remaining values—reserved
28 Correction status:
  • 0—not corrected
  • 1—corrected
27:12 Reserved
11:0

Specific SDM and Subsystem ECC error types 001 and 010 details: RAM_ID

Specific SDM and Subsystem ECC error type 101 details:
  • 0x02ACRE_SERDES_ECC_CODE_ONEBIT
  • 0x02BCRE_SERDES_ECC_DATA_ONEBIT
  • 0x02CCRE_SERDES_ECC_CODE_ONEBIT
  • 0x02DCRE_SERDES_ECC_DATA_ONEBIT
  • 0x02ECRE_RSFEC_ECC_ONEBIT
Specific SDM and Subsystem ECC error type 110 details:
  • 0x010CRE_SERDES_TWOBIT_ECC_CODE
  • 0x011CRE_SERDES_TWOBIT_ECC_DATA
  • 0x012CRE_SPICO_TWOBIT_ECC_CODE
  • 0x013CRE_SPICO_TWOBIT_ECC_DATA
  • 0x014CRE_RSFEC_ECC_TWOBIT

Specific SDM and subsystem ECC error type 111 details: 0x2FCRE_REFCLK_FAIL

Note: When an SDM and Subsystem ECC error occurs, refer to the error data at bit [31:29] for the error type and bit [11:0] for the respective error type details and perform the necessary action accordingly.
Table 8.  SDM Miscellaneous Error Message Bits Description
Name Width Bit Description

Sector address

(Most significant 32-bit word in generic_sdm_data_out signal)

32 31:24 Reserved
23:16 Address of sector with error
15:8 Reserved
7:4 Error Type:
  • 0000—SEU error
  • 0001—SDM and subsystem ECC Error
  • 0010—miscellaneous CNT error3
  • 0011—SmartVID error
  • 0100—miscellaneous SDM error
  • 0101—External Memory Interface
  • Remaining values—reserved
3:0 Reserved

Error data

(Least significant 32-bit word in generic_sdm_data_out signal)

32 31:29 SDM ECC error type:
  • 000—General error
  • 001—Single core TMR event
  • 111—WDT expiry event
  • Remaining values—reserved
28 Correction Status:
  • 0—not corrected
27:12 Reserved
11:0 Specific miscellaneous SDM error type 000 details:
  • 12'd16—general WDT error
  • 12'd17—nConfig WDT error
For other miscellaneous SDM error type:
  • Remaining values—reserved
Note: When an SDM Miscellaneous error occurs, refer to the error data at bit [31:29] for the error type and error data bit [11:0] for the respective error type details and perform the necessary action accordingly.
Table 9.  External Memory Interface Error Message Bits Description
Name Width Bit Description

Sector address

(Most significant 32-bit word in generic_sdm_data_out signal)

32 31:24 Reserved
23:16 Address of sector with error
15:8 Reserved
7:4 Error Type:
  • 0000—SEU error
  • 0001—SDM and subsystem ECC Error
  • 0010—miscellaneous CNT error3
  • 0011—SmartVID error
  • 0100—miscellaneous SDM error
  • 0101—External Memory Interface
  • Remaining values—reserved
3:0 Reserved

Error data

(Least significant 32-bit word in generic_sdm_data_out signal)

32 31:25 Reserved
24:17 EMIF_ID
16:10 SOURCE_ID
9:6

External memory interface error type:

  • 4'd1ECC_RMW_READ_LINK_DB
  • 4'd2ECC_READ_LINK_DBE
  • 4'd3ECC_READ_LINK_SBE
  • 4'd4ECC_WRITE_LINK_DBE
  • 4'd5ECC_WRITE_LINK_SBE
  • 4'd6ECC_MULTI_DBE
  • 4'd7ECC_SINGLE_DBE
  • 4'd8ECC_MULTI_SBE
  • 4'd9ECC_SINGLE_SBE
5:0 DDR_ADDR_MSB
Note: When an External Memory Interface error occurs, refer to the error data at bit [9:6] for the error type and perform the necessary action accordingly.
Important: For uncorrectable SDM ECC error, Altera recommends that you reconfigure the Agilex™ 5 device.
3 Contact Intel Premier Support and quote ID #15015577051 for further assistance when you see this error.