Declare the ap_float Data Type

Developer Guide

Intel® oneAPI DPC++/C++ Compiler Handbook for FPGAs

Download PDF

ID 785441

Date 6/24/2024

Version

Public

A newer version of this document is available. Customers should click here to go to the newest version.

Declare the ap_float Data Type

The ap_float.hpp header file provides support for arbitrary-precision floating-point numbers. The floating-point representation for ap_float data types adopts the same data layout as the IEEE 754 floating-point representation.

An ap_float variable carries an explicit sign bit and an arbitrary number of bits for the exponent and mantissa. Due to the differences in the internal math implementations and rounding errors, the results from ap_float operations might not always be bit-accurate compared to those produced by C++ native floating-point types with the same exponent and mantissa bit widths.

Perform the following steps to declare the ap_float data type:

Include the ap_float.hpp header file as follows:

#include <sycl/ext/intel/ac_types/ap_float.hpp>

Declare your ap_float variables as follows:

ihc::ap_float<exponent_width, mantissa_width[,rounding_mode]>

Where, the template attributes are defined as follows:

exponent_width, mantissa_width: The bit-width of the exponent and mantissa of the floating-point variable.

The ap_float.hpp header file also provides aliases to declare bfloat16 and bfloat19 data types directly. The ap_float data type supports the following exponent_width, mantissa_width combinations:

Exponent- and Mantissa-Width Combinations Supported by the `ap_float` Data Type
5, 10	8, 7	8, 10	8, 17	8, 23
8, 26	10, 35	11, 44	11, 52	15, 63

Some of these width combinations map to some commonly used floating-point formats, as listed in the following table:

Exponent- and Mantissa-Width Setting for Various Floating-point Formats
Floating-point Format	`exponent_width`, `mantissa_width` Setting
IEEE 754 half-precision (binary16)	5, 10
bfloat16	8, 7
bfloat19	8,10
IEEE 754 single-precision (binary32)	8. 23
IEEE 754 double-precision (binary64)	11, 52
80-bit extended precision	15, 63

rounding_mode: Optional parameter to specify the IEEE 754 rounding mode used when converting between data types. Set the rounding mode with one of the following values:

Rounding Mode Values
Rounding Mode	Description
`ihc::fp_config::FP_Round::RNE`	Round to the nearest, tie break to even. This rounding mode is more accurate (0.5 ULP) but requires more FPGA area.
`ihc::fp_config::FP_Round::RZERO`	Round towards zero. This rounding mode is less accurate (1 ULP) and requires less FPGA area.

NOTE:

If you do not set the rounding_mode parameter, the ihc::FP_Round::RNE rounding mode is used by default.

Math Functions Supported by `ap_float` Data Type

The ap_float data type supports all overloaded math operators and a limited set of the math functions provided by the Intel® oneAPI DPC++/C++ Compiler. For some math operators, you can control the output's precision by using templated versions of the functions.

IMPORTANT:

Due to the differences in the internal math implementations and rounding errors, the results from ap_float operations might not always be bit-accurate when compared to those produced by C++ native floating-point types with the same exponent and mantissa-bit widths. However, these results are validated against the infinitely accurate results.

The following additional math functions are supported through the ap_float_math.hpp header file. All functions are in the ihc namespace.

Math Functions Provided by the `ap_float_math.hpp` header file
Function Type	Math Function	Comment
Exponential and logarithmic functions	`ihc_exp` `ihc_exp2` `ihc_exp10` `ihc_log` `ihc_log2` `ihc_log10`	Supported only for `ap_float` data types with exponent width less than or equal to 15 bits and mantissa width less than or equal to 63 bits.
Exponential and logarithmic functions	`ihc_expm1` `ihc_log1p`	Supported only for `ap_float` data types with exponent width less than or equal to 11 bits and mantissa width less than or equal to 52 bits.
Advanced functions	`ihc_recip` `ihc_rsqrt` `ihc_sqrt` `ihc_cbrt` `ihc_hypot`
Power functions	`ihc_pow` `ihc_pown` `ihc_powr`
Trigonometric functions	`ihc_sin` `ihc_sinpi` `ihc_cos` `ihc_cospi` `ihc_sincos` `ihc_sincospi` `ihc_asin` `ihc_asinpi` `ihc_acos` `ihc_acospi` `ihc_atan` `ihc_atanpi` `ihc_atan2`

Parent topic: Declare and Use the AC Data Types

¹ You can also declare this format directly as a ihc::bfloat16 data type.

² You can also declare this format directly as a ihc::bfloat19 data type.

³ You can also declare this format directly as a ihc::FPsingle data type.

⁴ You can also declare this format directly as a ihc::FPdouble data type.

⁵ Not a bit-to-bit mapping. The integer part (bit 63) of the 80-bit extended precision value is dropped when converting it to ap_float<15,63>.

Select Your Language

Using Intel.com Search

Quick Links

Recent Searches

Advanced Search

Only search in

Intel® oneAPI DPC++/C++ Compiler Handbook for FPGAs

Declare the ap_float Data Type

Math Functions Supported by `ap_float` Data Type

Using Intel.com Search

Quick Links

Recent Searches

Advanced Search

Only search in

Intel® oneAPI DPC++/C++ Compiler Handbook for FPGAs

Declare the ap_float Data Type

Math Functions Supported by ap_float Data Type

Math Functions Supported by `ap_float` Data Type