Intel® C++ Compiler Classic Developer Guide and Reference

ID 767249
Date 12/16/2022
Public

A newer version of this document is available. Customers should click here to go to the newest version.

Document Table of Contents

Floating-point Optimizations

Application performance is an important goal of the Intel® C++ Compiler, even at default optimization levels. A number of optimizations involve transformations that might affect the floating-point behavior of the application, such as evaluation of constant expressions at compile time, hoisting invariant expressions out of loops, or changes in the order of evaluation of expressions. These optimizations usually help the compiler to produce the most efficient code possible. However, the optimizations might be contrary to the floating-point requirements of the application.

Some optimizations are not consistent with strict interpretation of the ANSI or ISO standards for C and C++. Such optimizations can cause differences in rounding and small variations in floating-point results that may be more or less accurate than the ANSI-conformant result.

The Intel® C++ Compiler provides the -fp-model (Linux* and macOS) or /fp (Windows*) option, which allows you to control the optimizations performed when you build an application. The option allows you to specify the compiler rules for:

  • Value safety: Whether the compiler may perform transformations that could affect the result. For example, in the SAFE mode, the compiler won't transform x/x to 1.0 because the value of x at runtime might be a zero or a NaN . The UNSAFE mode is the default.

  • Floating-point expression evaluation: How the compiler should handle the rounding of intermediate expressions. For example, when double precision is specified, the compiler interprets the statement t0=4.0f+0.1f+t1+t2; as t0=(float)(4.1+(double)t1+(double)t2);

  • Floating-point contractions: Whether the compiler should generate fused multiply-add (FMA) instructions on processors that support them. When enabled, the compiler may generate FMA instructions for combining multiply and add operations; when disabled, the compiler must generate separate multiply and add instructions with intermediate rounding.

  • Floating-point environment access: Whether the compiler must account for the possibility that the program might access the floating-point environment, either by changing the default floating-point control settings or by reading the floating-point status flags. This is disabled by default. You can use the -fp-model:strict (Linux* and macOS) /fp:strict (Windows*) option to enable it.

  • Precise floating-point exceptions: Whether the compiler should account for the possibility that floating-point operations might produce an exception. This is disabled by default. You can use -fp-model:strict (Linux* and macOS) or /fp:strict (Windows*); or -fp-model:except (Linux* and macOS) or /fp:except (Windows*) to enable it.
    Consider the following example:

double a=1.5; int x=0; ... 
  __try {
    int t0=a; //raises inexact
    x=1;
    a*=2; 
  } __except(1) {
  printf("SEH Exception: x=%d\n", x); 
}

Without precise floating-point exceptions, the result is SEH Exception: x=1; with precision floating-point exceptions, the result is SEH Exception: x=0.

The following table describes the impact of different keywords of the option on compiler rules and optimizations:

Keyword

Value Safety

Floating-Point
Expression Evaluation

Floating-Point
Contractions

Floating-Point
Environment Access

Precise Floating-Point
Exceptions

precise
source
double
extended

Safe

Varies
Source
Double
Extended

Yes

No

No

strict

Safe

Varies

No

Yes

Yes

consistent

Safe

Varies

No

No

No

fast=1 (default)

Unsafe

Unknown

Yes

No

No

fast=2

Very unsafe

Unknown

Yes

No

No

except
except-

Unaffected
Unaffected

Unaffected
Unaffected

Unaffected
Unaffected

Unaffected
Unaffected

Yes
No

NOTE:

It is illegal to specify the except keyword in an unsafe safety mode.

Based on the objectives of an application, you can choose to use different sets of compiler options and keywords to enable or disable certain optimizations, so that you can get the desired result.