Developer Guide

Developer Guide for Intel® oneAPI Math Kernel Library Linux*

ID 766690
Date 3/31/2023
Public

A newer version of this document is available. Customers should click here to go to the newest version.

Visible to Intel only — GUID: GUID-E87C4C39-0CCE-4B67-BC69-1854A4D8D257

Document Table of Contents

Configuring Parameters

The most significant parameters in HPL.dat are P, Q, NB, and N. Specify them as follows:

  • P and Q - the number of rows and columns in the process grid, respectively.

    P*Q must be the number of MPI processes that HPL is using.

    Choose PQ.

  • NB - the block size of the data distribution.

    The table below shows recommended values of NB for different Intel® processors:

    Processor

    NB

    Intel® Xeon® Processor X56*/E56*/E7-*/E7*/X7* (codenamed Nehalem or Westmere)

    256

    Intel Xeon Processor E26*/E26* v2 (codenamed Sandy Bridge or Ivy Bridge)

    256

    Intel Xeon Processor E26* v3/E26* v4 (codenamed Haswell or Broadwell)

    192

    Intel® Core™ i3/i5/i7-6* Processor (codenamed Skylake Client)

    192

    Intel® Xeon Phi™ Processor 72* (codenamed Knights Landing)

    336

    Intel Xeon Processor supporting Intel® Advanced Vector Extensions 512 (Intel® AVX-512) instructions (codenamed Skylake Server)

    384

  • N - the problem size:

    NOTE:

    Increasing N usually increases performance, but the size of N is bounded by memory. In general, you can compute the memory required to store the matrix (which does not count internal buffers) as 8*N*N/(P*Q) bytes, where N is the problem size and P and Q are the process grids in HPL.dat. A general rule of thumb is to choose a problem size that fills 80% of memory.

Product and Performance Information

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.

Notice revision #20201201