Performance Through Partnership

As an industry leader in HPC, Intel engages with software vendors like Ansys, Altair, Quantifi, and Dassault, while providing marketing and engineering expertise and resources. These relationships help ensure that popular HPC solutions take full advantage of Intel® hardware and software capabilities so end customers enjoy exceptional power, performance, and price.

HPC Focus Segments

Software vendor HPC applications take advantage of Intel® Xeon® Scalable processors with proven leadership performance1 and the highest available memory bandwidth in any HPC CPU.2 These improvements combined with revolutionary memory and storage capabilities in Intel® Optane™ technology help support compute- and storage-intensive workloads for advanced HPC use cases across multiple segments.

Product and Performance Information

1HPCG: Platinum 8358: 1-node, 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: 2019u5 MKL; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel MPI 2019u8; threads/core: 1; Turbo: used; Build knobs: -O3 -ip -xCORE-AVX512. EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: 2019u5 MKL; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel MPI 2019u8; threads/core: 1; Turbo: used; Build knobs: -O3 -ip -march=core-avx2, tested by Intel and results as of April 2021. HPL: Platinum 8358: 1-node, 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: The Intel Distribution for LINPACK Benchmark; Build notes: Tools: Intel MPI 2019u7; threads/core: 1; Turbo: used; Build: build script from Intel Distribution for LINPACK package; 1 rank per NUMA node: 1 rank per socket, EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: AMD official HPL 2.3 MT version with BLIS 2.1; Build notes: Tools: hpc-x 2.7.0; threads/core: 1; Turbo: used; Build: pre-built binary (gcc built) from https://developer.amd.com/amd-aocl/blas-library/; 1 rank per L3 cache, 4 threads per rank, tested by Intel and results as of April 2021. STREAM Triad: Platinum 8358: 1-node, 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: McCalpin_​STREAM_​OMP-version; Build notes: Tools: Intel C Compiler 2019u5; threads/core: 1; Turbo: used; BIOS settings: HT=on Turbo=On SNC=On. EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: McCalpin_​STREAM_​OMP-version; Build notes: Tools: Intel C Compiler 2019u5; threads/core: 1; Turbo: used; BIOS settings: HT=on Turbo=On SNC=On, tested by Intel and results as of April 2021. WRF Geomean of Conus-12km, Conus-2.5km, NWSC-3 NA-3km: Platinum 8358: 1-node 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: 4.2.2; Build notes: Intel Fortran Compiler 2020u4, Intel MPI 2020u4; threads/core: 1; Turbo: used; Build knobs:-ip -w -O3 -xCORE-AVX2 -vec-threshold0 -ftz -align array64byte -qno-opt-dynamic-align -fno-alias $(FORMAT_​FREE) $(BYTESWAPIO) -fp-model fast=2 -fimf-use-svml=true -inline-max-size=12000 -inline-max-total-size=30000. EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: 4.2.2; Build notes: Intel Fortran Compiler 2020u4, Intel MPI 2020u4; threads/core: 1; Turbo: used; Build knobs: -ip -w -O3 -march=core-avx2 -ftz -align all -fno-alias $(FORMAT_​FREE) $(BYTESWAPIO) -fp-model fast=2 -inline-max-size=12000 -inline-max-total-size=30000, tested by Intel and results as of April 2021. Binomial Options: Platinum 8358: 1-node, 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: v1.0; Build notes: Tools: Intel C Compiler 2020u4, Intel Threading Building Blocks ; threads/core: 2; Turbo: used; Build knobs: -O3 -xCORE-AVX512 -qopt-zmm-usage=high -fimf-domain-exclusion=31 -fimf-accuracy-bits=11 -no-prec-div -no-prec-sqrt. EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: v1.0; Build notes: Tools: Intel C Compiler 2020u4, Intel Threading Building Blocks ; threads/core: 2; Turbo: used; Build knobs: -O3 -march=core-avx2 -fimf-domain-exclusion=31 -fimf-accuracy-bits=11 -no-prec-div -no-prec-sqrt, tested by Intel and results as of April 2021. Monte Carlo: Platinum 8358: 1-node, 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: v1.1; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel Threading Building Blocks 2020u4; threads/core: 1; Turbo: used; Build knobs: -O3 -xCORE-AVX512 -qopt-zmm-usage=high -fimf-precision=low -fimf-domain-exclusion=31 -no-prec-div -no-prec-sqrt. EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: v1.1; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel Threading Building Blocks 2020u4; threads/core: 2; Turbo: used; Build knobs: -O3 -march=core-avx2 -fimf-precision=low -fimf-domain-exclusion=31 -no-prec-div -no-prec-sqrt, tested by Intel and results as of April 2021. Ansys Fluent Geomean of aircraft_​wing_​14m, aircraft_​wing_​2m, combustor_​12m, combustor_​16m, combustor_​71m, exhaust_​system_​33m, fluidized_​bed_​2m, ice_​2m, landing_​gear_​15m, oil_​rig_​7m, pump_​2m, rotor_​3m, sedan_​4m: Platinum 8358: 1-node, 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: 2021 R1; Build notes: One thread per core; Multi-threading Enabled; Turbo Boost Enabled; Intel FORTRAN Compiler 19.5.0; Intel C/C++ Compiler 19.5.0; Intel Math Kernel Library 2020.0.0; Intel MPI Library 2019 Update 8. EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: 2021 R1; Build notes: One thread per core; Multi-threading Enabled; Turbo Boost Enabled; Intel FORTRAN Compiler 19.5.0; Intel C/C++ Compiler 19.5.0; Intel Math Kernel Library 2020.0.0; Intel MPI Library 2019 Update 8, tested by Intel and results as of April 2021. Ansys LS-DYNA Geomean of car2car-120ms, ODB_​10M-30ms: Platinum 8358: 1-node, 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: R11; Build notes: Tools: Intel Compiler 2019u5 (AVX512), Intel MPI 2019u9; threads/core: 1; Turbo: used. EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: R11; Build notes: Tools: Intel Compiler 2019u5 (AMDAVX2), Intel MPI 2019u9; threads/core: 1; Turbo: used, tested by Intel and results as of April 2021. OpenFOAM 42M_​cell_​motorbike: Platinum 8358: 1-node, 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: v8; Build notes: Tools: Intel FORTRAN Compiler 2020u4, Intel C Compiler 2020u4, Intel MPI 2019u8; threads/core: 1; Turbo: used; Build knobs: -O3 -ip -xCORE-AVX512. EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: v8; Build notes: Tools: Intel FORTRAN Compiler 2020u4, Intel C Compiler 2020u4, Intel MPI 2019u8; threads/core: 1; Turbo: used; Build knobs: -O3 -ip -march=core-avx2, tested by Intel and results as of April 2021. LAMMPS Geomean of Polyethylene, Stillinger-Weber, Tersoff, Water: Platinum 8358: 1-node, 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: v2020-10-29; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel Threading Building Blocks 2020u4, Intel MPI 2019u8; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -xCORE-AVX512 -qopt-zmm-usage=high. EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: v2020-10-29; Build notes: Tools: Intel MKL 2020u4, Intel C Compiler 2020u4, Intel Threading Building Blocks 2020u4, Intel MPI 2019u8; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -march=core-avx2, tested by Intel and results as of April 2021. NAMD Geomean of Apoa1, STMV: Platinum 8358: 1-node, 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: 2.15-Alpha1 (includes AVX tiles algorithm); Build notes: Tools: Intel MKL , Intel C Compiler 2020u4, Intel MPI 2019u8, Intel Threading Building Blocks 2020u4; threads/core: 2; Turbo: used; Build knobs: -ip -fp-model fast=2 -no-prec-div -qoverride-limits -qopenmp-simd -O3 -xCORE-AVX512 -qopt-zmm-usage=high. EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: 2.15-Alpha1 (includes AVX tiles algorithm); Build notes: Tools: Intel MKL , AOCC 2.2.0, gcc 9.3.0, Intel MPI 2019u8; threads/core: 2; Turbo: used; Build knobs: -O3 -fomit-frame-pointer -march=znver1 -ffast-math, tested by Intel and results as of April 2021. RELION Plasmodium Ribosome: Platinum 8358: 1-node, 2x Intel® Xeon® Platinum 8358 (32C/2.6GHz, 250W TDP) processor on Intel Software Development Platform with 256 GB (16 slots/ 16GB/ 3200) total DDR4 memory, ucode 0x261, HT on, Turbo on, CentOS Linux 8.3.2011, 4.18.0-240.1.1.el8_​3.crt1.x86_​64, 1x Intel_​SSDSC2KG96, App Version: 3_​1_​1; Build notes: Tools: Intel C Compiler 2020u4, Intel MPI 2019u9; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -g -debug inline-debug-info -xCOMMON-AVX512 -qopt-report=5 –restrict. EPYC 7543: 1-node, 2-socket AMD EPYC 7543 (32C/2.8GHz, 240W cTDP) on Dell PowerEdge R7525 server with 1024 GB (16 slots/ 64GB/3200) total DDR4 memory, ucode 0xa001119, SMT on, Boost on, Power deterministic mode, NPS=4, Red Hat Enterprise Linux 8.3, 4.18, 2x Micron 5300 Pro, App Version: 3_​1_​1; Build notes: Tools: Intel C Compiler 2020u4, Intel MPI 2019u9; threads/core: 2; Turbo: used; Build knobs: -O3 -ip -g -debug inline-debug-info -march=core-avx2 -qopt-report=5 -restrict, tested by Intel and results as of April 2021.
2Highest available memory bandwidth of any HPC CPU (9200) with 12 DDR4 channels for powering memory-bound workloads. As of July 21, 2021, Intel offers up to 12 DDR4 memory channels for Intel® Xeon® processors, as compared to 8 for AMD EPYC.
311X higher batch AI inference performance with Intel-optimized Tensor Flow vs. stock Cascade Lake FP32 configuration New: 8380: 1-node, 2x Intel Xeon Platinum 8380 processor on Coyote Pass with 512 GB (16 slots/ 32GB/ 3200) total DDR4 memory, ucode X261, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-65-generic, 1x Intel_​SSDSC2KG96, Intel SSDPE2KX010T8, ResNet-50 v1.5, gcc-9.3.0, oneDNN 1.6.4, BS=128 FP32,INT8, TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, Unoptimized model : TensorFlow- 2.4.1, Modelzoo:https://github.com/IntelAI/models -b master, test by Intel on 3/12/2021. Baseline: 8280: 1-node, 2x Intel Xeon Platinum 8280 processor on Wolf Pass with 384 GB (12 slots/ 32GB/ 2933) total DDR4 memory, ucode 0x5003003, HT on, Turbo on, Ubuntu 20.04 LTS, 5.4.0-48-generic, 1x Samsung_​SSD_​860, Intel SSDPE2KX040T8, ResNet-50 v1.5, gcc-9.3.0, oneDNN 1.6.4, BS=128 FP32,INT8, Optimized model : TensorFlow 2.4.1 with Intel optimizations for 3rd Gen Intel Xeon Scalable processor, upstreamed to TensorFlow- 2.5 (container- intel/intel-optimized-tensorflow:tf-r2.5-icx-b631821f), Model zoo: https://github.com/IntelAI/models/tree/icx-launch-public/quickstart/, Unoptimized model : TensorFlow- 2.4.1, Modelzoo:https://github.com/IntelAI/models -b master, test by Intel on 2/17/2021.