Performance Benchmarks and Configuration Details for Intel® Xeon® Scalable Processors

Notices & Disclaimers

Intel® technologies’ features and benefits depend on system configuration and may require enabled hardware, software or service activation. Performance varies depending on system configuration. No computer system can be absolutely secure. Check with your system manufacturer or retailer or learn more at intel.com.

Tests document performance of components on a particular test, in specific systems. Differences in hardware, software, or configuration will affect actual performance. Consult other sources of information to evaluate performance as you consider your purchase. For more complete information about performance and benchmark results, visit http://www.intel.com/performance.

Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit http://www.intel.com/performance.

Cost reduction scenarios described are intended as examples of how a given Intel-based product, in the specified circumstances and configurations, may affect future costs and provide cost savings.  Circumstances will vary.  Intel does not guarantee any costs or cost reduction.

No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document.

Intel does not control or audit third-party benchmark data or the web sites referenced in this document. You should visit the referenced web site and confirm whether referenced data are accurate.

© 2017 Intel Corporation.  Intel, the Intel® logo, and Intel® Xeon® are trademarks of Intel Corporation in the U.S. and/or other countries. *Other names and brands may be claimed as property of others.

Configuration Details

1. 41X since 2006: Source as of June 2017: results estimated or published at www.spec.org using historic SPECint*_rate_base2006 results as of Jun 11 2017.  1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on Neon City with 384 GB Total Memory on Red Hat Enterprise Linux* 7.2-kernel 3.10.0-327 using CPU2006_FOR-OEMs-cpu2006-1.2-ic17.0-lin-binaries-20160922. Data Source: Request Number: 2498, Benchmark: SPECint*_rate_base2006, Score: 2550 Higher is better.

 

2. 1.65X Average Performance Gains: Geomean based on Normalized Generational Performance (estimated based on Intel internal testing of  OLTP Brokerage, SAP SD 2-Tier, HammerDB, Server-side Java, SPEC*int_rate_base2006, SPEC*fp_rate_base2006, Server Virtualization, STREAM* triad, LAMMPS, DPDK L3 Packet Forwarding, Black-Scholes, Intel Distribution for LINPACK.

a. Up to 1.36x claim based on Brokerage Firm OLTP:  1-Node, 2 x Intel® Xeon® Processor E5-2699 v4 on Grantley-EP (Wellsburg) with 512 GB Total Memory on Windows Server* 2012 R2 Standard using SQL Server 2014. Data Source: Request Number: 1640, Benchmark: Brokerage Firm OLTP, Score: 4373 transactions per second (tps) for OLTP vs. 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on on Purley-EP (Lewisburg ) with 764 GB Total Memory on. Windows Server* 2016 RTM Standard using SQL Server 2016 Data, Score: 5979 tps for OLTP. Higher is better.

b. Up to 1.40x claim based on 2-Tier SAP* SD : 1-Node, 2 x Intel® Xeon® Processor E5-2699 v4 on Grantley-EP (Wellsburg) with 512 GB Total Memory on SUSE Linux Enterprise Server* 10 SP4 using SAP EHP5.0 for ERP 6.0 and Sybase ASE 16.0. Data Source: Request Number: 2473, Benchmark: SAP* SD 2-Tier enhancement package 5 for SAP ERP 6.0, Score: 19721  vs. 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on Purley-EP (Lewisburg) with 768 GB Total Memory on SUSE Linux Enterprise Server* 12 using SAP ERP6.0/EHP5. Data Source: Request Number: 2558, Benchmark: SAP* SD 2-Tier enhancement package 5 for SAP ERP 6.0, Score: 27678 Higher is better.

c. Up to 1.49x claim based on Server-side Java: 1-Node, 2 x Intel® Xeon® Processor E5-2699 v4 on Wildcat Pass with 128 GB Total Memory on Red Hat Enterprise Linux* 6.5 kernel 2.6.32-431 using Java 8 SE, JDK8U60, Java Hotspot V1.8.0_60 (if appropriate). Data Source: Request Number: 1633, Benchmark: Server-side Java workload - MultiJVM, Score: 112054 Higher is better, vs. 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on Purley-EP (Lewisburg) with 384 GB Total Memory on Red Hat Enterprise Linux* 7.3 using jdk1.8u121. Data Source: Request Number: 2513, Benchmark: Server-side Java workload  - MultiJVM, Score: 167696 Higher is better.

d. Up to 1.52x claim based on SPECint*_rate_base2006 :  1-Node, 2 x Intel® Xeon® Processor E5-2699 v4 on Grantley-EP (Wellsburg) with 256 GB Total Memory on Red Hat Enterprise Linux* 7.2-kernel 3.10.0-327 using Compiler: C/C++: Version 16.0.0.101 of Intel C++ Studio XE for Linux;  - Fortran: Version 16.0.0.101 of Intel Fortran Studio XE for Linux. Data Source: Request Number: 2342, Benchmark: SPECint*_rate_base2006, Score: 1670 vs. 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on Neon City with 384 GB Total Memory on Red Hat Enterprise Linux* 7.2-kernel 3.10.0-327 using CPU2006_FOR-OEMs-cpu2006-1.2-ic17.0-lin-binaries-20160922. Data Source: Request Number: 2498, Benchmark: SPECint*_rate_base2006, Score: 2550 Higher is better.

e. Up to 1.55x on server virtualization workload:  1-Node, 2 x Intel® Xeon® Processor E5-2699 v4 on Grantley-EP (Wellsburg) with 512 GB Total Memory on VMware ESXi* 6.0 Update 1 using Guest VM's utilize RHEL 6 64bit OS. Data Source: Request Number: 1637, Benchmark: server virtualization workload, Score: 1034 @ 58 vs. 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on Wolf Pass SKX with 768 GB Total Memory on VMware ESXi6.0 U3 GA using Guest VM's utilize RHEL 6 64bit OS. Data Source: Request Number: 2563, Benchmark: server virtualization workload, Score: 1580 @ 90 VMs Higher is better.

f. Up to 1.63x claim based on SPECfp*_rate_base2006 :1-Node, 2 x Intel® Xeon® Processor E5-2699 v4 on Grantley-EP (Wellsburg) with 256 GB Total Memory on Red Hat Enterprise Linux* 7.2-kernel 3.10.0-327 using Compiler: C/C++: Version 16.0.0.101 of Intel C++ Studio XE for Linux; - Fortran: Version 16.0.0.101 of Intel Fortran Studio XE for Linux. Data Source: Request Number: 2340, Benchmark: SPECfp*_rate_base2006, Score: 1050 Higher is better vs. 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on Neon City with 384 GB Total Memory on Red Hat Enterprise Linux* 7.2-kernel 3.10.0-327 using CPU2006_FOR-OEMs-cpu2006-1.2-ic17.0-lin-binaries-20160922. Data Source: Request Number: 2503, Benchmark: SPECfp*_rate_base2006, Score: 1720 Higher is better.

g. Up to 1.65x claim based on STREAM - triad:  1-Node, 2 x Intel® Xeon® Processor E5-2699 v4 on Grantley-EP (Wellsburg) with 256 GB Total Memory on Red Hat Enterprise Linux* 6.5 kernel 2.6.32-431 using Stream NTW avx2 measurements. Data Source: Request Number: 1709, Benchmark: STREAM - Triad, Score: 127.7 Higher is better vs. 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on Neon City with 384 GB Total Memory on Red Hat Enterprise Linux* 7.2-kernel 3.10.0-327 using STREAM AVX 512 Binaries. Data Source: Request Number: 2500, Benchmark: STREAM - Triad, Score: 199 Higher is better.

h. Up to 1.73x claim based on HammerDB:1-Node, 2 x Intel® Xeon® Processor E5-2699 v4 on Grantley-EP (Wellsburg) with 384 GB Total Memory on Red Hat Enterprise Linux* 7.1 kernel 3.10.0-229 using Oracle 12.1.0.2.0 (including database and grid) with 800 warehouses, HammerDB 2.18. Data Source: Request Number: 1645, Benchmark: HammerDB, Score: 4.13568e+006 Higher is better vs. 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on Purley-EP (Lewisburg) with 768 GB Total Memory on Oracle Linux* 7.2 using Oracle 12.1.0.2.0, HammerDB 2.18. Data Source: Request Number: 2510, Benchmark: HammerDB, Score: 7.18049e+006 Higher is better.

i. Up to 1.73x claim based on LAMMPS: LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. It is used to simulate the movement of atoms to develop better therapeutics, improve alternative energy devices, develop new materials, and more. E5-2697 v4: 2S Intel® Xeon® processor E5-2697 v4, 2.3GHz, 36 cores, Intel® Turbo Boost Technology and Intel® Hyperthreading Technology on, BIOS 86B0271.R00, 8x16GB 2400MHz DDR4, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327.  Gold 6148: 2S Intel® Xeon® Gold 6148 processor, 2.4GHz, 40 cores, Intel® Turbo Boost Technology and Intel® Hyperthreading Technology on, BIOS 86B.01.00.0412.R00, 12x16GB 2666MHz DDR4, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327.

j. Up to 1.77x claim based on DPDK L3 Packet Forwarding: E5-2658 v4: 5 x Intel® XL710-QDA2, DPDK 16.04. Benchmark: DPDK l3fwd sample application Score:  158 Gbits/s packet forwarding at 256B packet using cores. Gold 6152: Estimates based on Intel internal testing on Intel Xeon 6152 2.1 GHz, 2x Intel® FM10420(RRC) Gen Dual Port 100GbE Ethernet controller (100Gbit/card) 2x Intel® XXV710 PCI Express* Gen Dual Port 25GbE Ethernet controller (2x25G/card), DPDK 17.02. Score:  281 Gbits/s packet forwarding at 256B packet using cores, IO and memory on a single socket.

k. Up to 1.87x claim based on Black-Scholes: which is a popular mathematical model used in finance for European option valuation. This is a double precision version. E5-2697 v4: 2S Intel® Xeon® processor CPU E5-2697 v4 , 2.3GHz, 36 cores, turbo and HT on, BIOS 86B0271.R00, 128GB total memory, 8 x16GB 2400 MHz DDR4 RDIMM, 1 x 1TB SATA, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327. Gold 6148: Intel® Xeon® Gold processor 6148@ 2.4GHz, H0QS, 40 cores 150W. QMS1, turbo and HT on, BIOS SE5C620.86B.01.00.0412.020920172159, 192GB total memory, 12 x 16 GB 2666 MHz DDR4 RDIMM, 1 x 800GB INTEL SSD SC2BA80, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327.

l. Up to 2.27x claim based on LINPACK*: 1-Node, 2 x Intel® Xeon® Processor E5-2699 v4 on Grantley-EP (Wellsburg) with 64 GB Total Memory on Red Hat Enterprise Linux*  7.0 kernel 3.10.0-123 using MP_LINPACK 11.3.1 (Composer XE 2016 U1). Data Source: Request Number: 1636, Benchmark: Intel® Distribution of LINPACK, Score: 1446.4 Higher is better vs. 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on Wolf Pass SKX with 384 GB Total Memory on Red Hat Enterprise Linux* 7.3 using mp_linpack_2017.1.013. Data Source: Request Number: 3753, Benchmark: Intel® Distribution of LINPACK, Score: 3295.57 Higher is better.

3. a. 2.2X higher deep learning training and inference performance than the prior generation: Platform: 2S Intel® Xeon® Platinum 8180 CPU @ 2.50GHz (28 cores), HT disabled, turbo disabled, scaling governor set to “performance” via intel_pstate driver, 384GB DDR4-2666 ECC RAM. CentOS Linux release 7.3.1611 (Core), Linux kernel 3.10.0-514.10.2.el7.x86_64. SSD: Intel® SSD DC S3700 Series (800GB, 2.5in SATA 6Gb/s, 25nm, MLC).Performance measured with: Environment variables: KMP_AFFINITY='granularity=fine, compact‘, OMP_NUM_THREADS=56, CPU Freq set with cpupower frequency-set -d 2.5G -u 3.8G -g performance. Compared with Platform: 2S Intel® Xeon® CPU E5-2699 v4 @ 2.20GHz (22 cores), HT enabled, turbo disabled, scaling governor set to “performance” via acpi-cpufreq driver, 256GB DDR4-2133 ECC RAM. CentOS Linux release 7.3.1611 (Core), Linux kernel 3.10.0-514.10.2.el7.x86_64. SSD: Intel® SSD DC S3500 Series (480GB, 2.5in SATA 6Gb/s, 20nm, MLC). Performance measured with: Environment variables: KMP_AFFINITY='granularity=fine, compact,1,0‘, OMP_NUM_THREADS=44, CPU Freq set with cpupower frequency-set -d 2.2G -u 2.2G -g performance.  Neon: ZP/MKL_CHWN branch commit id:52bd02acb947a2adabb8a227166a7da5d9123b6d. Dummy data was used. The main.py script was used for benchmarking , in mkl mode. ICC version used : 17.0.3 20170404, Intel® MKL small libraries version 2018.0.20170425; Inference and training throughput uses FP32 instructions.

b. 18X higher deep learning training and inference performance than 4-yr old system: Platform: 2S Intel® Xeon® Platinum 8180 CPU @ 2.50GHz (28 cores), HT disabled, turbo disabled, scaling governor set to “performance” via intel_pstate driver, 384GB DDR4-2666 ECC RAM. CentOS Linux release 7.3.1611 (Core), Linux kernel 3.10.0-514.10.2.el7.x86_64. SSD: Intel® SSD DC S3700 Series (800GB, 2.5in SATA 6Gb/s, 25nm, MLC).Performance measured with: Environment variables: KMP_AFFINITY='granularity=fine, compact‘, OMP_NUM_THREADS=56, CPU Freq set with cpupower frequency-set -d 2.5G -u 3.8G -g performance. Compared with Platform: 2S Intel® Xeon® CPU E5-2697 v2 @ 2.70GHz (12 cores), HT enabled, turbo enabled, scaling governor set to “performance” via intel_pstate driver, 256GB DDR3-1600 ECC RAM. CentOS Linux release 7.3.1611 (Core), Linux kernel 3.10.0-514.21.1.el7.x86_64. SSD: Intel® SSD 520 Series 240GB, 2.5in SATA 6Gb/s, 25nm, MLC.  Intel Caffe: (http://github.com/intel/caffe/), revision b0ef3236528a2c7d2988f249d347d5fdae831236. Inference measured with “caffe time --forward_only” command, training measured with “caffe time” command. For “ConvNet” topologies, dummy dataset was used. For other topologies, data was stored on local storage and cached in memory before training. Topology specs from https://github.com/intel/caffe/tree/master/models/intel_optimized_models (GoogLeNet, AlexNet, and ResNet-50), GCC 4.8.5, MKLML version 2017.0.2.20170110.

c. Over 100X higher deep learning training and inference performance than 3-yr old system using unoptimized software: Platform: 2S Intel® Xeon® Platinum 8180 CPU @ 2.50GHz (28 cores), HT disabled, turbo disabled, scaling governor set to “performance” via intel_pstate driver, 384GB DDR4-2666 ECC RAM. CentOS Linux release 7.3.1611 (Core), Linux kernel 3.10.0-514.10.2.el7.x86_64. SSD: Intel® SSD DC S3700 Series (800GB, 2.5in SATA 6Gb/s, 25nm, MLC).Performance measured with: Environment variables: KMP_AFFINITY='granularity=fine, compact‘, OMP_NUM_THREADS=56, CPU Freq set with cpupower frequency-set -d 2.5G -u 3.8G -g performance. Compared with Platform: 2S Intel® Xeon® CPU E5-2699 v3 @ 2.30GHz (18 cores), HT enabled, turbo disabled, scaling governor set to “performance” via intel_pstate driver, 256GB DDR4-2133 ECC RAM. CentOS Linux release 7.3.1611 (Core), Linux kernel 3.10.0-514.el7.x86_64. OS drive: Seagate* Enterprise ST2000NX0253 2 TB 2.5" Internal Hard Drive.Performance measured with: Environment variables: KMP_AFFINITY='granularity=fine, compact,1,0‘, OMP_NUM_THREADS=36, CPU Freq set with cpupower frequency-set -d 2.3G -u 2.3G -g performance. Intel Caffe: (http://github.com/intel/caffe/), revision b0ef3236528a2c7d2988f249d347d5fdae831236. Inference measured with “caffe time --forward_only” command, training measured with “caffe time” command. For “ConvNet” topologies, dummy dataset was used. For other topologies, data was stored on local storage and cached in memory before training. Topology specs from https://github.com/intel/caffe/tree/master/models/intel_optimized_models (GoogLeNet, AlexNet, and ResNet-50), GCC 4.8.5, MKLML version 2017.0.2.20170110. BVLC-Caffe: https://github.com/BVLC/caffe, Inference & Training measured with “caffe time” command.  For “ConvNet” topologies, dummy dataset was used. For other topologies, data was st ored on local storage and cached in memory before training  BVLC Caffe (http://github.com/BVLC/caffe), revision 91b09280f5233cafc62954c98ce8bc4c204e7475 (commit date 5/14/2017). BLAS: atlas ver. 3.10.1.

4. Neusoft SaCa® Aclome*: SaCa® Aclome workload (for general performance) and compressing/decompressing workload (for QAT). OS: CentOS  7.3.1611. Testing by Intel and Neusoft May 2017. BASELINE: 2S Intel® Xeon® processor E5-2699 v4, 2.2GHz, 22 cores, turbo and HT on, 128GB total memory, 8 slots / 16GB / 2400 MT/s / DDR4, SATA SSD. NEW: 2S Intel® Xeon® Platinum 8180 processor, 2.5GHz, 28 cores, turbo and HT on, Intel® Chipset C627, 128GB total memory, 8 slots / 16GB / 2666 MT/s / DDR4, SATA SSD. Notes: Data compression/depression using Zlib 1.2.8. QAT Driver version: S4PR1-Linux-QAT1.7.Upstream.L.0.9.0-36

5. AsiaInfo Telco BSS*: AsiaInfo Telco BSS workload. OS: RHEL* 7.3. Testing by Intel & AsiaInfo May 2017. BASELINE: 4S Intel® Xeon® processor E7-8890 v4, 2.2GHz, 24 cores, turbo and HT on, 256GB total memory, 16 slots / 16GB / 1600 MT/s / DDR4, P3700 2T SSD x 2. NEW 1 (for general workload benchmarking and Intel® QAT benchmarking): 4S Intel® Xeon® Platinum 8180 processor, 2.5GHz, 28 cores, turbo and HT on, Intel® Chipset C627, 384GB total memory, 24 slots / 16GB / 2666 MT/s / DDR4, Intel® SSD DC P3700 2TB x 2. NEW 2 (for Intel® Optane™ SSD benchmarking): 4S Intel® Xeon® Platinum 8180 processor, 2.5GHz, 28 cores, turbo and HT on, Intel® Chipset C627, 384GB total memory, 24 slots / 16GB / 2666 MT/s / DDR4, Intel® SSD DC P3700 2TB x 2, Intel® Optane™ SSD DC P4800X 375G x 2.

6. IBM DB2*: DB2 v11.1.1.1. The IBM Big Data Insights Internal Heavy Multiuser Workload (BDInsights) is a multi-user data warehousing workload based on a retail environment. The workload is comprised with a mix of complex and intermediate queries. The scale factor for the workload is 3TB with 12 users. Testing by Intel and IBM. April/May 2017. BASELINE: 4S Intel® Xeon® processor E7-8890 v4, 2.2GHz, 24 cores, turbo on, HT on, BIOS 335.R00, 1.5TB total memory, 96 slots / 16GB / 1600 MT/s / DDR4 LRDIMM, 1 x 800GB, Intel® SSD DC S3700, Red Hat Enterprise Linux* 7.3 kernel 3.10.0-514.16.1.el7.x86_64. NEW: 4S Intel® Xeon® Platinum processor 8180 v4, 2.5GHz, 28 cores, turbo on, HT on, BIOS 119.R05, 1.5TB total memory, 48 slots / 32GB / 2677 MT/s / DDR4 LRDIMM, 1 x 800GB, Intel® SSD DC S3700, Red Hat Enterprise Linux* X.X kernel 3.10.0-514.16.1.el7.x86_64.

7. Aerospike* database: The database was populated with 200 M records of 100 bytes each and benchmarked with the Aerospike Java Benchmark tool (https://github.com/aerospike/aerospike-client-java) . The workload simulated 95%/5% read/update ratio. Two Aerospike instances were launched on a single server forming a cluster. Each Aerospike instance was affinitized to a CPU socket and configured to use one of the 10GB NICs. Each 10GB NIC had its interrupt IRQs affinitized to a CPU socket.  BASELINE:  Aerospike Server Enterprise* 3.6.4 , CentOS* 6.7, kernel version 2.6.32-573.3.1.el6.x86_64, 2 Intel® Xeon® processor E5-2697 v3, 2.6GHz, 28 cores, 128GB DDR4/1866, regular DIMM, 2x 10Gb network Intel X540-AT2 not bonded, no disk used – in memory workload.  NEXT GEN (old software): Aerospike Server Enterprise 3.6.4, CentOS 6.7, kernel version 2.6.32-573.3.1.el6.x86_64, 2 Intel® Xeon® processor E5-2699 v4, 2.2GHz, 44cores, 128GB DDR4/2134, regular DIMM, 2x 10Gb network Intel X540-AT2 not bonded, no disk used – in memory workload. Clients: 8 client systems were used to concurrently submit queries to the servers and drive the workload. The same clients were used in both “baseline” and “new”. The clients were configured as follows: E5-2697 v3 128GB of memory and 10GB Intel X540-AT2 network. The database was populated with 400 M records of 100 bytes each and benchmarked with the Aerospike Java Benchmark tool (https://github.com/aerospike/aerospike-client-java) . The workload simulated 95%/5% read/update ratio. Two Aerospike instances were launched on a single server forming a cluster.  NEXT GEN (new software): Aerospike Server Enterprise 3.12.1, OS: CentOS 7.2 with kernel updated to 4.4.59, Intel® Xeon® processor E5-2699 v4, 2.2GHz, 22 cores, turbo and HT on, BIOS SE5C610.86B.01.01.0016.033120161139, 128GB total memory, 16 DIMMs / 8GB / Configured Clock Speed: 1866 MHz / DDR4 DIMM, 2 x Intel® 82599ES 10 Gigabit Ethernet Controllers – all 4 ports on the 2 network controllers were bonded for an aggregate 40000Mb/s bond. No storage – in-memory workload. NEW: Aerospike Server Enterprise 3.12.1, OS: CentOS 7.2 with kernel updated to 4.4.59, Intel® Xeon® Platinum processor 8180, 2.5GHz, 28 cores, turbo and HT on, BIOS SE5C620.86B.01.00.0412.020920172159 , 384GB total memory, 12 DIMMs / 32GB / Configured Clock Speed: 2666 MHz / DDR4 DIMM, 2 x Intel® 82599ES 10 Gigabit Ethernet Controllers – all 4 ports on the 2 network controllers were bonded for an aggregate 40000Mb/s bond. No storage – in-memory workload. Clients: 8 client systems were used to concurrently submit queries to the servers and drive the workload. The same clients were used in both “baseline” and “new”.  The clients were configured as follows: CentOS 7.2 with kernel 3.10.0-327. Intel® Xeon® processor E5-2697 v4, 2.3GHz, 18 cores, turbo and HT on, BIOS SE5C610.86B.01.01.0016.033120161139, 128GB total memory, 8 DIMMs / 16GB / Configured Clock Speed: 2400 MHz, 1 x Intel® 82599ES 10 Gigabit Ethernet Controllers.

8. Up to 1.63x gains for Technical Computing workloads: based on Geomean of Weather Research Forecasting - Conus 12Km, HOMME, LSTCLS-DYNA Explicit, INTES PERMAS V16, MILC, GROMACS water 1.5M_pme, VASPSi256, NAMDstmv, LAMMPS, Amber GB Nucleosome, Binomial option pricing, Black-Scholes, Monte Carlo European options.

a. PERMAS by INTES is an advanced Finite Element software system that offers a complete range of physical models at high performance, quality, and reliability. It plays a mission-critical role in the design process at customers from automotive, ship design, aerospace, and more. E5-2697 v4: 2S Intel® Xeon® processor E5-2697v4, 2.3GHz, 18 cores, turbo on, HT off, NUMA on, BIOS 338.R00, 256 GB total memory (8x 32GB w/ 2400 MT/s, DDR4 LRDIMM), 4x Intel® SSD DC P3600 2 TB  in RAID 0 (stripe size 64k). CentOS Linux* release 7.2, kernel 3.10.0-327.13.1.el7.x86_64. Intel® Composer 2015.5.223. INTES PERMAS V16.00.  Gold 6148: Intel® Xeon® Gold 6148 processor, 2.4 GHz, 20 cores, turbo on, HT off, NUMA on, BIOS SE5C620.86B.01.00.0412.020920172159, 384 GB total memory (12x 32GB w/ 2400 MT/s, DDR4 LRDIMM), 3x Intel® SSD DC P3600 2 TB in RAID 0 (stripe size 64k), CentOS* Linux* release 7.3, kernel 3.10.0-514.10.2.el7.x86_64. Intel® Composer 2015.7.235. INTES PERMAS V16.00.

b. LS-DYNA is the leading product in the crash simulation market. It is used by the automobile, aerospace, construction, military, manufacturing, and bioengineering industries in worldwide. Workload: 2M elements Car2car model with 120ms simulation time.  LS-DYNA explicit standard benchmarks tested by Intel, March 2017.  E5-2697 V4: 2S Intel® Xeon® processor E5-2697 v4, 2.3GHz, 18 cores, turbo and HT on, BIOS SE5C610.86B.01.01.0016.033120161139, 128GB total memory, 8 memory channels / 8x16GB / 2400 MT/s / DDR4, Red Hat Enterprise Linux* 7.3 kernel 3.10.0-229.20.1.el6.x86_64.knl2.  GOLD 6148: 2S Intel® Xeon® Gold 6148 processor, 2.4GHz, 20 cores, turbo and HT on, BIOS version 412, 192GB total memory, 12 memory channels / 12x16GB / 2400 MT/s / DDR4, Red Hat Enterprise Linux* 7.3 kernel 3.10.0-514.el7.x86_64.

c. Binomial option pricing is a lattice-based approach that uses a discrete-time model of the varying price over time of the underlying financial instrument. This is compute bound, double precision workload. FSI Binomial workload. OS: Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327. Testing by Intel March 2017. E5-2697 v4: 2S Intel® Xeon® processor CPU E5-2697 v4 , 2.3GHz, 36 cores, turbo and HT on, BIOS 86B0271.R00, 128GB total memory, 8 slots / 16GB / 2400 MT/s / DDR4 RDIMM, 1 x 1TB SATA, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327.  Gold 6148: Intel® Xeon® Gold 6148 processor, 2.4GHz, 40 cores, turbo and HT on, BIOS 86B.01.00.0412, 192GB total memory, 12 slots / 16 GB /  2666 MT/s / DDR4 RDIMM, 1 x 800GB INTEL SSD SC2BA80, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327.

d. Monte Carlo is a numerical method that uses statistical sampling techniques to approximate solutions to quantitative problems. In finance, Monte Carlo algorithms are used to evaluate complex instruments, portfolios, and investments. This is compute bound, double precision workload. FSI Monte Carlo workload. OS: Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327. Testing by Intel March 2017. E5-2697 v4: 2S Intel® Xeon® processor CPU E5-2697 v4 , 2.3GHz, 36 cores, turbo and HT on, BIOS 86B0271.R00, 128GB total memory, 8 x16GB 2400 MHz DDR4 RDIMM, 1 x 1TB SATA, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327.  Gold 6148: Intel® Xeon® Gold 6148 processor@ 2.4GHz, H0QS, 40 cores 150W. QMS1, turbo and HT on, BIOS SE5C620.86B.01.00.0412.020920172159, 192GB total memory, 12 x 16 GB 2666 MHz DDR4 RDIMM, 1 x 800GB INTEL SSD SC2BA80, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327.

e. Black-Scholes is a popular mathematical model used in finance for European option valuation. This is a double precision version. E5-2697 v4: 2S Intel® Xeon® processor CPU E5-2697 v4 , 2.3GHz, 36 cores, turbo and HT on, BIOS 86B0271.R00, 128GB total memory, 8 x16GB 2400 MHz DDR4 RDIMM, 1 x 1TB SATA, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327. Gold 6148: Intel® Xeon® Gold 6148 processor@ 2.4GHz, H0QS, 40 cores 150W. QMS1, turbo and HT on, BIOS SE5C620.86B.01.00.0412.020920172159, 192GB total memory, 12 x 16 GB 2666 MHz DDR4 RDIMM, 1 x 800GB INTEL SSD SC2BA80, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327.

f. Amber* is a suite of programs for classical molecular dynamics and statistical analysis. The main MD program is PMEMD (Particle Mesh Ewald Molecular Dynamics) employs two separate algorithms for implicit- and explicit-solvent dynamics. Here performance for explicit solvent (PME) is presented. Amber: Version 16 with all patches applied at December, 2016. Workloads: PME Cellulose NVE(408K atoms), PME stmv(1M atoms), GB Nucleosome (25K), GB Rubisco (75K). No cut-off was used for GB workloads. Compiled with -mic2_spdp –intelmpi - openmp, –DMIC2 * defined. Tests performed on March 2017. E5-2697 v4: Executed with 36 MPI, 2 OpenMP. 2S Intel® Xeon® processor E5-2697 v4, 2.3GHz, 36 cores, turbo and HT on, BIOS 86B0271.R00, 8x16GB 2400MHz DDR4, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327.  Gold 6148: Executed with 40 MPI and 2 OpenMP. 2S Intel® Xeon® Gold 6148 processor, 2.4GHz, 40 cores, turbo on, HT on, BIOS 86B.01.00.0412.R00, 12x16GB 2666MHz DDR, Red Hat Enterprise Linux* 7.2 kernel 3.10.0-327.

9. RADIOSS*: RADIOSS 2017, Neon 1M 8ms benchmark workload. NEON front car crash refined model with 1 million of elements, first 8ms run. BASELINE: Altair RADIOSS 14 on Red Hat Enterprise Linux* 6.5, 2 Intel® Xeon® processor E5-2697 v3, 2.6GHz, 28 cores, Hyperthreading with 28 MPI x 2 OpenMP, 64GB DDR3/1833, regular DIMM, Intel® SSD DC S3700 800GB, 1Gb network, Source is Altair internal as of April 1, 2016. Next GEN: Altair RADIOSS 2017 on Red Hat Enterprise Linux 6.5, 2 Intel® Xeon® processor E5-2699 v4, 2.2GHz, 44 cores, Hyperthreading with 44 MPI x 2 OpenMP, 64GB DDR3/1833, regular DIMM, Intel® SSD DC S3700 800GB, 1Gb network, Bios SE5C620.86B.01.00.0412, Source is Altair internal as of April 1, 2017. NEW: Altair RADIOSS 2017 on CentOS Linux 7.2, 2 Intel® Xeon® Gold 6148 processor, 2.4GHz, 40 cores, Hyperthreading with 40 MPI x 2 OpenMP, 192GB DDR4/2666, regular DIMM, Intel® SSD DC S3700 800GB, 1Gb network, Bios 0271.R00, Source is Altair internal as of April 11, 2017.

10. Tencent InGame Purchase Machine Learning Platform*: OS: CentOS 7.3.1611. Testing by Intel May 2017. BASELINE: 2S Intel® Xeon® processor E5-2699 v4, 2.2GHz, 22 cores, turbo and HT on, 128GB total memory, 8 slots / 16GB / 2400 MT/s / DDR4, Intel® SSD DC S3700 800GB. NEW: 2S Intel® Xeon® Platinum 8180 processor, 2.5GHz, 28 cores, turbo and HT on, 192GB total memory, 12 slots / 16GB / 2666 MT/s / DDR4, Intel® SSD DC S3700 800GB.

11. Storage: up to 5X more IOPS while reducing latency by up to 70% than the prior generation when using Intel® optimized software and upgrading from NVMe to Intel® Optane™ SSD: BASELINE: Skylake w/ 4x P4600 no SPDK – CPU: 2S Intel® Xeon® Platinum 8170 CPU @ 2.10GHz; Memory: 196GB, 6x Memory Channels per socket, 1 16GB 2666 DDR4 DIMM per channel; Board: Intel Wolf Pass, BIOS: SE5C620.86B.01.00.0511.051220170820; Storage: 4x Intel P4600 1.6TB, 2 on socket 0 + 2 on socket 1; OS: Ubuntu 16.04.1; Linux Kernel: 4.11.0_x86_64; Turbo: On; HT: Disabled; C-States: Disabled; Power & Performance: Performance; Speed Stepping: Enabled; BenchMark: SPDK Perf; IODepth: 128; Block Size: 4096; RunTime: 300 sec: No. of Runs: 3 Times; Num of Cores: Core 0 (Single Core); IOPS – 4k random read: 3207706; IOPS – 4k random writes: 3005696; Lat – 4K random read: 239; Lat – 4K random writes: 255. NEW: Skylake with 6x Optane + SPDK – CPU: 2S Intel® Xeon® Platinum 8168 CPU @ 2.70GHz; Memory: 196GB, 6x Memory Channels per socket, 1 16GB 2666 DDR4 DIMM per channel; Board: Intel Wolf Pass, BIOS: SE5C620.86B.01.00.0511.051220170820; Storage: 6x Intel P4800X 375GB, 1 on socket 0 + 5 on socket 1; OS: Ubuntu 16.04.1; Linux Kernel: 4.11.0_x86_64; SPDK Commit: 730a63d02b6; DPDK: 17.02; Turbo: On; HT: Disabled; C-States: Disabled; Power & Performance: Performance; Speed Stepping: Enabled; BenchMark: SPDK Perf; IODepth: 32; Block Size: 4096; RunTime: 300 sec: No. of Runs: 3 Times; Num of Cores: Core 0 (Single Core); IOPS – 4k random read: 614531; IOPS – 4k random writes: 588277; Lat – 4K random read: 833; Lat – 4K random writes: 870.

12. Keepixo workload. OS: OS: CentOS Linux* 7.3 kernel 3.10.0. Testing by Keepixo May 2017. BASELINE: 2S Intel® Xeon® processor E5-2699 v4, 2.2GHz, 22 cores, turbo and HT on, BIOS 251.R01, 64GB total memory, 8 slots / 8GB / 2133 MT/s / DDR4 LRDIMM, CentOS Linux* 7.1 kernel 3.10.0. NEW: 2S Intel® Xeon® Platinum 8168 processor, 2.7GHz, 24 cores, turbo and HT on, BIOS 412, 192GB total memory, 12 slots / 16GB / 2600 MT/s / DDR4 LRDIMM, CentOS Linux* 7.3 kernel 3.10.0.

13. Up to 4.2x more VMs based on virtualization consolidation workload: Based on Intel® internal estimates 1-Node, 2 x Intel® Xeon® Processor E5-2690 on Romley-EP with 256 GB Total Memory on VMware ESXi* 6.0 GA using Guest OS RHEL6.4, glassfish3.1.2.2, postgresql9.2. Data Source: Request Number: 1718, Benchmark: server virtualization workload, Score: 377.6 @ 21 VMs Higher is better vs. 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on Wolf Pass SKX with 768 GB Total Memory on VMware ESXi6.0 U3 GA using Guest VM's utilize RHEL 6 64bit OS. Data Source: Request Number: 2563, Benchmark: server virtualization workload, Score: 1580 @ 90 VMs Higher is better.

14. Up to 65% lower 4-year TCO estimate example based on equivalent rack performance using VMware ESXi* virtualized consolidation workload comparing 20 installed 2-socket servers with Intel Xeon processor E5-2690 (formerly “Sandy Bridge-EP”) running VMware ESXi* 6.0 GA using Guest OS RHEL6.4 compared at a total cost of $919,362 to 5 new Intel® Xeon® Platinum 8180 (Skylake) running VMware ESXi6.0 U3 GA using Guest OS RHEL 6 64bit at a total cost of $320,879 including basic acquisition.  Server pricing assumptions based on current OEM retail published pricing for 2-socket server with Broadwell based Intel Xeon processor systems– subject to change based on actual pricing of systems offered.

Total Cost of Ownership Estimated

Server A:  2S Xeon® E5-2690 (2.9GHz, 8C, 20MB)

Server B: 2S Xeon® Platinum 8180 (2.5GHz, 28C, 38.5M)

Number of Servers

20

5

Server Acquisition

$ 273,220

$ 150,995

Operating System & Software

OS License

$ 123,100

$ 30,775

OS Maintenance

$ 241,360

$ 60,340

Software License

$ 39,800

$ 9,950

Software Maintenance

$ 51,680

$ 12,920

 Server Maintenance

$ 134,080

$ 33,520

Infrastructure &
Utilities

Power and Cooling

$ 42,522

$ 18,979

Rack/Floor Space

$ 12,400

$ 3,100

Networking

$1,200

$ 300

 Total

$ 919,362

$ 320,879

15. Up to 1.59x claim based on SAP testing of SAP* HANA workload: 1-Node, 4S Intel® Xeon® processor E7-8890 v4 on Grantley-EX-based platform with 1024 GB Total Memory on SLES12SP1 vs. estimates based on SAP internal testing on 1-Node, 4S Intel® Xeon® processor Scalable family (codename Skylake-SP) system.

16. Security resulting in near zero encryption overhead for stored data: Near Zero encryption overhead: BigBench query Runtime/second. Testing done by Intel.  BASELINE: Platform 8168, NODES 1 Mgmt + 6 Workers, Make Intel Corporation, Model S2600WFD, Form Factor 2U, Processor Intel® Xeon® Platinum 8168 processor, Base Clock 2.70 GHz, Cores per socket 24, Hyper-Threading Enabled, NUMA mode Enabled, RAM 384GB DDR4, RAM Type 12x 32GB DDR4, OS Drive Intel® SSD DC S3710 Series (800GB, 2.5in SATA 6Gb/s, 20nm, MLC), Data Drives 8x - Seagate Enterprise  2.5 HDD ST2000NX0403 2TB, Intel® SSD DC P3520 Series (2.0TB), Temp Drive DC 3520 2TB, NIC Intel  X722 10GBE - Dual Port, Hadoop Cloudera 5.11, Benchmark BigBench, Operating System CentOS Linux release 7.3.1611 (Core); HDFS encryption turned OFF.  vs. NEW: Platform 8168, NODES 1 Mgmt + 6 Workers, Make Intel Corporation, Model S2600WFD, Form Factor 2U, Processor Intel® Xeon® Platinum 8168 processor, Base Clock 2.70 GHz, Cores per socket 24, Hyper-Threading Enabled, NUMA mode Enabled, RAM 384GB DDR4, RAM Type 12x 32GB DDR4, OS Drive Intel® SSD DC S3710 Series (800GB, 2.5in SATA 6Gb/s, 20nm, MLC), Data Drives 8x - Seagate Enterprise  2.5 HDD ST2000NX0403 2TB, Intel® SSD DC P3520 Series (2.0TB), Temp Drive DC 3520 2TB, NIC Intel  X722 10GBE - Dual Port, Hadoop Cloudera 5.11, Benchmark BigBench, Operating System CentOS Linux release 7.3.1611 (Core); HDFS encryption turned ON.

17. Technicolor proof of concept accelerated 3D rendering workload times by nearly 3X:  Approximately 3x performance claim based on internal Technicolor, Inc. rendering workload: one-node, 2 x Intel® Xeon® processor E5-2699 v4 with 32 GB total memory, 400 GB Intel® SSD DC S3510 Series, running on Windows 10 Standard*. Scores were normalized based on system 1 configuration as 1.0 baseline performance. Compared against one-node, 2 x Intel Xeon Platinum 8180 processor with 32 GB total memory, Intel® Optane™ SSD DC P4800X, running on Windows 10 Standard. Score: 2.95, as normalized against system 1.

18. 3.1X improvement in SHA Algorithms for cryptographic hashing: Performance on single core with frequency obfuscation comparing Intel® Xeon® Platinum 8180 Processor vs Intel® Xeon® E5-2650v4 Processor. ISA-L Configuration:  Intel Xeon® Processor Scalable Family: Platinum 8180 Processor, 28C, 2.5 GHz, H0, Neon City CRB, 12x16 GB DDR4 2666 MT/s ECC RDIMM, BIOS PLYCRB1.86B.0128.R08.1703242666.  Intel® Xeon® E5-2650v4 Processor, 12C, 2.2 GHz, Aztec City CRB, 4x8 GB DDR4 2400 MT/s ECC RDIMM, BIOS GRRFCRB1.86B.0276.R02.1606020546.  Operating System: Redhat Enterprise Linux 7.3, Kernel 4.2.3, ISA-L 2.18, BIOS Configuration, P-States: Disabled, Turbo: Disabled, Speed Step: Disabled, C-States: Disabled, ENERGY_PERF_BIAS_CFG: PERF.

19. Up to 5x claim based on OLTP Warehouse 4S: 1-Node, 4 x Intel® Xeon® Processor E7-4870 on Emerald Ridge with 512 GB Total Memory on Oracle Linux* 6.4 using Oracle 12c* running 800 warehouses. Data Source: Request Number: 56, Benchmark: HammerDB, Score: 2.46322e+006 Higher is better vs. 1-Node, 4 x Intel® Xeon® Platinum 8180 Processor on Lightning Ridge SKX with 768 GB Total Memory on Red Hat Enterprise Linux* 7.3 using Oracle 12.2.0.1 (including database and grid) with 800 warehouses. Data Source: Request Number: 2542, Benchmark: HammerDB, Score: 1.2423e+007 Higher is better.

20. Up to 4.89x claim based on OLTP Warehouse 2S : 1-Node, 2 x Intel® Xeon® Processor E5-2690 on Intel® Server Board S2600CP2 with 128 GB Total Memory on Oracle Linux* 6.4 using Oracle 11.2.0.3 with 5000 warehouses. Data Source: Request Number: 408, Benchmark: HammerDB, Score: 1.46826e+006 Higher is better vs. 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor on Purley-EP (Lewisburg) with 768 GB Total Memory on Oracle Linux* 7.2 using Oracle 12.1.0.2.0, HammerDB 2.18. Data Source: Request Number: 2510, Benchmark: HammerDB, Score: 7.18049e+006 Higher is better.

21. Up to 1.2x claim based on AES-128-GCM algorithm speedup: 1-Node, 2 x Intel® Xeon® Processor E5-2650 v4, 12C, 2.2 GHz, Aztec City CRB, 4x8 GB DDR4 2400 MT/s ECC RDIMM, BIOS GRRFCRB1.86B.0276.R02.1606020546 vs. 1-Node, 2 x Intel Xeon® Platinum 8180 Processor, 28C, 2.5 GHz, H0, Neon City CRB, 12x16 GB DDR4 2666 MT/s ECC RDIMM, BIOS PLYCRB1.86B.0128.R08.1703242666 Operating System: Red Hat Enterprise Linux* 7.3, Kernel 4.2.3, ISA-L 2.18, BIOS Configuration, P-States: Disabled, Turbo: Disabled, Speed Step: Disabled, C-States: Disabled, ENERGY_PERF_BIAS_CFG: PERF.

22. Up to 2.55x claim based on IPSec forwarding workload. Configuration data is in the table below:

Up to 2.55x on IPSec Forwarding test 1-Node, 2 x Intel® Xeon® Processor E5-2658v4 1-Node, 2 x Intel® Xeon® Platinum 8180 Processor
IPSec Forwarding Rate @1420B packet (in Gb/s, Higher is better)
Single Socket performance with Cores, NICs, and test memory used on only 1 socket
43.2 (2 cores used) 110.2 (3 cores used)
Processor Intel® Xeon® Processor E5-2658 v4 (35 Cache, 2.3 GHz) Intel® Xeon® Platinum 8180 Processor (39M Cache, 2.5 GHz)
Vendor Intel Intel
Nodes 1 1
Sockets 1 2
Cores Per Processor 14 28
Logical Processors 28 56
Platform Grantley-EP (Wellsburg) Purley-EP (Lewisburg)
Accelerator Used Intel DH895XCC (Coleto Creek) Intel Lewisburg in x24 link mode
Platform Comments SuperMicro® X10DRX Neon City
Memory DIMMs Slots used/Processor 4 6
Total Memory 64 GB 96 GB
Memory DIMM Configuration 16 GB / 2400 MT/s / DDR4 RDIMM 8 GB / 2400 MT/s / DDR4 RDIMM
Memory Comments MTA36ASF2G72PZ-2G4AU , 16GB, 2Rx8 M393A1G43DB1-CRC, Samsung 32GB 2Rx8 PC4 
Network Interface Cards 2x X710-DA4 Intel® Ethernet Controller, 6 10GbE ports used 3x X710-DA4 Intel® Ethernet Controller, 12 10GbE ports used
OS Ubuntu15.0 Ubuntu 16.04.1
OS/Kernel Comments 4.2.0-16 4.4.0
Primary / Secondary Software DPDK 16.11 DPDK 17.02
Other Configurations Version 2.0. Revision: 5.6 , P,C,Turbo state: Disabled, NUMA: Enabled, COD: Disabled BIOS: PLYDCRB1.86B.0114.R11.16122119, IMC Interleaving - 1 way, CPU Power & Performance - Perf, P,C and C states disabled, NUMA Enabled
Computer Type Server Server
Benchmark DPDK IPSec-secgw  DPDK IPSec-secgw 

23. 1.28X higher performance and 1.46X higher core performance, compared to other x86 processors: Source as of June 29th, 2017: performance results estimated or published at www.spec.org using SPECint_rate2006 (base).  Perf/watt based on Specint_rate2006 (base) and published CPU TDP (EPYC 7601 of 180W x 2, 8180 205W x2, 8160 150W x2).