Intel® oneAPI HPC Toolkit

Deliver Fast Applications That Scale across Clusters

Optimized Tools for High-Performance Computing

High-performance computing (HPC) is at the core of AI, machine learning, and deep learning applications. Intel® oneAPI HPC Toolkit delivers what developers need to build, analyze, optimize, and scale HPC applications with the latest techniques in vectorization, multithreading, multi-node parallelization, and memory optimization.

This toolkit includes powerful data-centric libraries and advanced analysis tools.

See All Toolkits

Download the Toolkit

Deliver fast C++, Fortran, OpenMP*, and MPI applications that scale.

Get It Now

Purchase with Priority Support

Submit questions, problems, and other technical support issues through direct and private interactions with Intel’s engineers.

Learn More

Features

Build
Simplify implementation of HPC applications on CPUs and accelerators with Intel’s industry-leading compiler technology and libraries.

Analyze
Quickly gauge how your application is performing, how resource use impacts your code, and where it can be optimized to ensure faster cross-architecture performance.

Scale
Deploy applications and solutions across shared memory and distributed memory (such as clusters) computing systems using the included standards-driven Intel® MPI Library and benchmarks, MPI analyzer, and cluster tuning tools.

What's Included

Intel® SHMEM
Implement Partitioned Global Address Space (PGAS) programming for host-initiated and device-initiated operations.

Intel® Fortran Compiler
Compile and optimize standard Fortran code for CPU and GPU architectures.

Intel® MPI Library
Deliver flexible, efficient, scalable cluster messaging on CPU and GPU architectures and a variety of network fabrics.

Intel® oneAPI DPC++/C++ Compiler
Compile and optimize standard C++ and SYCL* code for CPU and GPU target architectures.

Intel® DPC++ Compatibility Tool
Migrate legacy CUDA* code to open multiarchitecture SYCL code with this assistant.

Intel® VTune™ Profiler
Find and optimize performance bottlenecks across CPU and GPU systems.

Intel® Advisor
Design code for efficient vectorization, threading, and offloading to accelerators.

Intel® Distribution for GDB*
Enable deep, system-wide debugging of SYCL, C, C++, and Fortran code.

Intel® oneAPI DPC++ Library (oneDPL)
Speed up data parallel workloads with these key productivity algorithms and functions.

Intel® oneAPI Threading Building Blocks (oneTBB)
Simplify parallelism with this advanced threading and memory-management template library.

Intel® oneAPI Math Kernel Library (oneMKL)
Accelerate math processing routines that include matrix algebra, fast Fourier transforms (FFT), and vector math.

Intel® Integrated Performance Primitives
Speed up performance of imaging, signal processing, data compression, cryptography, and more.

Intel® Cryptography Primitives Library
These secure, fast, lightweight building blocks for cryptography are optimized for Intel CPUs.

Intel® oneAPI Data Analytics Library (oneDAL)
Boost machine learning and data analytics performance.

Intel® oneAPI Deep Neural Network Library (oneDNN)
Develop fast neural networks on Intel CPUs and GPUs with performance-optimized building blocks.

Intel® oneAPI Collective Communications Library (oneCCL)
Implement optimized communication patterns to distribute deep learning model training across multiple nodes.

Bundles

In addition to the full toolkit, download smaller bundles for specific development use cases.

Intel® C++ Essentials

This bundle is for C++ and SYCL developers who want to compile, debug, and use the most popular performance libraries in the Base Kit for Intel CPUs and GPUs. It includes:

Intel® Distribution for GDB*
Intel® DPC++ Compatibility Tool
Intel® oneAPI DPC++/C++ Compiler
oneDPL
oneMKL
oneTBB

Intel® Fortran Essentials

Fortran developers with numeric workloads can use this bundle to compile, debug, and use the most popular performance libraries in the Intel oneAPI HPC Toolkit for Intel CPUs and GPUs.

Intel® Fortran Compiler
Intel Distribution for GDB
Intel MPI Library
oneMKL

Get Started

Get Your System Ready

Check that you have the supported hardware and that your system meets the minimum requirements to install the HPC Kit.

Download the HPC Kit

After downloading, follow the Get Started Guide to configure your system and run your first sample.

Get Started Guide: Linux* | Windows* | Containers

Download

Next Steps

Access samples or run your own workloads with the help of tutorials and training.

Documentation & Code Samples

Documentation

Installation Guide

Get Started Guides:

Linux | Windows | Containers

View All Documentation

Code Samples

Learn how to access oneAPI code samples in a tool command line.

oneAPI Code Sample Catalog

Training

OpenMP* Offload Basics [self-paced]

Learn the fundamentals of using OpenMP offload directives to target GPUs, as well as using Intel® C, C++, and Fortran Compilers through hands-on practice in this guided learning path.

Essentials of SYCL [self-paced]

Learn the basics of SYCL for heterogeneous computing (CPU, GPU, and AI accelerators) using live sample code on the Intel® Tiber™ AI Cloud.

Intel® oneAPI Math Kernel Library (oneMKL) Essentials [self-paced]

Learn how to use oneMKL and its functions to create performant applications and speed up computations with low-level math routines.

Additional HPC Kit Training

Specifications

CPUs:

Intel® Xeon® processor family
Intel® Core™ processor family
Other processors that are compatible with Intel® 64 architecture

GPUs:

Intel® UHD Graphics for 11th generation Intel processors or newer
Intel® Iris® X^e graphics
Intel® Arc™ graphics
Intel® Server GPU
Intel® Data Center GPU Flex Series
Intel® Data Center GPU Max Series
NVIDIA* or AMD* GPUs using plug-ins from Codeplay*

Languages:

C++ with SYCL
Fortran
Note Requires Microsoft Visual Studio* on Windows

Operating systems:

Windows
Linux

Development environments:

Compatible with compilers from Microsoft*, GCC*, Intel, and others that follow established language standards
Windows: Microsoft Visual Studio
Linux: Eclipse*

Distributed environments:

OpenFabrics Interface* (OFI) framework implementation supporting the following:

InfiniBand*
iWARP, RDMA over Converged Ethernet (RoCE)
Amazon Web Services Elastic Fabric Adapter (AWS EFA)*
Cornelis Networks*
Ethernet, IP over InfiniBand (IPoIB), IP over Intel OPA

For more information, see the system requirements.

Get Help

Your success is our success. Access these support resources when you need assistance.

For additional help, see our general oneAPI Support.

Stay In the Know on All Things CODE

Sign up to receive the latest tech articles, tutorials, dev tools, training opportunities, product updates, and more, hand-curated to help you optimize your code, no matter where you are in your developer journey. Take a chance and subscribe. You can change your mind at any time.

All fields are required unless marked optional.

Business Email Address

Please select a country/region

Company

Intel strives to provide you with a great, personalized experience, and your data helps us to accomplish this.

I consent to Intel collecting and using my personal information as described below.

I expressly consent to Intel transferring my Personal Data outside of China to Intel U.S. headquarters as described in the Intel Privacy Notice Supplement for Users in China.

By submitting this form, you are confirming you are age 18 years or older. Intel will process your Personal Data for the purpose of this business request. To learn more about Intel's practices, including how to manage your preferences and settings, visit Intel's Privacy Notice.

You’re In!

Thank you for signing up. Be on the lookout for a welcome email to get you started.

Select Your Language

Using Intel.com Search

Quick Links

Recent Searches

Advanced Search

Only search in