Next Steps
Intel® oneAPI HPC Toolkit
After successfully building a sample application, Explore SYCL* Through Samples.
Maximize resources with OpenMP Offloading.
Diagnose issues in the Troubleshooting section.
The Intel® oneAPI HPC Toolkit has the tools you need to start building and analyzing applications out of the box. Start compiling and optimizing your code today using all the tools and libraries in this suite.
Learn more about the components in this toolkit:
Tool | Description |
---|---|
Intel® oneAPI DPC++/C++ Compiler | Use this standards-based C++ compiler with support for OpenMP* to take advantage of more cores and built-in technologies in platforms based on Intel® Xeon® Scalable processors and Intel® Core™ processors. Learn more. |
Intel® C++ Compiler | Create code that takes advantage of more cores and built-in technologies in platforms based on Intel® processors. Learn more. |
Intel® Fortran Compiler Classic | Generate optimized, scalable code for Intel® Xeon® Scalable processors and Intel® Core™ processors with this standards-based Fortran compiler with support for OpenMP*. Get Started. |
Intel® MPI Library | Intel® MPI Library is a multi-fabric message passing library that implements the Message Passing Interface, version 3.0 (MPI-3.0) specification. Use the library to develop applications that can run on multiple cluster interconnects. Get Started. |
Intel® Inspector | Locate and debug threading, memory, and persistent memory errors early in the design cycle to avoid costly errors later. Learn more. |
Intel® Trace Analyzer and Collector | Understand MPI application behavior across its full runtime. Learn more. |
oneAPI GPU Optimization Guide | The oneAPI GPU Optimization Guide demonstrates how to improve the behavior of your software by partitioning it across the host and accelerator to specialize portions of the computation that run best on the accelerator. Specialization includes restructuring and tuning the code to create the best mapping of the application to the hardware. The value of oneAPI is that it allows each of these variations to be expressed in a common language with device-specific variants launched on the appropriate accelerator. |
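
The host/accelerator partitioning described for OpenMP* offloading and the GPU Optimization Guide can be illustrated with a minimal sketch. The function name `vector_add` and the data layout are assumptions made for illustration; they do not come from the toolkit documentation. The `map` clauses say which buffers are copied to the device and which are copied back to the host.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (illustration only): computes c[i] = a[i] + b[i].
// When the code is built with OpenMP offload enabled, the loop may run on an
// accelerator. Without OpenMP support the pragma is ignored and the same loop
// runs serially on the host.
std::vector<float> vector_add(const std::vector<float>& a,
                              const std::vector<float>& b) {
    assert(a.size() == b.size());
    const std::size_t n = a.size();
    std::vector<float> c(n);
    if (n == 0) {
        return c;  // nothing to transfer or compute
    }

    // Raw pointers are used so the map clauses can name array sections.
    const float* pa = a.data();
    const float* pb = b.data();
    float* pc = c.data();

    // map(to: ...)   copies the inputs to the device before the loop.
    // map(from: ...) copies the result back to the host afterwards.
    #pragma omp target teams distribute parallel for \
        map(to: pa[0:n], pb[0:n]) map(from: pc[0:n])
    for (std::size_t i = 0; i < n; ++i) {
        pc[i] = pa[i] + pb[i];
    }
    return c;
}
```

Calling `vector_add({1, 2, 3}, {10, 20, 30})` should return `{11, 22, 33}` whether or not a device is present. With the Intel® oneAPI DPC++/C++ Compiler, offload is typically enabled with options such as `-fiopenmp -fopenmp-targets=spir64`; check the compiler documentation for your version before relying on them.
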
HPC Component | Description |
---|---|
Intel® C++ and Fortran Compilers | The Intel® C++ and Intel® Fortran optimizing compilers create fast code for modern processors. They use the latest instruction sets, auto-vectorize code to use wider vector registers, and support highly tuned parallel models such as OpenMP* and Intel® TBB. The compilers offer broad support for the latest C, C++, and Fortran standards. |
Intel® MPI Library | Intel® MPI Library is a multifabric message-passing library that implements the open-source Message Passing Interface (MPI) specification. Use the library to create, maintain, and test advanced, complex applications that perform better on HPC clusters based on Intel® processors. Develop applications that can run on multiple cluster interconnects chosen by the user at run time. Quickly deliver maximum end-user performance without having to change the software or operating environment. Reduce the time to market by linking to one library and deploying on the latest optimized fabrics. |
Intel® Inspector | Memory errors and non-deterministic threading errors are difficult to find without the right tool. Intel® Inspector is designed to find these errors. It is a dynamic memory and threading error debugger for C, C++, and Fortran applications that run on Windows* and Linux* operating systems. It helps you save money (locate the root cause of memory and threading errors before you release), save time (quickly debug intermittent races and deadlocks), save data (find errors like missing or redundant cache flushes for persistent memory implementations), and save effort (use the stand-alone interface, Microsoft Visual Studio* plug-in, or command line). No special compilers or builds are required. |
Intel® Trace Analyzer and Collector | Use this graphical tool to understand MPI application behavior across its full runtime. It helps you: find temporal dependencies and bottlenecks in your code; check the correctness of your application; locate potential programming errors, buffer overlaps, and deadlocks; visualize and understand parallel application behavior; evaluate profiling statistics and load balancing; analyze performance of subroutines or code blocks; learn about communication patterns, parameters, and performance data; identify communication hot spots; and decrease time to solution and increase application efficiency. |
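
As a small illustration of the OpenMP* parallelism and vectorization that the compilers in the table above support, the following sketch computes a dot product with a combined parallel and SIMD reduction. The function name `dot_product` is hypothetical and chosen only for this example; it is not part of any toolkit API.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (illustration only): returns the sum of x[i] * y[i].
// With OpenMP enabled, loop iterations are divided among threads and each
// thread's chunk may be vectorized. Without OpenMP the pragma is ignored and
// the loop runs serially with the same result for this input.
double dot_product(const std::vector<double>& x, const std::vector<double>& y) {
    assert(x.size() == y.size());
    const std::size_t n = x.size();
    double sum = 0.0;

    // reduction(+ : sum) gives each thread a private partial sum and
    // combines the partial sums when the loop finishes.
    #pragma omp parallel for simd reduction(+ : sum)
    for (std::size_t i = 0; i < n; ++i) {
        sum += x[i] * y[i];
    }
    return sum;
}
```

For example, `dot_product({1, 2, 3}, {4, 5, 6})` evaluates to 32. OpenMP is usually enabled with a compiler option such as `-qopenmp` for the Intel compilers or `-fopenmp` for GCC. Because floating-point addition is not associative, results for large inputs can differ slightly between serial and parallel runs.
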
For more information about this toolkit, see the Intel® oneAPI Toolkits page.