Intel® oneAPI News Updates

What's New

  • oneAPI Spec 1.2 Release PLUS New Members Added to Steering Committee Read

  • Intel @ Supercomputing 2022 – Open, Accelerated Computing for HPC and AI Read

  • Intel® oneAPI 2022.3 Tools Available Read

  • oneAPI Initiative Expands to a Community Forum for Open Accelerated Computing Read

  • Get the Latest on All Things CODE

author-image

By

oneAPI Spec 1.2 Release PLUS New Members Added to Steering Committee  

November 14, 2022 | oneAPI initiative

The oneAPI Specification 1.2 is comprised of a major new release of the oneDNN specification which includes the brand new oneDNN Graph API, bringing enhanced performance by enabling a larger scope of deep neural network (DNN) compute graph functionality.

Additional features include:

  • Updates and extensions to DPC++ (oneAPI’s open source SYCL implementation)
  • Enhancements to oneMKL, with new routines for the BLAS libraries
  • oneVPL’s addition of a new API for processing camera RAW data and more
  • Level Zero’s addition of a fabric topoloty discovery API and sRGB support for image copy

oneAPI Community Forum Expansion

Led by Rod Burns, VP of Ecosystem at Codeplay Software, the forum has added the following new members to its steering committee:

  • Kevin Harms from Argonne National Labs – Performance Engineering Team Lead, MS in Computer Science
  • Penporn Koanantakool from Google – Sr. Software Engineer, Ph.D in Computer Science
  • Robert Cohn from Intel – Sr. Principal Engineer, Ph.D in Computer Science

Get the details >


 

Intel @ Supercomputing 2022 – Open, Accelerated Computing for HPC and AI

November 09, 2022 | Intel® oneAPI and AI Toolkits, oneAPI initiative

A lot of developer goodness was announced today by Jeff McVeigh, Intel VP and GM of its Super Computing group, Here are the highlights.

oneAPI and AI Tools 2023 Release

Available in December, Intel’s oneAPI and AI 2023 tools will provide optimized support for powerful new architectures, including the 4th Gen Intel® Xeon® Scalable Processor, Intel® Xeon® Processor Max Series (formerly codenamed Sapphire Rapids HBM), and Intel® Data Center GPU Max Series (formerly codenamed Ponte Vecchio).

These standards-based tools continue to help developers deliver multiarchitecture performance and productivity. New HPC and AI features include:

  • HPC and General Compute – Select tools support OpenMP 5.1, Intel® oneAPI DPC++/C++ Compiler provides improved SYCL language support, and Intel® Fortran Compiler fully implements F2003, F2008 and F2018 standards across Intel® CPUs and GPUs. 
  • AI – Optimizations for TensorFlow and PyTorch accelerate performance on current and upcoming Intel CPUs and GPUs. Extended quantization and distillation capabilities in the Intel® Neural Compressor deliver faster AI inference. These features are bundled in the Intel® AI Analytics Toolkit powered by oneAPI. 
  • Code Portability – Enhanced CUDA-to-SYCL code migration functions simplify creating single-source code for multiarchitecture systems. 

Other enhancements for this release were previewed at Intel® Innovation on Oct. 28.

Learn more >

7 New AI Reference Kits Released

To accelerate industry-driven solutions for AI, Intel recently released 7 new AI reference kits to address key business issues. The kits are powered by oneAPI and include optimized frameworks and oneAPI libraries, tools, and other components to maximize AI performance on Intel® hardware. The new kits target:

  • Health & Life Sciences - speech-to-text AI  
  • Retail – personalize experiences with customer segmentation, automate purchase prediction, demand forecasting, order-to-delivery forecasting
  • Financial Services - loan default risk prediction
  • Cross-industry - network intrusion detection

These kits join 9 others for a total of 16. Get them now via Intel or on GitHub.

New oneAPI Center of Excellence Focuses on Earthquake Research

The Southern California Earthquake Center with the San Diego Supercomputer Center at UC San Diego is hosting a new oneAPI Center of Excellence. The center’s focus addresses the challenges of numerically simulating the dynamics of fault rupture and seismic ground motion in realistic 3D models. It will optimize Anelastic Wave Propagation – Olsen, Day, Cui (AWP-ODC) software, an open source simulation code, using oneAPI to create portable, high-performance, multiarchitecture code for advanced HPC systems.  

The Anelastic Wave Propagation code is used extensively by the SCEC community, the National Science Foundation consortium, and scientists and researchers in real-world seismic hazard simulations and research domains. It enables computational productions from standard “forward” simulations (computing three-component seismograms, i.e., records of earthquake phenomena) to multiple-source “reciprocal” simulations (calculating seismic hazard estimates for sites of interest). The computational outcomes allow for ground motion predictions that help decision-makers reduce seismic risk by improving building codes and increasing community resilience to earthquake hazards. 

This oneAPI Center of Excellence joins 28 others around the globe working to accelerate oneAPI through research, code optimizations and implementations, and training programs.

Learn more >


 

Intel® oneAPI 2022.3 Tools Available

October 10, 2022 | Intel® oneAPI and AI Toolkits, oneAPI initiative

Enabling an Open, Multiarchitecture World

The newest update of Intel® oneAPI Toolkits and standalone tools is now available for direct download and/or use in the Intel® DevCloud for oneAPI. More than 30 tools are included in this release, each optimized to deliver improved performance and expanded capabilities for data-centric workloads.

Intel oneAPI Toolkits are purpose-built to optimize and accelerate cross-architecture and heterogeneous computing, delivering to developers open choice without sacrificing performance or functionality.

The toolkits provide compilers, languages, libraries, and analysis and debug tools that implement industry standards including SYCL*, C++, C, Python, Fortran, MPI, and OpenMP* as well as optimized versions of popular AI frameworks and Python libraries.

2022.3 Highlights:

Compilers

  • Intel® oneAPI DPC++/C++ Compiler adds more SYCL 2020 features to improve programming productivity on various hardware accelerators including GPUs and FPGAs and enhances OpenMP 5.x compliance.
  • Intel® Fortran Compiler adds Fortran 2008 and 2018 coarrays, DLLImport/DLLExport, DO CONCURRENT offload support, and -int and additional -check compiler options.

CUDA*-to-SYCL Porting

  • Intel® DPC++ Compatibility Tool supports more complete CUDA-to-SYCL code migration by adding support for CUDA 11.7 header files and CUDA runtimes and driver APIs including cuDNN, NCCL, Thrust, cuBLAS, and cuFFT.
  • The SYCLomatic Project expands Intel’s support of open computing with an open source version of the Compatibility Tool, which enables community collaboration to advance adoption of the SYCL standard.

Performance Libraries

  • Intel® oneAPI Math Kernel Library adds BLAS GPU device-timing support to ensure faster and easier detecting of exceptions and quicker recovery; improves portability and compatibility by extending OpenMP cluster offload capability to support the OpenMP 5.1 spec for LAPACK.
  • Intel® oneAPI DPC++ Library expands support of the C++ standard library in SYCL kernels with nine additional heap and sorting algorithms to simplify the coding of common functions.
  • Intel® oneAPI Video Processing Library includes the ability to provide extensive data about what is encoded, thereby opening up opportunities for quality improvement and algorithm innovation.

Analysis & Debug Tools

AI and Analytics

  • Intel® Extension for PyTorch* is updated to 1.12.100, includes automatic INT8 quantization, and adds operation and graph enhancements to improve performance across a broad set of workloads.
  • Intel® Optimization for TensorFlow* is updated to TensorFlow 2.9.1, includes performance improvements for bfloat16 models, and removes compiler requirement to enable oneDNN optimizations on Linux*-based Cascade Lake and newer CPUs.
  • Intel® Neural Compressor improves productivity with a lighter binary size, a new quantization accuracy feature and experimental auto-coding support, plus adds support for TensorFlow quantization API, QDQ quantization for ITEX, mixed-precision enhancement, DyNAS, training for block-wise structure sparsity, and op-type wise tuning strategy.

Rendering & Ray Tracing

  • Intel® Open Volume Kernel Library improves performance and memory efficiencies and adds support for VDB volumes (packed/contiguous data layouts for temporally constant volumes) and Intel® AVX-512 8-wide CPU device mode.
  • Intel® OSPRay supports primitive, object, and instance ID buffers as framebuffer channels, and face-varying attributes for mesh and subdivision geometry.
  • Intel® Embree supports the Intel oneAPI DPC++/C++ Compiler.

Get all the details >

Bookmark the oneAPI Training Portal – Learn the way you want to with learning paths, tools, on-demand training, and opportunities to share and showcase your work.

oneAPI Initiative Expands to a Community Forum for Open Accelerated Computing

September 28, 2022 | oneAPI Initiative & Specification

The future of oneAPI is shifting to a community forum to address the evolving needs of developers, software vendors, national labs, researchers, and silicon vendors.

Why? To build on the progress made on oneAPI adoption and implementations across multiple architectures and vendors.

Codeplay will lead in establishing the forum to grow and coordinate the oneAPI developer community—its history driving open standards and cross-platform experience with SYCL* development and oneAPI implementationss uniquely position it to facilitate these next steps.

Benefits

The forum will lead to greater community participation and guide the continuing evolution of oneAPI to enable more cross-architecture, multivendor implementations, and rapid adoptions.

Codeplay, in concert with the community, will provide additional details on the transition in the next quarter.

Get the details


​​​​​​​

Sneak Peek: 2023 Intel® oneAPI Tools

September 28, 2022 | Intel® oneAPI Toolkits

New Enhancements Coming in December

Winter is coming. Which means that Intel® oneAPI tools (toolkits and standalone tools) are on the precipice of revving to improved and optimized versions that are purpose-built to help developers continually deliver applications and solutions that work across multiple architectures—CPU, GPU, FPGA, and more.

The 2023 release includes enhancements to its standards-based developer products that are optimized for the latest and upcoming architectures (solely or in combination) such as:

  • 4th Gen Intel® Xeon® Scalable Processor
  • Intel® Data Center GPU codenamed Ponte Vecchio
  • Intel® Data Center GPU Flex Series
  • Intel® Arc™ Graphics
  • Intel® Agilex™ FPGAs

New top features include:

  • HPC and General Compute – Enhanced CUDA-to-SYCL code migration functions simplify creating performant single source code for multiarchitecture systems. The Intel® oneAPI DPC++/C++ Compiler provides improved SYCL language support and the Intel® Fortran Compiler fully implements F2003, F2008 and F2018 standards across Intel CPUs and GPUs.
  • Artificial Intelligence – TensorFlow* and PyTorch* are optimized for the 4th gen Xeon Scalable processor and Ponte Vecchio. Extended quantization and distillation capabilities in the Intel® Neural Compressor deliver faster AI inference. These features are bundled in the Intel® AI Analytics Toolkit powered by oneAPI.

The tools begin shipping in December.


 

Announcing 6 New oneAPI Centers of Excellence

September 28, 2022 | Academic Centers of Excellence

Six new oneAPI Centers of Excellence recently joined the oneAPI community. They will focus on accelerating oneAPI development on multiarchitecture systems by optimizing key software codes, creating new implementations, porting strategic applications to oneAPI, and developing and broadly sharing new curriculum to enable and expand oneAPI adoption.

The six new oneAPI Centers are:

  • Science and Technology Facilities Council will accelerate exascale software development on multiarchitecture systems using the SYCL* standard and oneAPI with specific focus on optimizing two prominent open source HPC software codes: a C++ coupling library called Multiscale Universal Interface (MUI) and a high-fidelity Computational Fluid Dynamics code called Xcompact3d. Both are integral within the UK’s ExCALIBUR exascale programme and part of its landscape for developing future exascale computing capabilities, providing accelerated computing platforms that can handle upwards of a trillion of calculations per second. 
  • School of Software and Microelectronics of Peking University is expanding teaching and practical usage of oneAPI programming, including developing and teaching classes and broadly sharing the new local language curriculum to enable and expand oneAPI adoption at universities in the People’s Republic of China.
  • Technion Israel Institute of Technology is facilitating studies in contemporary scientific computing on CPUs, GPUs, and other accelerators using oneAPI and Intel® Developer Cloud. Advanced courses using oneAPI and OpenMP* will expand to other universities. Undergraduate projects will also port select open source HPC and AI applications via oneAPI to OpenMP/SYCL and optimize their performance. Read more: English | Hebrew
  • University of California San Diego will focus on enabling high-performance molecular dynamics simulations in Amber via oneAPI—CPUs and accelerators—at its Supercomputer Center.
  • University of Utah in collaboration with the Lawrence Livermore National Laboratory is focused on developing portable, scalable, and performant data compression techniques by accelerating ZFP compression software using oneAPI on multiple architectures to advance exascale computing. 
  • Zuse Institute Berlin is focused on using oneAPI for energy-efficient HPC computing by delivering portable implementations on GPUs and FPGAs. 

To date, 28 oneAPI Centers of Excellence are driving oneAPI open accelerated compute adoption around the world.

Learn more: oneAPI Centers of Excellence


 

Now Available: 3 New AI Reference Kits

September 28, 2022 | AI Reference Kits

Solve important business problems.

Building on a set of AI SW Reference Kits released in July (in collaboration with Accenture), three new AI application reference kits powered by oneAPI are now available for healthcare to help clinicians with disease prediction, medical imaging diagnostics, and document automation. The kits can be downloaded from Intel or GitHub.

A continuing drumbeat of new AI reference kit releases will continue through 2023. 


 

Joint Solution with Red Hat Accelerates AI, New Data Science Developer Program

September 28, 2022 

New Enhancements Coming in December

 

Intel and Red Hat introduced a new joint solution that combines Intel’s AI hardware and software portfolio with Red Hat OpenShift Data Science (RHODS), an AI platform that enables data scientists and developers to work together to create, test, and build intelligent applications.

This solution enables developers to train and deploy their models using the Intel® AI Analytics Toolkit and OpenVINO™ tools, which are powered by oneAPI. 

Red Hat is also working to make the Habana Gaudi* training accelerator available on its service to deliver cost-efficient, high-performance, deep-learning model training and deployment. Additionally, a joint Intel and Red Hat AI developer program will enable developers to learn, test, and deploy AI software directly from both the RHODS sandbox and the Intel® Developer Cloud.

Learn more: Developer Resources from Intel & Red Hat | Boost OpenShift Data Science with Intel® AI Analytics Toolkit


 

For a Limited Time: Get Beta Access to New Intel® Technologies

September 27, 2022 | Intel® Developer Cloud

New technologies are a click away in the expanded Intel® Developer Cloud.

As noted by Intel CEO Pat Gelsinger during his keynote at Intel® Innovation, a limited beta trial opportunity is now open to for the newly expanded Intel® Developer Cloud.

Starting right now, approved developers and customers can get early access to Intel technologies—from a few months to a full year ahead of product availability—and try out, test, and evaluate them on Intel’s enhanced, cloud-based service platform.

The beta trial includes new and upcoming Intel compute and accelerator platforms such as:

  • 4th Gen Intel® Xeon® Scalable Processors (Sapphire Rapids)
  • Intel® Xeon 4th Gen® processor with high bandwidth memory (HBM)
  • Intel® Data Center GPU codenamed Ponte Vecchio
  • Intel® Data Center GPU Flex Series
  • Habana® Gaudi®2 Deep Learning accelerators

Registration and prequalification is required.

Visit cloud.intel.com to get started.

Intel Among Official TensorFlow Build Collaborators

September 21, 2022 | TensorFlow install with pip

Intel has officially partnered with Google to take ownership of developing and releasing TensorFlow Windows Native CPU builds, starting with TensorFlow 2.10. This close collaboration with Google underscores Intel’s commitment to deliver optimal experience for TensorFlow developers on Windows platforms.

Get the details >


 

Tencent Achieves Up to 85% Performance Boost using oneAPI Tools

September 12, 2022 | Intel® oneAPI DPC++/C++ Compiler, Intel® VTune™ Profiler

Tencent’s results with optimizing MySQL demonstrate the importance both of using up-to-date [Intel® oneAPI] developer tools like the Intel oneAPI DPC++/C++ Compiler and the latest optimization techniques using Intel VTune Profiler. The significant improvements in performance yield either faster time-to-results or more results for business-critical applications.

Joe Curley, Intel VP and GM, Intel Software Products & Ecosystem group

Tencent significantly enhanced the performance of its database hosting service, TencentDB for MySQL. Based on the open source relational database management system MySQL and built on Intel® Xeon® processors, performance increased by using the advanced Intel® oneAPI DPC++/C++ Compiler and Intel® VTune™ Profiler (part of the Intel® oneAPI Base Toolkit).

Why It Matters

Distributed data storage serves a critical role across industries and use cases, including internet, finance and e-commerce. Solutions like TencentDB for MySQL provide developers with a service for distributed data storage that supports easy setup, operation and expansion of relational databases in the cloud.

Read the story >


 

From Rendering to HPC, Intel® oneAPI Tools are Optimizing Open Source Solutions

September 8, 2022 | Intel® oneAPI Base Toolkit, Intel® oneAPI HPC Toolkit, Intel® oneAPI Rendering Toolkit

New this week, two popular and powerful applications, Blender 3.3 and Radioss (which now has an open source version: OpenRadioss), are optimized by very different (and very important) oneAPI tools and capabilities that benefit developers.

Check them out:

Blender Cycles Provides Full Support for Intel Discrete GPUs

Starting with Blender 3.3, the Cycles rendering engine now includes oneAPI as a rendering device API with support for Intel® Arc™ A-series discrete graphics and Intel® Data Center GPU Flex Series. The new support is implementing SYCL, by The Khronos Group, an open, standards-based language that provides multivendor CPU and GPU code development. This is a first step in an evolutionary development approach that aims to free Blender creators and users from being locked into single, proprietary architecture and programming.

Cycles is a ray tracing renderer in Blender with complex path-tracing scenes, geometry notes, indirect lighting and dense geometry for final frames. Over the years, Intel’s contributions to Blender include development consulting, integrating advanced ray tracing capabilities, and training. Intel® Embree, the academy award-winning 3D ray tracing kernel library, was integrated into Blender several years ago, delivering high-fidelity photorealism and supporting many films and projects. In 2019, Intel® Open Image Denoise was added, helping artists and studios deliver final frame image quality in less time.

Altair Unveils OpenRadioss

Altair taking OpenRadioss into the open source community enables developers who want to solve critical problems in structural analysis like crash simulation access to the benefits of open source development. Intel’s commitment to open source development is reflected in collaboration with Altair using open oneAPI compilers, libraries, and developer tools that help them productively maximize value from their high-performance hardware.

Joe Curley, Intel VP and GM, Intel Software Products & Ecosystem group

Altair has moved Radioss—a leading analysis solution to improve the crashworthiness, safety, and manufacturability of complex designs—to open source as OpenRadioss. Altair engineers used several tools in the Intel® oneAPI Base and HPC Toolkits to optimize the software.

Learn more >

Watch the video >


 

New Intel® Data Center GPU Flex Series for the Intelligent Visual Cloud Uses an Open Software Stack

Aug. 24, 2022 | oneVPL | Intel® VTune™ Profiler

Unveiled today, the Intel® Data Center GPU Flex Series is a versatile and seamless hardware with an open software solution stack that brings much-needed flexibility and performance to intelligent visual cloud workloads.

It delivers:

  • 5x media transcode throughput performance and 2x decode throughput performance at half the power of competitive solutions1
  • More than 30% bandwidth improvement for significant total cost of ownership (TCO) savings
  • Broad support for popular media tools, APIs, frameworks, and the latest codecs

The Intel Flex Series GPU is designed to flexibly handle a wide range of workloads—media delivery, cloud gaming, AI, metaverse, more—without compromising performance or quality, while lowering and optimizing TCO. The GPU frees users from the constraints of siloed and proprietary environments and reduces the need for data centers to use separate, discrete solutions.

Developers can access a comprehensive software stack that combines open source components and tools to effectively realize the Flex Series GPU capabilities for visual cloud workloads. Intel’s oneAPI tools empower developers to deliver accelerated applications and services, including oneVPL, Intel® VTune™ Profiler, and many more.

Watch for more details on easy downloadable software packages coming soon.  

Learn More: Intel News Byte | Intel Flex Series GPU

3D Artist’s Visuals Come to Life through Intel Hardware & Advanced Ray Tracing

August 16, 2022 | Intel® Open VKL, Intel® oneAPI Rendering Toolkit

Intel® Advanced Ray Tracing + Intel’s mobile HX processors deliver high performance for professional workflows and amazing content creation.

Intel recently released the Intel® Open VKL plugin for RenderMan*. It works with Pixar Animation’s Renderman—one of the world’s most versatile renderers for VFX and animation—and utilizes Intel® Open Volume Kernel Library to provide significant performance improvements for final-frame volumetric rendering.

The powerful combo of Renderman, Intel Open VKL, and 12th Gen Intel® Core™ HX processors helps artists like Fabio Sciedlarczyk render compute-intensive volumetric content more quickly, including fire, water, air, clouds, and smoke. That performance allows him more time to craft a visually stunning story. In this video, see how Sciedlarczyk used these tools to build amazing photoreal visuals, producing them on-the-go without sacrificing performance while dramatically reducing compile times on his mobile workstation.

These days, with the tools I have available, computer graphics is becoming a medium of almost no restrictions. And Intel is continually pushing the boundaries of what’s possible.

Fabio Sciedlarczyk, Looks development artist

 

The open source plugin is available free to the public on GitHub and aligns with Intel’s open software strategy to foster innovation and broad adoption by content creators and developers across the software ecosystem. Intel Open VKL is part of the Intel® oneAPI Rendering Toolkit.

More Resources

Intel and Aible Team Up to Fast-Track AI

August 9, 2022 | Aible, Intel® AI Analytics Toolkit

Intel® Xeon® Scalable processors, along with software optimizations, enable business results within 30 days.

Intel’s collaboration with Aible, a cloud-based AI/ML platform solution provider, enables customers to deliver datacenter-based AI applications and initiatives faster and with better TCO without increasing complexity.

When paired with AI-accelerated Intel® Xeon® Scalable processors plus AI-optimized tools: oneAPI Deep Neural Network Library (oneDNN) + others from Intel® AI Analytics Toolkit, Aible’s technology provides a serverless-first approach that trains machine learning modules faster than other server-oriented solutions.

Learn more & see the benchmark >

Read the case study >

Deploy Stunning Hi-Fi Graphics with Intel® Advanced Ray Tracing

August 8, 2022 | [NEW!] Intel® Arc™ Pro A-series graphics, Intel® Open VKL plugin for RenderMan*, Intel® Open Path Guiding Library, SIGGRAPH 2022

Newly Unveiled: Intel® Arc™ Pro GPUs, Intel® Open VKL Plugin for Renderman*, and Open Path Guiding Library

Just in time for SIGGRAPH 2022, Intel introduced new GPU hardware and software technologies that accelerate high-fidelity graphics. These innovations showcase the next step in the company’s mission to provide open, end-to-end, platform-wide solutions for performance scaling across consumer and high-end laptops, workstations, data center/render farm, cloud - to the world’s largest supercomputers.

Highlights

The Intel Arc Pro A-series professional range of GPUs feature built-in ray-tracing hardware, industry-first AV1 hardware encoding acceleration, and machine learning capabilities. Learn more >

New Intel® Advanced Ray Tracing technologies enable sophisticated ray tracing, visual compute, high-fidelity, and visualization capabilities.

Cross-industry collaborations with global leaders in standards-based solutions are continuously advancing graphics innovations, with the latest including:

  • DreamWorks Animation announced plans to release its MCRT renderer, MoonRay*, as open source software later this year. The renderer’s photoreal ray-tracing performance is supported by two open source tools in the Intel® oneAPI Rendering Toolkit (Render Kit): Intel® Embree ray tracing kernel library for advanced rendering features, and Intel® Implicit SPMD Program Compiler (Intel® ISPC) for vector instruction parallelism.
  • Blender* 3.3 is available in beta where oneAPI programming delivers one codebase support on Linux* and Windows* across Intel Arc GPUs and upcoming Intel data center GPUs.
  • Intel collaborated with leading Unity asset publisher Procedural Worlds on creating the Intel® Game Dev AI Toolkit with Gaia ML (for Unity). It enables developers to bring machine learning capabilities to their gaming experiences.
  • Foundry’s Modo* 16.0 release adds new support to its real-time viewport for upcoming Intel Arc GPUs.

Attending SIGGRAPH 2022?

If so, visit the Intel Booth (#427) to see demos showcasing innovative usages including Topaz and SketchUp running on the just-announced Intel Arc Pro graphics, and SideFX, Blender, and RenderMan optimized by the Render Kit on Intel GPUs and CPUs.

Discover More

Intel Releases Open Source AI Reference Kits to Simplify Development

July 12, 2022 |  Intel AI Dev Tools

Intel released the first set of open-source AI reference kits specifically designed to make AI more accessible to organizations in on-prem, cloud, and edge environments. 

First introduced at Intel Vision, these kits include AI model code, training data, end-to-end machine learning pipeline instructions, libraries, and Intel® oneAPI components for cross-architecture performance.

The First Kits Available Today

  • Utility Asset Health – This predictive analytics model was trained to help utilities deliver higher service reliability.
  • Visual Quality Control – Automate VQ control inspections for life sciences, including pharma to help improve the quality of the pills and lower the cost of operations.
  • Customer Chatbot for the Enterprise – This conversational AI chatbot model was trained using over 4,000 utterances from the Airline Travel Information Systems dataset to provide 94% predictive accuracy.
  • Intelligent Document Indexing – Automate the processing and categorizing of millions of documents via faster routing and lower manual labor costs.

Get the details >

Innovation thrives in an open, democratized environment and Intel’s AI tools and framework optimizations are built on the foundations of an open, standards-based, unified oneAPI programming model. These Project Apollo reference kits, built with components of Intel’s End-to-End AI software portfolio, will enable millions of developers and data scientists to quickly and easily introduce AI into their applications or boost their existing AI/ML implementations. This will help deliver a wide range of intelligent solutions across several use cases and industries.

Wei Li, Intel vice president and general manager of AI and Analytics

Intel & Google Cloud Provide Turnkey, Optimized Solution for HPC Workloads

July 6, 2022 |  Intel® oneAPI Base Toolkit, Intel® oneAPI HPC Toolkit

Intel and Google are working together to drive high-performance computing forward on Google Cloud with the release of the Cloud HPC Toolkit. This new resource provides access to tools from the Intel® oneAPI Base and HPC Toolkits—including Intel® MPI Library and Intel® oneAPI Math Kernel Library—to optimize performance through Intel® Select Solutions for Simulations & Modeling. These new tools improve compile times and speed of results and offer multi-vendor acceleration in SYCL.

Why It’s Important

In a nutshell, the new Toolkit simplifies adoption of robust high-performance cloud computing by removing the challenges inherent in groking and overcoming unfamiliar development concepts and tools. (These can result in slow deployment for demanding workloads, software incompatibilities, and subpar performance.)

Using Cloud HPC Toolkit with an Intel Select Solutions for Simulations & Modeling blueprint brings the added benefit of automatically spinning up a hardware-software configuration that has been rigorously tested and optimized for real-world performance, eliminating guesswork.

Ilias Katsardis – HPC Solution Lead – Google

Now Available: Intel® VTune™ Profiler 2022.3

June 7, 2022 | Intel® VTune™ Profiler

Intel VTune Profiler 2022.3

Find and optimize performance bottlenecks fast across CPU, GPU, and FPGA systems.

What’s New?

  • Supports DirectML API to pinpoint host-side API call inefficiencies and their causes
  • Enables developers to identify memory-transfer-related bottlenecks for GPU computing tasks which use USM extension of OpenCL™ API via analyzing CPU-side stacks.

Download it today >

Learn more at software.intel.com/vtune

Intel to Acquire Codeplay Software

June 1, 2022 | oneAPI Specification

Intel to Acquire Codeplay Software

Intel is further advancing its support of the oneAPI ecosystem through an agreement to acquire Codeplay Software, a global leader in cross-architecture, open, standards-based developer technologies. 

Codeplay is globally recognized for its expertise and leadership in SYCL, the Khronos Group’s open-standard programming model used in oneAPI, and its significant contributions to the industry ranging from open-ecosystem activities like SYCL and OpenCL™ to RISC-V, automotive software safety, and medical imaging. 

Codeplay has extensively delivered products supporting diverse hardware platforms globally, embracing the mission of bringing oneAPI to the masses.

Bolstered by the strength of Intel, Codeplay will be able to extend the delivery of SYCL solutions into cross-architecture and multi-vendor products, based on open standards and the open source ecosystems upon which they are built.

Joe Curley - VP & GM - Intel Software Products & Ecosystem

Intel at ISC 2022 Focuses on Sustainable, Open HPC-AI

May 31, 2022 | Intel @ ISC 2022

At International SuperComputing 2022, Jeff McVeigh, VP of Super Compute Group, highlighted Intel’s HPC leadership technologies that are being used to accelerate innovation for a more sustainable and open HPC-AI, including how:

  • Intel software and oneAPI extend across the software stack to provide tools, platforms and software IP to help developers produce scalable, better-performing, more efficient code that take advantage of the latest silicon innovations without the burden of refactoring code.
  • Two new Intel oneAPI Centers of Excellence join the ecosystem, bringing the total to 22 universities and labs working across the globe to increase oneAPI capabilities and adoption.

Read and watch Jeff’s editorial >

 

Introducing the New Intel oneAPI Centers of Excellence

  • University of Bristol is developing best practices for achieving performance portability at exascale using oneAPI and the Khronos Group* SYCL abstraction layer for cross-platform programming. The goal: ensure scientific codes can achieve high performance on massive heterogeneous supercomputing systems.
  • Centre for Development of Advanced Computing (CDAC) is building a base of skilled instructors who deliver oneAPI training to India HPC and AI communities. CDAC will scale training broadly in the country through its infrastructure and teach oneAPI in top universities.

More to Discover

Heidelberg University Drives Heterogeneous Computing with oneMKL Open-source Interfaces

May 25, 2022 |  Intel® oneAPI Math Kernel Library, oneAPI Specification

Heidelberg U has recently enabled ROCm support for random number generation and BLAS in Intel® oneAPI Math Kernel Library (oneMKL) interfaces. This is a new and significant community contribution to the oneMKL interfaces project, part of the oneAPI industry initiative that provides SYCL-based APIs for math algorithms focused on CPUs and compute-accelerator architectures.

This work—adding into the project support for rocRAND and rocBLAS—now makes it possible to generate random numbers and perform linear algebra computations using the hipSYCL compiler to achieve near-native performance in cross-platform applications written in hipSYCL. Additionally, it makes oneMKL open-source interfaces the first oneAPI component with upstream support for other SYCL implementations apart from DPC++.

Read all the details >

Additional resources

oneDNN AI Optimizations Turned Enabled by Default in TensorFlow 2

May 25, 2022 |  Intel® oneAPI Deep Neural Network Library

In the latest release of TensorFlow 2.9, performance improvements are delivered by Intel® oneAPI Deep Neural Network Library (oneDNN) enabled by Google as the default backend CPU optimization for x86 packages. This applies to all Linux x86 packages and for CPUs with neural-network-focused hardware features like AVX512_VNNI, AVX512_BF16, and AMX vector and matrix extensions found on 2nd gen Intel® Xeon® Scalable processors and newer CPUs.

These optimizations accelerate key performance-intensive operations such as convolution, matrix multiplication, and batch normalization, with up to 3 times performance improvements compared to versions without oneDNN acceleration.

Why It’s Important

While there is an emphasis today on AI accelerators like GPUs for machine learning and deep learning, CPUs remain a primary player in all stages of the AI workflow—ubiquitous across most personal devices, workstations, and data centers. These default optimizations will help enable millions of developers who already use TensorFlow to achieve productivity gains, faster time to train, and efficient utilization of compute.

Performance gains will benefit applications spanning natural language processing, image and object recognition, autonomous vehicles, fraud detection, medical diagnosis and treatment, and more.

Learn more >

Get the Software

More Resources

Intel Open Sources SYCLomatic Migration Tool to Help Developers Create Heterogeneous Code

May 15, 2022 | Data Parallel C++/SYCL

Intel recently released an open-source tool to migrate code to SYCL through a project called SYCLomatic; it helps developers more easily port CUDA code to SYCL and C++ to accelerate cross-architecture programming for heterogeneous architectures. This open-source project enables community collaboration to advance adoption of the SYCL standard, a key step in freeing developers from a single-vendor proprietary ecosystem.

How the SYCLomatic Tool Works

SYCLomatic assists developers in porting CUDA code to SYCL, typically migrating 90-95% of CUDA code automatically to SYCL code. To finish the process, developers complete the rest of the coding manually and then custom tune to the desired level of performance.

According to James Reinders, Intel oneAPI evangelist, “Migrating to C++ with SYCL gives code stronger ISO C++ alignment, multivendor support to relieve vendor lock-in, and support for multiarchitecture to provide flexibility in harnessing the full power of new hardware innovations. SYCLomatic offers a valuable tool to automate much of the work, allowing developers to focus more on custom tuning than porting.”

SYCLomatic is a GitHub project. Developers are encouraged to use the tool and provide feedback and contributions to advance the tool’s evolution.

Learn more >

Now Available: Intel® oneAPI Toolkits 2022.2

May 18, 2022 | oneAPI, Intel® oneAPI Toolkits

The latest Intel® oneAPI Tools are now available for direct download and/or use in the Intel® DevCloud. This release includes updates to all Toolkits (including 30+ individual tools)—each optimized to deliver improved performance and expanded capabilities for data-centric workloads.

2022.2 highlights:

Intel® Arc™ (Discrete) GPUs for Media, Gaming, and AI workloads

Compilers

  • Intel® oneAPI DPC++/C++ Compiler adds more SYCL* 2020 features to improve developer productivity for programming various hardware accelerators such as GPUs and FPGAs, enhances OpenMP* 5.1 compliance, and improves performance of OpenMP reductions for compute offload.
  • Intel® Fortran Compiler, based on modern LLVM technology, adds support for parameterized-derived types, F2018 IEEE Compare, and VAX structures support, and expands support for OpenMP 5.0 with Declare Mapper for scalars support.

High-Performance Libraries

  • oneMKL adds MKL_VERBOSE GPU support for the BLAS Domain and CPU support for the transpose domain for improved visibility during debugging.
  • oneCCL now supports Intel® Instrumentation and Tracing Technology profiling, opening new insights with tools such as VTune Profiler.
  • oneTBB improves support and use of the latest C++ standard for parallel_sort, plus adds fully functional features for task_arena extension, collaborative_all_once, adaptive mutexes, heterogeneous overloads for concurrent_hash_map, and task_scheduler_handle.
  • oneVPL supports multiple hardware adapters and expanded development environments, plus MPEG2 decode in a CPU implementation to improve codec coverage for systems that do not have dedicated hardware.
  • Intel® MPI Library enables better resource planning and control at an application level with GPU pinning, plus adds multi-rail support to improve application internode communication bandwidth.

Analysis Tools

  • Intel® Advisor adds user recommendations and sharing, including optimizing data-transfer reuse costs of CPU-to-GPU offloading, details of GPU Roofline kernels and Offload Modeling, and seeing offloaded parts of the code at source level (including performance metrics) in a GPU Roofline perspective.
  • Intel® VTune™ Profiler opens the ability to identify performance inefficiencies related to Intel® VT-d for latest-generation server platforms, supports Intel Arc GPUs, and is available as a Docker container.

AI Workload Acceleration

  • Intel® Extension for TensorFlow* adds faster model loading, improvements in efficient element-wise Eigen operations, and support for additional fusions such as matmul biasadd-g.
  • Additional functionality and productivity for Intel® Extension for Scikit-learn* and Intel® Distribution of Modin* through new features, algorithms and performance improvements such as Minkowski and Chebyshev distances in kNN and acceleration of the t-SNE algorithm.
  • Acceleration for AI deployments with quantization and accuracy controls in the Intel® Neural Compressor, making great use of low-precision inferencing across supported Deep Learning Frameworks. 
  • Support of new PyTorch model inference and training workloads via Model Zoo for Intel® Architecture, extending support to include Python 3.9, TensorFlow v2.8.0, PyTorch v1.10.0, and IPEX v1.10.0.

Scientific Visualization with Rendering & Ray Tracing

FPGAs

  • Intel® FPGA Add-On for oneAPI Base Toolkit enables users to specify an exact, min, or max latency between read and write access on memories and pipes and provides the ability to implement arithmetic floating point operations involving a constant with either DSPs and ALMs or only ALMs.

Get the details >

intel arc series-a av1

GROMACS & oneAPI Aid in Open Source Drug Discovery

May 5, 2022 | oneAPI Spec, Intel® oneAPI Tools

GROMACS, accelerated by SYCL, oneAPI, and multiarchitecture tools, has strong performance on GPUs based on Intel Xe Architecture

The recent GROMACS 2022 release was extended to multi-vendor architectures, including current and upcoming GPUs based on Intel Xe Architecture.  

The team, led by Erik Lindahl from Stockholm University & Royal Institute of Technology, ported GROMACS’ CUDA code, which only runs on Nvidia hardware, to SYCL using the Intel® DPC++ Compatibility Tool; the tool typically automates 90%-95% of the code1,2. The result: A single, portable, cross-architecture-ready code base that significantly streamlines development and provides flexibility for deployment in multiarchitecture environments.

The software’s accelerated compute was made possible by using Intel oneAPI cross-architecture tools—oneAPI DPC++/C++ Compiler, oneAPI libraries, and HPC analysis and cluster tools.

With GROMACS 2022’s full support of SYCL and oneAPI, we extended GROMACS to run on new classes of hardware. We’re already running production simulations on current Intel Xe architecture-based GPUs as well as the upcoming Intel Xe architecture-based GPU development platform Ponte Vecchio via the Intel® DevCloud. Performance results at this stage are impressive – a testament to the power of Intel hardware and software working together. Overall, these optimizations enable diversity in hardware, provide high-end performance, and drive competition and innovation so that we can do science faster, and lower costs downstream.

— Erik Lindahl

Read the whole story >

Watch the video >

About GROMACS

GROMACS is a molecular dynamics package designed for simulations of protein, lipids, and nucleic acids. Its simulations contribute to the identification of crucial pharmaceutical solutions for conditions such as breast cancer, COVID-19, and Type 2 diabetes, and the international distributed-computing initiative Folding@home.

1The team ported GROMACS’ Nvidia CUDA code to  Data Parallel C++ (DPC++), which is a SYCL implementation for oneAPI, in order to create new cross-architecture-ready code. 

2Intel estimates as of September 2021. Based on measurements on a set of 70 HPC benchmarks and samples, with examples like Rodinia, SHOC, PENNANT. Results may vary. 

Cross-architecture Dev Tools Deliver Incredible End-User Experiences on New GPU Systems

March 31, 2022 | Intel® Software Tools, Intel® Graphics Performance Analyzers, Intel® oneAPI Video Processing Library

If you’re a content creator or game developer, new Intel® Evo™ laptops equipped with Intel Arc A-Series GPUs empower you to create immersive end-user experiences with innovation across technologies, software, and AI-accelerated processing.

And Intel® software tools are a big part of helping developers liberate Intel Arc graphics capabilities and optimize applications for maximum visual performance on the GPU combined with Intel CPUs. Using them, you can:
 

  • Analyze and optimize graphics bottlenecks. Use Intel® Graphics Performance Analyzers to profile graphics and game applications and ramp up profiling abilities with ray tracing, system-level profiling, and XSuper Sampling (XeSS) capabilities. Capture streams and traces, optimize shaders, and identify the most expensive events with support for multiple APIs (DX, Vulkan, OpenGL, OpenCL, etc.). Download
  • Accelerate compute-intensive tasks. Identify the most time-consuming parts of CPU and GPU code. Visualize thread behaviors to quickly find and fix concurrency problems using Intel® VTune™ Profiler. Download
  • Speed up media processing and cloud game streaming. Intel® oneAPI Video Processing Library (oneVPL) enables hardware AV1 encode and decode support, and Intel® Deep Link via Hyper Encode APIs, delivering up to 1.4x faster1 single stream transcoding when taking advantage of multiple Intel accelerators in a platform. For content creators already using Handbrake and DaVinci Resolve, oneVPL is integrated into the latest versions. Download
  • Integrate AI and machine learning. For game developers, the Intel® Game Dev AI Toolkit delivers a spectrum of AI-powered capabilities, from immersive world creation to real-time game-object-style transfer visualizations. Download

Learn more >

1. Up to 40% higher FPS in video encoding through an internal release of HandBrake on integrated Intel Xe graphics + discrete Intel Arc graphics compared to using Intel Arc graphics alone. Handbrake running on Alchemist pre-production silicon. As of October 2021.

Soda Announces Intel oneAPI Center of Excellence to Support Scikit-learn Performance across Architectures

March 31, 2022 | Intel® Extension for SciKit-learn*

The Social Data research team (Soda) at Inria, France’s national research institute for digital science and technology, is establishing an Intel oneAPI Center of Excellence to focus on developing hardware-optimized performance boosters for scikit-learn, one of the most widely used machine learning libraries.

This scikit-learn extension will deliver more efficient machine learning by using oneAPI numba_dppy or DPC++ components. Additionally, the implementation will be packaged in an independently-managed project possibly maintained by scikit-learn core developers, Intel engineers, and other interested community members.

Heterogenous computing is inevitable. It happens when a host schedules computational tasks to different processors and accelerators like CPUs and GPUs. This partnership will make scikit-learn more performant and energy-efficient on multi-architecture systems.

Olivier Grisel, scikit-learn maintainer, Inria

Learn more.

About Soda
The Social Data research team specializes in computational and statistical research in data science and machine learning—including scikit-learn optimizations—to harness large databases focused on health and social sciences.

Intel Compilers Available in VS Marketplace

March 10, 2022 | Intel® oneAPI DPC++/C++ Compiler

Now there are more ways to download multi-parallelism-supporting compilers. LLVM-based DPC++/C++/C compilers for Windows* can now be downloaded from the Visual Studio Marketplace.

Feature Highlights:

  • Include extensions that support productive development of fast, multicore, vectorized, and cluster-based applications.
  • Support the latest C/C++ language and OpenMP* standards.
  • Support multiple parallelism models and high-performance libraries including oneTBB, oneMKL, oneVPL, and Intel® IPP.
  • Can be used to build mixed-language applications with C++, Visual Basic, C#, and more.

Learn more and get the free download.
 

Intel Investing in Growth Opportunities Enabled by Software

February 22, 2022 | Intel® oneAPI Tools

 

At Intel’s 2022 Investor Meeting, product updates included next-generation Intel® Xeon® and client CPUs and Ponte Vecchio/Arctic Sound-M GPUs that will accelerate data center, AI, and other segment workloads, along with the software to make this all happen.

Intel’s Software-First strategy was noted in Executive Breakout sessions.
 

  • Greg Lavender, Sr. Vice President, CTO, and GM of Intel Software and Advanced Technology Group, discussed in an editorial and presentation how open, standards-based, cross-architecture programming through oneAPI and Intel® oneAPI Toolkits delivers performance and development productivity across  advanced architectures.
  • Raja Koduri, Sr. Vice President and GM of Intel Accelerated Computing Systems & Graphics Group, outlined the combined power of hardware and software fronting Intel’s Media and HPC-AI Super Compute Strategies. Highlights:
    • Intel® Xeon® processors and an open ecosystem, including oneAPI Video Processing Library, Intel® oneAPI AI Analytics Library, and OpenVINO™ toolkit, deliver high-density, real-time broadcast and premium content to meet global demands where 80% of Internet traffic is video.1
    • Upcoming Artic Sound-M GPU will deliver a seamless media supercomputer with leadership transcode performance that addresses quality, latency, and density requirements for desktop and cloud gaming, with an AI analytics engine. It will be the industry’s only open-source media solution stack for streaming, gaming, and analytics, and the industry’s first GPU with AV1 encode that delivers over 30% bandwidth improvement at the same quality.2
    • Billions of lines are code are optimized for Xeon, which powers 85% of super computers.3 This sets a strong, seamless ecosystem foundation for the fierce combo of Intel Xeon Sapphire Rapids + Ponte Vecchio GPU, where oneAPI unleashes developers to utilize a range of CPUs and accelerators using a single codebase.

Learn more
 

1Source Cisco Global 2021 Forecast Highlights
2Source: Mhojhos Research
3Based on TOP500 list over the past decade
 

Technical University of Darmstadt Establishes Intel oneAPI Center of Excellence

February 14, 2022 | Intel® oneAPI Tools

The Technical University of Darmstadt (TU Darmstadt) Embedded Systems and Applications Group announces establishing an Intel oneAPI Center of Excellence (CoE). The center’s objective is to accelerate data parallel computing and simulation software used in medical and pharmaceutical research powered by oneAPI open cross-architecture programming.

Together with Intel, the university will port an accelerated version of the Autodock application to create a single code base that can be efficiently optimized and tuned for multiple hardware architecture targets. 

Additionally, TU Darmstadt is working on a next-gen parallel implementation of Autodock-GPU, which aims to speed up drug-discovery simulations by parallel execution across CPUs, GPUs, and FPGAs.

“The new oneAPI Center of Excellence is an exciting step forward for the multiarchitecture SYCL language and oneAPI,” says Joe Curley, vice president and general manager of Intel Software Products and Ecosystem division. “This collaboration with TU-Darmstadt team provides a path for medical and pharmaceutical researchers to use AutoDOCK-GPU productively on the hardware of their choice.”

Learn more. English | German
 

For a Limited Time: Get Beta Access to New Intel® Technologies

September 27, 2022 | Intel® Developer Cloud

New technologies are a click away in the expanded Intel® Developer Cloud. 

As noted by Intel CEO Pat Gelsinger during his keynote at Intel® Innovation, a limited beta trial opportunity is now open to for the newly expanded Intel® Developer Cloud

Starting right now, approved developers and customers can get early access to Intel technologies—from a few months to a full year ahead of product availability—and try out, test, and evaluate them on Intel’s enhanced, cloud-based service platform. 

The beta trial includes new and upcoming Intel compute and accelerator platforms such as: 

  • 4th Gen Intel® Xeon® Scalable Processors (Sapphire Rapids)
  • Intel® Xeon 4th Gen® processor with high bandwidth memory (HBM) 
  • Intel® Data Center GPU codenamed Ponte Vecchio 
  • Intel® Data Center GPU Flex Series 
  • Habana® Gaudi®2 Deep Learning accelerators 

Registration and prequalification is required.  

Visit cloud.intel.com to get started. 

Product and Performance Information

1

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.