Profile Workload Performance on High-Bandwidth Memory (HBM) CPUs
Overview
The 4th generation Intel® Xeon® CPU Max Series with HBM is designed to maximize bandwidth and optimize performance for workloads such as modeling, AI, deep learning, high-performance computing, and data analytics.
This session discusses how to profile those types of workloads on the new Intel Xeon CPU Max Series with HBM using Intel® VTune™ Profiler, including:
- How HBM use affects performance and how a workload's memory use is distributed between DRAM and HBM.
- A review of the different memory modes available with HBM and how to use Intel VTune Profiler to identify which offers the best performance: HBM only, HBM flat, or HBM cache.
- A demonstration of how to collect HBM-specific performance metrics to better understand a workload’s HBM memory use.
The session includes a walk-through of the Intel® Developer Cloud.
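As a rough illustration of comparing memory placements before profiling, the sketch below builds (but does not run) a VTune Profiler memory-access collection command, optionally binding the workload's allocations to specific NUMA nodes with numactl. It assumes a Linux system where, in flat mode, HBM shows up as NUMA nodes without CPUs; confirm this on your own system (for example with `numactl -H`) before relying on it. The function names, the sample node layout, `./my_app`, and the result directory name are all hypothetical.

```python
import os
import shlex

SYSFS_NODES = "/sys/devices/system/node"  # standard Linux sysfs location for NUMA nodes


def read_node_cpulists(root=SYSFS_NODES):
    """Return {node_id: cpulist string} for each NUMA node Linux exposes."""
    nodes = {}
    if not os.path.isdir(root):
        return nodes  # not Linux, or sysfs is unavailable
    for name in os.listdir(root):
        if name.startswith("node") and name[4:].isdigit():
            with open(os.path.join(root, name, "cpulist")) as f:
                nodes[int(name[4:])] = f.read().strip()
    return nodes


def cpuless_nodes(cpulists):
    """Nodes with an empty cpulist. Assumption: these are the HBM nodes in flat mode."""
    return sorted(node for node, cpus in cpulists.items() if not cpus)


def profile_command(app_argv, result_dir, membind=None):
    """Build a memory-access collection command; bind memory to `membind` nodes if given."""
    cmd = ["vtune", "-collect", "memory-access", "-result-dir", result_dir, "--"]
    if membind:
        nodes = ",".join(str(n) for n in membind)
        cmd += ["numactl", "--membind=" + nodes]
    return cmd + list(app_argv)


# Hypothetical two-socket layout: nodes 0-1 have CPUs (DDR), nodes 2-3 do not (HBM).
sample_layout = {0: "0-55", 1: "56-111", 2: "", 3: ""}
hbm_nodes = cpuless_nodes(sample_layout)

# One run bound to the CPU-less nodes and one unbound run, for side-by-side comparison.
print(shlex.join(profile_command(["./my_app"], "r_hbm", membind=hbm_nodes)))
print(shlex.join(profile_command(["./my_app"], "r_default")))
```

On a real system, `read_node_cpulists()` can replace the sample layout. In cache mode, or on a system without HBM, no CPU-less nodes may be present, in which case the helper falls back to an unbound run.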
Skill level: Intermediate
Featured Software
Get Intel VTune Profiler as a stand-alone tool or as part of the Intel® oneAPI Base Toolkit.
Download Code Samples
Find and fix performance bottlenecks, and optimize application performance, system performance, and system configuration for HPC, cloud, IoT, media, storage, and more.
Develop high-performance, data-centric applications for CPUs, GPUs, and FPGAs with this core set of tools, libraries, and frameworks including LLVM*-based compilers.