Low Overhead
Accurately identify where the program spends time. Sampling is system-wide with negligible overhead (typically less than 5 percent).
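To make the idea of sampling concrete, here is a minimal Python sketch of how periodic samples can estimate where time goes without instrumenting the code. It is a toy simulation under stated assumptions, not how VTune analyzer is implemented: the function name `sample_timeline`, the timeline format, and the sample values are all hypothetical.

```python
from collections import Counter


def sample_timeline(timeline, interval):
    """Estimate each function's share of run time from periodic samples.

    timeline: list of (function_name, duration) pairs in execution order
              (hypothetical input format, used only for this illustration).
    interval: sampling period, in the same time unit as the durations.
    """
    # Precompute the time at which each function stops running.
    end_times = []
    elapsed = 0.0
    for name, duration in timeline:
        elapsed += duration
        end_times.append((elapsed, name))

    hits = Counter()
    t = interval / 2.0  # take the first sample half a period in
    i = 0
    while t < elapsed:
        # Advance to the function that is running at time t.
        while end_times[i][0] <= t:
            i += 1
        hits[end_times[i][1]] += 1
        t += interval

    total = sum(hits.values())
    if total == 0:
        return {}
    # Fraction of samples that landed in each function.
    return {name: count / total for name, count in hits.items()}
```

Calling `sample_timeline([("parse", 10), ("compute", 70), ("write", 20)], 1.0)` attributes most samples to `compute`, which is the function a developer would look at first. A shorter interval gives finer resolution at the cost of more samples to store.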
Find the Critical Path
Determine function calling sequences and find the critical path using Call Graph.
No Recompile Required
Traditional instrumenting profilers make you recompile or modify your build script. With VTune analyzer, just use your production executables.
Compatibility
VTune™ Performance Analyzer supports the latest Intel® processors (including Intel® 64 architecture-based processors, Intel® Itanium® processors, and multi-core processors) and a wide variety of Linux* distributions.
Programming Language and Compiler Independent
VTune analyzer supports all compilers that follow industry standards (ELF, STABS, DWARF).
Mixed Java* and Native Code
Unlike Java*-only analyzers, VTune analyzer tunes mixed Java and native code.¹
Minimal Memory Footprint
Remote profiling minimizes the performance impact on the target system by running the user interface on a separate system.
Command Line Capability
Automate batch operations.
Large Applications Welcome
VTune analyzer is a robust solution even with large executables.² If you have a large application with hundreds of thousands of functions, bring it to VTune analyzer.
Listen to the Compiler's Advice
An optimizing compiler can do a lot better with just a few tips from you. We've integrated the Intel® Compilers with VTune analyzer to make this easy and very effective.
"The Intel VTune Performance Analyzer took a multi-day task and turned it into a sub-day task."
"The enhanced command-line interface and the new automatable data export capabilities in VTune analyzer simplified integration into our daily development activities."
Nick Monyatovsky, ABAQUS Inc.
"VTune Analyzer for Linux is very cool. Getting useful data was quick and easy. I applaud you on this. I've never seen a tool that is easier to get the basic information you want to know."
Bradley Sergeant, SergArt Software, Software Architect, after the first hour using VTune™ Analyzer
Large Enterprise and HPC Systems:
Minimize traffic in non-uniform memory access (NUMA) systems by storing sampling data in local CPU memory. This is critical to avoid saturating the interconnect fabric and slowing the system under test.
Designed for High Performance Computing
Large High Performance Computing (HPC) systems have unique requirements supported by VTune analyzer.
Multiple users can share a large system for simultaneous Call Graph performance analyses.
Sampling is supported on systems with 128 or more³ processors, using local buffering per CPU to minimize inter-node contention. To limit the amount of data collected, we recommend selecting a maximum of 64 CPUs for simultaneous data collection.
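The per-CPU local buffering described above can be pictured with a small Python sketch. It is an illustration only, assuming threads stand in for CPUs. The names `collect_with_local_buffers`, `num_cpus`, and `samples_per_cpu` are hypothetical and do not come from VTune analyzer.

```python
import threading


def collect_with_local_buffers(num_cpus, samples_per_cpu):
    """Simulate collection where every CPU writes only to its own buffer."""
    # One buffer per simulated CPU, so no buffer is shared during collection.
    buffers = [[] for _ in range(num_cpus)]

    def worker(cpu):
        local = buffers[cpu]  # this worker never touches another CPU's buffer
        for seq in range(samples_per_cpu):
            local.append((cpu, seq))

    threads = [threading.Thread(target=worker, args=(cpu,))
               for cpu in range(num_cpus)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()

    # Merge once, after collection has finished, rather than per sample.
    return [sample for buf in buffers for sample in buf]
```

Because each worker appends only to its own list, no lock or shared queue is touched while samples are being taken. The single merge at the end is the only step that reads across buffers, which is the property that keeps cross-node traffic low in the real NUMA case.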
"Our Applications Analysts have leveraged VTune analyzer's code-level capabilities to isolate and correct application performance bottlenecks in several major ISV HPC applications. We regularly use VTune for tuning and optimizing applications."
Donny Cooper, Senior Systems Analyst, NEC Solutions (America) Inc., Advanced Technical Computing Center
Isolate problems like poor prefetching and poor memory alignment. Sometimes just choosing an event is not selective enough, because the event can occur at both critical and non-critical times. On Intel Itanium architecture, instruction filtering lets you collect events only when they occur with a specified opcode.
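As a rough illustration of why filtering on an opcode narrows the data, the following Python sketch filters an already-collected list of events. The analyzer applies its filter during collection on the processor, so this is only a conceptual model. The `Event` record, its fields, and the event name used in the test data are hypothetical.

```python
from collections import namedtuple

# Hypothetical record for one sampled event; the fields are illustrative only.
Event = namedtuple("Event", ["name", "opcode", "address"])


def filter_by_opcode(events, event_name, opcode):
    """Keep only occurrences of event_name raised by the given opcode."""
    return [e for e in events
            if e.name == event_name and e.opcode == opcode]
```

Given a stream in which the same event is raised by both load and prefetch instructions, asking only for the prefetch opcode leaves just the occurrences that matter for a prefetch investigation.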
Minimize Data Collection with CPU Selection
Collect only the data you need. CPU selection lets you control exactly where data is collected: on all processors, only on those in your allocation, or only on the processors you specify. This greatly reduces the amount of data you need to collect.
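The three selection modes can be sketched in a few lines of Python. This is a minimal model under stated assumptions, not the analyzer's interface: the function names, the mode strings, and the `(cpu, payload)` sample format are all hypothetical.

```python
def select_cpus(mode, all_cpus, allocation=None, explicit=None):
    """Return the set of CPU ids on which samples should be kept.

    mode: "all", "allocation", or "explicit" (hypothetical mode names).
    all_cpus: every CPU id present in the system.
    allocation: CPU ids assigned to your job, used by "allocation".
    explicit: CPU ids named by the user, used by "explicit".
    """
    all_cpus = set(all_cpus)
    if mode == "all":
        return all_cpus
    if mode == "allocation":
        return set(allocation or ()) & all_cpus
    if mode == "explicit":
        return set(explicit or ()) & all_cpus
    raise ValueError("unknown CPU selection mode: %r" % (mode,))


def filter_samples(samples, cpus):
    """Keep only (cpu, payload) samples taken on one of the selected CPUs."""
    return [sample for sample in samples if sample[0] in cpus]
```

On Linux, one way to obtain an allocation for this sketch is `os.sched_getaffinity(0)`, which returns the set of CPUs the current process is allowed to run on. Selecting 2 CPUs out of 8 in this model drops every sample from the other 6, which is where the reduction in collected data comes from.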
² Large applications are welcome! For example, the source distribution tree of one large application, including the tools and predefined libraries required to do a build (but not the build itself), is about 1.85 GB with over 62,700 files. The execution tree alone is about 870 MB with over 8,200 files.
³ Due to the unique requirements for supporting large systems, if the software will be used on systems with more than 128 cores, please contact us before purchase to make special arrangements.
Intel® Software Network
It’s free and easy to become a member, so join today!