TAU
The TAU Performance System(R) is a parallel performance evaluation toolkit that supports profiling and tracing. It supports automatic instrumentation of applications written in C++, C, Fortran, and Python with support for runtimes including Intel(R) OneAPI, MPI, OpenSHMEM, pthread, OpenMP, Kokkos on Intel(R) platforms.
Application Features
Find out more about the Intel® oneAPI Base Toolkit
https://software.intel.com/content/www/us/en/develop/tools/oneapi/base-toolkit.html
The TAU Performance System
University of Oregon- TAU Performance System® is a portable profiling and tracing toolkit for performance analysis of parallel programs written in Fortran, C, C++, UPC, Java, Python.
TAU (Tuning and Analysis Utilities) is capable of gathering performance information through instrumentation of functions, methods, basic blocks, and statements as well as event-based sampling. All C++ language features are supported including templates and namespaces. The API also provides selection of profiling groups for organizing and controlling instrumentation. The instrumentation can be inserted in the source code using an automatic instrumentor tool based on the Program Database Toolkit (PDT), dynamically using DyninstAPI, at runtime in the Java Virtual Machine, or manually using the instrumentation API.