Analyzing the Results

Intel® Trace Analyzer and Collector User and Reference Guide

Download PDF

ID 767272

Date 10/31/2024

Version 2021.10

Public

Document Table of Contents

Document Table of Contents x

Intel® Trace Analyzer and Collector User and Reference Guide

Intel® Trace Analyzer and Collector User and Reference Guide x

Introduction Install and Set Up Intel® Trace Analyzer and Collector Trace Your Applications Analyze Your Applications Intel® Trace Collector Reference Intel® Trace Analyzer Reference Notices and Disclaimers

Introduction x

Notational Conventions Get Help

Trace Your Applications x

Tracing Conventional MPI Applications Tracing Failing MPI Applications Tracing OpenSHMEM* Applications Tracing MPI File IO Handling of Communicator Names Tracing MPI Load Imbalance Tracing User Defined Events Configuring the Collector Filtering Trace Data Recording OpenMP* Regions Information Tracing System Calls (Linux* OS) Collecting Lightweight Statistics Recording Source Location Information Recording Hardware Performance Information (Linux* OS) Recording Operating System Counters Tracing Library Calls Correctness Checking Tracing Distributed Non-MPI Applications

Correctness Checking x

Correctness Checking of MPI Applications Running with Valgrind* (Linux* OS) Configuring Error Checks Analyzing the Results Debugger Integration

Debugger Integration x

TotalView* Debugger GNU* Symbolic Debugger Allinea* Distributed Debugging Tool* (DDT*)

Analyze Your Applications x

Starting Intel® Trace Analyzer Intel Trace Analyzer Graphical User Interface Navigating Timelines Concepts Viewing Correctness Checking Reports Comparing Two Trace Files Interoperability with Intel® VTune™ Profiler and Intel® Advisor OpenMP* Regions Display Support OTF2 Format Support

Navigating Timelines x

Zoom Stack

Concepts x

Level of Detail Aggregation Advanced Aggregation Tagging and Filtering

Viewing Correctness Checking Reports x

Event Timeline Correctness Checking Reports Qualitative Timeline Correctness Checking Reports Detailed Dialog

Comparing Two Trace Files x

Mappings in Comparison Views Comparison Charts

Mappings in Comparison Views x

Mapping of Processes Mapping of Functions

Comparison Charts x

Comparison Function Profile Comparison Message Profile Comparison Collective Operations Profile

Intel® Trace Collector Reference x

API Reference Configuration Reference Correctness Checking Errors Structured Tracefile Format stftool Utility Time Stamping Secure Loading of Dynamic Link Libraries* on Windows* OS

API Reference x

Initialization, Termination and Control Defining and Recording Source Locations Defining and Recording Functions or Regions Defining and Recording Scopes Defining Groups of Processes Defining and Recording Counters Recording Communication Events Additional API Calls in libVTcs C++ API

Initialization, Termination and Control x

VT_initialize VT_finalize VT_getrank VT_registerthread VT_registernamed VT_registerprefixed VT_getthrank VT_traceon VT_traceoff VT_tracestate VT_symstate VT_flush VT_timestamp VT_timestart VT_setfinalizecallback VT_getdescription VT_countsetcallback

Defining and Recording Functions or Regions x

New Interface Old Interface State Changes

C++ API x

VT_FuncDef Class Reference VT_SclDef Class Reference VT_Function Class Reference VT_Region Class Reference

Configuration Reference x

Configuration File Format Protocol File Configuration Options

Configuration Options x

ACTIVITY ALTSTACK AUTOFLUSH CHECK CHECK-LEAK-REPORT-SIZE CHECK-MAX-DATATYPES CHECK-MAX-ERRORS CHECK-MAX-PENDING CHECK-MAX-REPORTS CHECK-MAX-REQUESTS CHECK-SUPPRESSION-LIMIT CHECK-TIMEOUT CHECK-TRACING CLUSTER COMPRESS-RAW-DATA COUNTER CURRENT-DIR DEADLOCK-TIMEOUT DEADLOCK-WARNING DEMANGLE DETAILED-STATES ENTER-USERCODE ENVIRONMENT EXTENDED-VTF FLUSH-PID FLUSH-PREFIX GROUP HANDLE-SIGNALS INTERNAL-MPI KEEP-RAW-EVENTS LOGFILE-FORMAT LOGFILE-NAME LOGFILE-PREFIX LOGFILE-RANK MEM-BLOCKSIZE MEM-FLUSHBLOCKS MEM-INFO MEM-MAXBLOCKS MEM-MINBLOCKS MEM-OVERWRITE NMCMD OS-COUNTER-DELAY PCTRACE PCTRACE-CACHE PCTRACE-FAST PLUGIN PROCESS PROGNAME PROTOFILE-NAME STATISTICS STATE STF-PROCS-PER-FILE STF-USE-HW-STRUCTURE STOPFILE-NAME SYMBOL SYNC-MAX-DURATION SYNC-MAX-MESSAGES SYNC-PERIOD SYNCED-CLUSTER SYNCED-HOST TIME-WINDOWS (Experimental) TIMER TIMER-SKIP UNIFY-COUNTERS UNIFY-GROUPS UNIFY-SCLS UNIFY-SYMBOLS VERBOSE VT_START_PAUSED VT_COMPRESS_TRACE

Correctness Checking Errors x

Supported Errors How the Collection Works

How the Collection Works x

Parameter Checking Premature Exit Overlapping Memory Detecting Illegal Buffer Modifications Buffer Given to MPI Cannot Be Read or Written Distributed Memory Checking Illegal Memory Access Request Handling Datatype Handling Buffered Sends Deadlocks Checking Message Transmission Datatype Mismatches Data Modified during Transmission Checking Collective Operations Freeing Communicators

Structured Tracefile Format x

STF Components Single-File STF Configuring STF

stftool Utility x

stftool Utility Options Expanded ASCII output of STF Files

Time Stamping x

Clock Synchronization Choosing a Timer

Choosing a Timer x

gettimeofday/_ftime QueryPerformanceCounter CPU Cycle Counter Normalized CPU Cycle Counter MPI_Wtime() High Precision Event Timers POSIX* clock_gettime

Intel® Trace Analyzer Reference x

Graphical User Interface Reference Intel® Trace Analyzer Command Line Interface Reference Filter Expression Grammar otf2-to-stf Utility

Graphical User Interface Reference x

Welcome Page Summary Page Main Menu Bar View Menu Bar View Bars Charts Dialogs Settings

Main Menu Bar x

File Menu Options Menu Project Menu Windows Menu Help Menu

View Menu Bar x

View Charts Navigate Advanced Layout Comparison Menu

Advanced x

Tagging Specific Events Filtering Events Simulating Ideal Communication Checking Application Imbalance Aggregating Results Aggregating Functions Creating Command Line for Intel® VTune™ Profiler and Intel® Advisor

View Bars x

Toolbar Trace Map Status Bar

Charts x

Event Timeline Qualitative Timeline Quantitative Timeline Counter Timeline Function Profile Message Profile Collective Operations Profile Performance Assistant Common Chart Features

Event Timeline x

Context Menu Filtering and Tagging

Qualitative Timeline x

Context Menu Filtering and Tagging

Quantitative Timeline x

Context Menu Filtering and Tagging

Counter Timeline x

Context Menu Filtering and Tagging

Function Profile x

Flat Profile Load Balance Call Tree Call Graph Context Menu Filtering and Tagging Function Profile Settings

Message Profile x

Context Menu Filtering and Tagging Aggregation Message Profile Settings

Collective Operations Profile x

Context Menu Filtering and Tagging Collective Operations Profile Settings

Dialogs x

Process Aggregation Function Aggregation Function Group Color Editor Filtering Dialog Box Tagging Dialog Box Idealization Dialog Box Imbalance Diagram Dialog Box Trace Merge Dialog Box Details Dialog Box Source View Dialog Time Interval Selection Configuration Dialogs Find Dialog Box Command line for Intel® VTune™ Profiler and Intel® Advisor Dialog Box OTF2 to STF Conversion Dialog Box Configuration Assistant

Process Aggregation x

Comparison Mode

Function Aggregation x

Comparison Mode

Filtering Dialog Box x

Building Filter Expressions Using Graphical Interface Building Filter Expressions Manually Filter Expressions in Comparison Mode

Details Dialog Box x

Detailed Attributes of Function Events Detailed Attributes of Message Events Detailed Attributes of Collective Operation Events

Configuration Dialogs x

Load Configuration File Dialog Edit Configuration File Dialog

Settings x

Preferences Font Settings Number Formatting Settings

Preferences x

General Preferences Tracefile Preferences Event Timeline Settings Qualitative Timeline Settings Quantitative Timeline Settings Counter Timeline Settings

Notices and Disclaimers x

Appendix A Copyright and Licenses

Intel® Trace Analyzer and Collector User and Reference Guide

Introduction

Notational Conventions

Get Help

Install and Set Up Intel® Trace Analyzer and Collector

Trace Your Applications

Tracing Conventional MPI Applications

Tracing Failing MPI Applications

Tracing OpenSHMEM* Applications

Tracing MPI File IO

Handling of Communicator Names

Tracing MPI Load Imbalance

Tracing User Defined Events

Configuring the Collector

Filtering Trace Data

Recording OpenMP* Regions Information

Tracing System Calls (Linux* OS)

Collecting Lightweight Statistics

Recording Source Location Information

Recording Hardware Performance Information (Linux* OS)

Recording Operating System Counters

Tracing Library Calls

Correctness Checking

Correctness Checking of MPI Applications

Running with Valgrind* (Linux* OS)

Configuring Error Checks

Analyzing the Results

Debugger Integration

TotalView* Debugger

GNU* Symbolic Debugger

Allinea* Distributed Debugging Tool* (DDT*)

Tracing Distributed Non-MPI Applications

Analyze Your Applications

Starting Intel® Trace Analyzer

Intel Trace Analyzer Graphical User Interface

Navigating Timelines

Zoom Stack

Concepts

Level of Detail

Aggregation

Advanced Aggregation

Tagging and Filtering

Viewing Correctness Checking Reports

Event Timeline Correctness Checking Reports

Qualitative Timeline Correctness Checking Reports

Detailed Dialog

Comparing Two Trace Files

Mappings in Comparison Views

Mapping of Processes

Mapping of Functions

Comparison Charts

Comparison Function Profile

Comparison Message Profile

Comparison Collective Operations Profile

Interoperability with Intel® VTune™ Profiler and Intel® Advisor

OpenMP* Regions Display Support

OTF2 Format Support

Intel® Trace Collector Reference

API Reference

Initialization, Termination and Control

VT_initialize

VT_finalize

VT_getrank

VT_registerthread

VT_registernamed

VT_registerprefixed

VT_getthrank

VT_traceon

VT_traceoff

VT_tracestate

VT_symstate

VT_flush

VT_timestamp

VT_timestart

VT_setfinalizecallback

VT_getdescription

VT_countsetcallback

Defining and Recording Source Locations

Defining and Recording Functions or Regions

New Interface

Old Interface

State Changes

Defining and Recording Scopes

Defining Groups of Processes

Defining and Recording Counters

Recording Communication Events

Additional API Calls in libVTcs

C++ API

VT_FuncDef Class Reference

VT_SclDef Class Reference

VT_Function Class Reference

VT_Region Class Reference

Configuration Reference

Configuration File Format

Protocol File

Configuration Options

ACTIVITY

ALTSTACK

AUTOFLUSH

CHECK

CHECK-LEAK-REPORT-SIZE

CHECK-MAX-DATATYPES

CHECK-MAX-ERRORS

CHECK-MAX-PENDING

CHECK-MAX-REPORTS

CHECK-MAX-REQUESTS

CHECK-SUPPRESSION-LIMIT

CHECK-TIMEOUT

CHECK-TRACING

CLUSTER

COMPRESS-RAW-DATA

COUNTER

CURRENT-DIR

DEADLOCK-TIMEOUT

DEADLOCK-WARNING

DEMANGLE

DETAILED-STATES

ENTER-USERCODE

ENVIRONMENT

EXTENDED-VTF

FLUSH-PID

FLUSH-PREFIX

GROUP

HANDLE-SIGNALS

INTERNAL-MPI

KEEP-RAW-EVENTS

LOGFILE-FORMAT

LOGFILE-NAME

LOGFILE-PREFIX

LOGFILE-RANK

MEM-BLOCKSIZE

MEM-FLUSHBLOCKS

MEM-INFO

MEM-MAXBLOCKS

MEM-MINBLOCKS

MEM-OVERWRITE

NMCMD

OS-COUNTER-DELAY

PCTRACE

PCTRACE-CACHE

PCTRACE-FAST

PLUGIN

PROCESS

PROGNAME

PROTOFILE-NAME

STATISTICS

STATE

STF-PROCS-PER-FILE

STF-USE-HW-STRUCTURE

STOPFILE-NAME

SYMBOL

SYNC-MAX-DURATION

SYNC-MAX-MESSAGES

SYNC-PERIOD

SYNCED-CLUSTER

SYNCED-HOST

TIME-WINDOWS (Experimental)

TIMER

TIMER-SKIP

UNIFY-COUNTERS

UNIFY-GROUPS

UNIFY-SCLS

UNIFY-SYMBOLS

VERBOSE

VT_START_PAUSED

VT_COMPRESS_TRACE

Correctness Checking Errors

Supported Errors

How the Collection Works

Parameter Checking

Premature Exit

Overlapping Memory

Detecting Illegal Buffer Modifications

Buffer Given to MPI Cannot Be Read or Written

Distributed Memory Checking

Illegal Memory Access

Request Handling

Datatype Handling

Buffered Sends

Deadlocks

Checking Message Transmission

Datatype Mismatches

Data Modified during Transmission

Checking Collective Operations

Freeing Communicators

Structured Tracefile Format

STF Components

Single-File STF

Configuring STF

stftool Utility

stftool Utility Options

Expanded ASCII output of STF Files

Time Stamping

Clock Synchronization

Choosing a Timer

gettimeofday/_ftime

QueryPerformanceCounter

CPU Cycle Counter

Normalized CPU Cycle Counter

MPI_Wtime()

High Precision Event Timers

POSIX* clock_gettime

Secure Loading of Dynamic Link Libraries* on Windows* OS

Intel® Trace Analyzer Reference

Graphical User Interface Reference

Welcome Page

Summary Page

Main Menu Bar

File Menu

Options Menu

Project Menu

Windows Menu

Help Menu

View Menu Bar

View

Charts

Navigate

Advanced

Tagging Specific Events

Filtering Events

Simulating Ideal Communication

Checking Application Imbalance

Aggregating Results

Aggregating Functions

Creating Command Line for Intel® VTune™ Profiler and Intel® Advisor

Layout

Comparison Menu

View Bars

Toolbar

Trace Map

Status Bar

Charts

Event Timeline

Context Menu

Filtering and Tagging

Qualitative Timeline

Context Menu

Filtering and Tagging

Quantitative Timeline

Context Menu

Filtering and Tagging

Counter Timeline

Context Menu

Filtering and Tagging

Function Profile

Flat Profile

Load Balance

Call Tree

Call Graph

Context Menu

Filtering and Tagging

Function Profile Settings

Message Profile

Context Menu

Filtering and Tagging

Aggregation

Message Profile Settings

Collective Operations Profile

Context Menu

Filtering and Tagging

Collective Operations Profile Settings

Performance Assistant

Common Chart Features

Dialogs

Process Aggregation

Comparison Mode

Function Aggregation

Comparison Mode

Function Group Color Editor

Filtering Dialog Box

Building Filter Expressions Using Graphical Interface

Building Filter Expressions Manually

Filter Expressions in Comparison Mode

Tagging Dialog Box

Idealization Dialog Box

Imbalance Diagram Dialog Box

Trace Merge Dialog Box

Details Dialog Box

Detailed Attributes of Function Events

Detailed Attributes of Message Events

Detailed Attributes of Collective Operation Events

Source View Dialog

Time Interval Selection

Configuration Dialogs

Load Configuration File Dialog

Edit Configuration File Dialog

Find Dialog Box

Command line for Intel® VTune™ Profiler and Intel® Advisor Dialog Box

OTF2 to STF Conversion Dialog Box

Configuration Assistant

Settings

Preferences

General Preferences

Tracefile Preferences

Event Timeline Settings

Qualitative Timeline Settings

Quantitative Timeline Settings

Counter Timeline Settings

Font Settings

Number Formatting Settings

Intel® Trace Analyzer Command Line Interface Reference

Filter Expression Grammar

otf2-to-stf Utility

Notices and Disclaimers

Appendix A Copyright and Licenses

Analyzing the Results

For interactive debugging, you should start the application so that stderr is printed to a console window. Then you can follow which errors are found while the application is running and start analyzing them without having to wait for it to complete. If critical errors are found early, you can abort the run, fix the problem and restart. This ensures a much faster code and test cycle than a post-mortem analysis.

The output for each error varies, depending on the error: only the relevant information is printed, thus avoiding the need to manually skip over irrelevant information. In general, Intel® Trace Collector starts with the error name and then continues with a description of the failure.

For each MPI call involved in the error the MPI parameters are dumped. If PC tracing is enabled (see PCTRACE), Intel Trace Collector also provides a backtrace of source code locations for each call. For entities like requests, the involved calls include the places where a request was created or activated. This helps to track down errors where the problem is not at the place where it is detected.

Because multiple processes might print errors concurrently, each line is prefixed with a tag that includes the rank of the process in MPI_COMM_WORLD which reports the problem. MPI applications which use process spawning or attachment are not supported, therefore that rank is unique.

When the application terminates, Intel Trace Collector does further error checks (for example, unfree resources, pending messages).

Notes:

If any process is killed without giving it a chance to clean up (that is, by sending it a SIGKILL), this final step is not possible.
Sending a SIGINT to mpiexec through kill or pressing CTRL-C will cause Intel MPI Library to abort all processes with such a hard SIGKILL.

Parent topic: Correctness Checking

Level Two Title

Select Your Language

Using Intel.com Search

Quick Links

Recent Searches

Advanced Search

Only search in

Intel® Trace Analyzer and Collector User and Reference Guide

Analyzing the Results