If you select the thread group representing a single node to concentrate on
intra-node effects, the analysis becomes slower than with the flat group that
contains all threads. Why does this happen? First, Intel Trace Analyzer does not
have to do any aggregation for that group because it is flat (assuming no
threads are used). Second, despite
the fact that only a single SMP node is chosen, all other threads still go
through the analysis and are placed in the artificially created thread group
Other. Use Advanced > Show Process Group 'Other' to make this group visible.
To speed things up, choose a filter that only lets the threads of the selected
SMP node pass.