Measuring Communication and Computation Overlap
Measuring Pure Communication Time
Iallgather
Iallgather_pure
Iallgatherv
Iallgatherv_pure
Iallreduce
Iallreduce_pure
Ialltoall
Ialltoall_pure
Ialltoallv
Ialltoallv_pure
Ibarrier
Ibarrier_pure
Ibcast
Ibcast_pure
Igather
Igather_pure
Igatherv
Igatherv_pure
Ireduce
Ireduce_pure
Ireduce_scatter
Ireduce_scatter_pure
Iscatter
Iscatter_pure
Iscatterv
Iscatterv_pure
Get_all_local
This benchmark tests the MPI_Get operation where one active process obtains data from all other processes. All target processes are waiting in the MPI_Barrier call, while the active process performs the transfers. The completion of the origin process is ensured by the MPI_Win_flush_local_all operation. Since local completion of the MPI_Get operation is semantically equivalent to a regular completion, the benchmark flow is very similar to the One_get_all benchmark.
NOTE:
This benchmark is not enabled in IMB-RMA by default. Specify the benchmark name in the command line or use the –include command-line parameter to run this benchmark.
Property |
Description |
---|---|
Measuredpattern |
(N*MPI_Get)/MPI_Win_flush_local_all, where N is the number of target processes |
MPI data type |
MPI_BYTE (origin and target) |
Reportedtimings |
Bare time |
Reportedthroughput |
MBps |