Intel® Optane™ Persistent Memory Decision Guide

Victoria Yashina

About the Decision Guide

This document provides guidance to determine if Intel® Optane™ persistent memory (PMem) in the memory mode can benefit the user workload. The document describes how to analyze telemetry and topology data of a workload on a DRAM-only system to identify if the system can benefit from the affordable cost and high capacity of PMem.

The guidance is largely agnostic of a tool or platform. It intends to present a growing list of use cases that are neither exclusive nor exhaustive. However, some metrics are only available through specific tools and platforms. They are indicated accordingly in Appendix B.

This article enumerates different situations under which PMem can be helpful for you. Each of these opportunities of benefiting from PMem are listed in this document as a scenario. Each scenario contains the high-level description, the potential value proposition and the steps to identify the scenario using the data from a DRAM-only workload run.

The use of this document requires some familiarity with PMem. You can get an overview of PMem technology on the Intel Optane Persistent Memory Page. The knowledge of persistent memory programming is not required to understand and apply the methodology defined in this document.

The efficiency of the analysis is not strictly governed solely by the metrics described in this document. Equivalent metrics that capture the same workload characteristics also can be used. For a comprehensive list of relevant metrics and their definitions, see Appendix A.

Evaluating PMem Benefits for Your Workload

PMem in the memory mode provides you with a cost-effective solution with a large byte-addressable main memory. Configured for the memory mode, an operating system and supported applications perceive a pool of volatile memory, not differing from the available memory on a DRAM-only system. The difference in the PMem Memory Mode is that PMem provides the main memory while the DRAM acts as a cache for the accessed frequently data.

Figure 1 illustrates the memory hierarchy of a DRAM-only system. Elements that are faster, costlier, and smaller are at the top of the pyramid (L1 Cache) and sit closer to the core. Lower levels provide larger sizes and are less expensive, but slower.

Figure 1 - Memory Hierarchy of a DRAM-only System — Figure 1 - Memory Hierarchy Of a DRAM-only System

PMem extends the memory hierarchy by adding another level below the DRAM (Figure 2). In the new hierarchy, PMem becomes the main addressable memory and the DRAM acts as a near memory cache for PMem.

Figure 2 - Memory Hierarchy With PMem In Memory Mode — Figure 2 - Memory Hierarchy with PMem in Memory Mode

PMem enables you to build systems with large memory less expensive than DRAM-only systems. However, PMem has a relatively higher latency and a lower bandwidth. Using the DRAM as a cache allows the system to compensate for these deficits.PMem modules are available in 128GB, 256GB and 512GB capacities.

The performance of a workload on PMem memory mode is dependent on the memory access characteristics of the workload. There are scenarios where PMem can improve performance or provide a good tradeoff between system performance and its total cost of ownership (TCO). Some of these scenarios are:

Workload performance is limited by the size of the addressable memory.
Workload scalability depends on the availability of additional addressable memory.
Workload performance is negligibly impacted when DRAM is replaced with less expensive PMem.

PMem Enabling Workflow

Use the following workflow to determine whether your workload can benefit from PMem in Memory Mode:

In the general workflow you need to perform the following steps:

1. Run a workload on your DRAM-only system.

2. Collect the metrics required for analysis.

3. Use the collected metrics to decide if the workload fits one or more of the scenarios where PMem in the memory mode can provide benefits.

NOTE: If the workload does not match any scenario, consult the Intel team.

4. Evaluate the performance characteristics of a system with PMem in the memory mode for each scenario that matches with the workload to determine the optimal system configuration.

NOTE: This guide focuses on steps 1-3 only. To complete step 4, contact the Intel team.

Caution: To determine optimal system configuration, you must characterize workloads for a target system in PMem in the memory mode. Without characterization, applications with consistent data retrieval patterns (that the memory controller can predict) are likely to have a higher cache hit rate as well as performance close to DRAM-only configurations. Similarly, workloads with highly random data access over a wide address range can show some performance difference versus the DRAM alone.

Structure of a Scenario

This guide describes typical workload scenarios that can benefit from PMem in the memory mode. Each scenario contains:

Structure of Scenario
Section	Description
Description	A description of the scenario and relevant workloads.
Potential Benefit	The potential benefit obtained by introducing PMem in memory mode in this scenario.
Requierd Metrics	Applicable performance metrics that can help evaluate whether a workload fits the scenario.
Evaluation	Instructions to evaluate the workload for the scenario.
Recommendations for Memory Mode Evaluation	High-level guidelines for an optimal PMem memory mode configuration.

Scenario Evaluation Guidelines

Each scenario has an evaluation table with each row presenting a criterion. Review the criterion in the table, mark Yes if your workload meets the criterion, or No otherwise.

Color Scale

Color Scale
Value in range	Evaluation	Result
Green	Satisfied criterion.	YES
Yellow	Partially satisfied criterion.	YES (Must complete memory mode evaluation to confirm)
Red	Does not satisfy the criterion.	NO

NOTE: All conditions should be met throughout the entire program execution. Pay attention to “spikes” that cross the recommended thresholds for each metrics. Spike in metric values that violate the recommended threshold may lead to corresponding spikes in workload performance drops. These workload performance drop spikes might lead to violation of SLA.

Scenario for Memory Capacity Bound Applications - Single Node

In this scenario, your workload performance is taking a hit due to the limited size of addressable memory. Modifying your memory configuration to get additional memory capacity has the potential to improve workload performance.

The application is using all the available memory while showing a high disk or paging activity. The application is not bound by memory bandwidth or CPU saturation. This is an indicator that the application is bound by the capacity of the memory. Consider using the Memory Mode to increase memory capacity.

This scenario is less common in datacenter workloads as server admins usually make sure that the application has access to enough addressable memory.

Potential Benefit

Increase the performance of a single node.

Required Metrics

System Configuration: Total Memory Size.
Memory utilization.
CPU utilization.
CPU utilization in Kernel Mode.
Major page faults.
Storage throughput/IOPS.
Maximum achievable DRAM bandwidth.
Actual DRAM throughput (read and write).
CPU I/O Wait (see Appendix B).

Evaluation

Memory Capacity Bound Analysis Steps
Criterion	Condition(s)	Result
Workload is using all available memory and excess memory paged to the disk	Total memory utilization is close to 100% in considerable parts of the workload execution. The workload is causing major page faults. A high CPU utilization in Kernel Mode (more than10%) or disk I/O activity in the same execution windows where the application is using 100% of available memory.	YES / NO
Memory paging negatively affects the CPU utilization and a memory throughput	Decreased or low CPU utilization during the execution windows where the application is using 100% of available memory and disk I/O is present. DRAM memory throughput is low compared to the maximum achievable DRAM bandwidth during the same execution windows.	YES / NO

If all steps are marked as Yes, your workload performance is bound by the capacity of available memory and will likely perform better if you increase the amount of available memory. Since the workload performance is not sensitive to the memory throughput, you can increase addressable memory using PMem Memory Mode to avoid swapping and achieve additional performance at an affordable cost.

If any of the above steps are marked as No, your workload performance is not strictly bound by the memory capacity. Therefore, increasing memory capacity may not necessarily improve performance. In this case, consider analyzing for scenarios 3 and 4.

Recommendations for the Memory Mode Evaluation

The memory mode system should be designed to satisfy the following criteria:

The size of PMem should be larger than or equal to the addressable memory requirements of the workload which means that swapping to disk should not happen.
The size of the near memory cache (DRAM) should be larger than the working set size of the workload.

Examples

Detecting Workloads Performance Bound by Memory Capacity
Criterion	Condition(s)	Result
Workload is using all available memory and excess memory paged to the disk	Total memory utilization is close to 100% in considerable parts of the workload execution: The workload is causing major page faults: A high CPU utilization in Kernel Mode (more than 10%) or disk I/O activity in the same execution windows where the application is using 100% of available memory:	YES
Memory paging negatively affects the CPU utilization and the memory throughput	Low CPU utilization during the execution windows where the application is using 100% of available memory and disk I/O is present: DRAM memory throughput is low compared to the maximum achievable DRAM bandwidth during the same execution windows:	YES

The workload where performance is bound by the memory capacity will likely benefit from the large and affordable memory capacity of PMem in the memory mode.

Workload Performance Is Not Bound by Memory Capacity
Criterion	Condition(s)	Result
Workload is using all available memory and excess memory paged to the disk	Total memory utilization is close to 100% in considerable parts of the workload execution: Workload is causing major page faults: A high CPU utilization in Kernel Mode (more than 10%) or disk I/O activity in the same execution windows where the application is using 100% of available memory:	YES
Memory paging negatively affects the CPU utilization and the memory throughput	CPU utilization is mostly 100% except in few small execution regions:	NO

The high CPU utilization implies that the workload performance is not affected by the swapping going on due to the memory capacity limitation.

Scenario For Memory Capacity Bound Distributed Applications

In this scenario, you have an application that needs to process a large amount of data. The workload is distrusted across multiple worker nodes in a cluster such that the amount of data to be processed by each worker node fits entirely within the physical memory of the node. The number of nodes required for the workload depends on the total data and the size of memory available per node. There is a potential to decrease the number of nodes required for the workload by increasing the amount of per-node memory using Pmem in the memory mode. The use case finds out if the given workload is suitable for decreasing the number of required nodes by increasing the size of the workload chunk assigned to each node.