Do you want to unlock the power of private cloud big data analytics? This solution brief and reference architecture guide describes how a high-performance private cloud big data analytics solution from Cloudera and Intel can transform complex data into clear, actionable insights.
It is well understood that enterprises can extract business value from the large volume of data they generate. The difficulty lies in integrating isolated silos of data throughout the business and managing data efficiently. One common hurdle is competing business units that control access to the data but refuse to share it. Also, internal policies that were intended to protect the company and its customers can hamper data analysis and create a significant burden to extracting business value and improving decision making. The velocity of the data flood adds additional stress on business units, IT, and executive management, who must work through the complexities created in a data-driven world.
But don’t despair—a collaboration between Intel and Cloudera has created a big data analytics platform specifically designed for large-scale on-premises workloads. Cloudera Data Platform (CDP) Private Cloud powers on-premises, data-driven decision making by easily, quickly, and safely connecting and securing the business’s entire data lifecycle. This big data analytics platform helps business leaders modernize their data center by streamlining data management and workload orchestration. Separation of compute and storage leads to improved flexibility and efficiency. With CDP Private Cloud, enterprises can migrate to a container-based environment and take advantage of the agility and scalability of containers.
With the Intel® architecture underlying CDP Private Cloud, we provide the power that big data analytics demand. Intel and Cloudera collaborated to improve compute performance, storage efficiency, artificial intelligence (AI) acceleration, and more. The result is a private cloud data platform built to meet today’s big data analytics needs that can scale to meet your business needs today and into the future. Tests show that a modern version of CDP Private Cloud running on the latest Intel® hardware can improve the data analysis performance of several aspects of the CDP Private Cloud system. For example, upgrading to CDP Private Cloud from a legacy distribution, running on 2nd Generation Intel® Xeon® Scalable processors, can improve throughput by up to 2.23x. Using Intel® Optane™ persistent memory (PMem) for Apache Kudu block cache can provide up to a 6.3x increase in throughput and a 13.38x decrease in latency, compared to an all-DRAM-based configuration (depending on dataset size).
This document provides a business-level overview of CDP Private Cloud, describes a reference solution for deployment, and highlights the platform’s performance and scalability. And if you are already using a legacy distribution of a Cloudera product, this document also describes best practices for migrating to the latest distribution.