Preview is not available for this file. Please download the file.
Description
The Amazon EMR platform (formerly Amazon Elastic MapReduce) allows organizations to simplify running big data frameworks on AWS instances. Choosing an instance type with more powerful processors can speed up data analysis and help your bottom line. Using the TPC-DS 2.4 benchmark, we measured the EMR performance of several Amazon Web Services (AWS) EC2 cloud instances. We found that both medium-sized and larger M5 instances enabled by 2nd Gen Intel Xeon Scalable processors sped up EMR data analysis compared to same-size M5a instances with AMD EPYC processors.
Based on these test results across instance sizes, organizations seeking to speed EMR workloads (which include Apache Spark 3.1.1 and Hadoop 3.2.1) for quicker data analysis could gain insights faster by selecting AWS M5 instances featuring 2nd Gen Intel Xeon Scalable processors.