Hadoop Migration Success Story: How Intel IT Moved to Cloudera
Intel IT sees value in open source–based, big data processing using Apache Hadoop* software. Until recently, we used the Intel® Distribution for Apache Hadoop Software (IDH) to support our original three business intelligence (BI) big data use cases, and it delivered results worth millions of dollars to ...Intel.
From our experience with Intel's first internal big data platform with Apache Hadoop software, Intel IT identified new opportunities to reduce IT costs and extend our BI capabilities. Intel and Cloudera formed a strategic partnership. Intel IT adopted the Cloudera Distribution for Hadoop (CDH) and led the conversion from IDH to CDH. By converting to CDH quickly, the migration team showcased the ease of conversion while taking advantage of Cloudera’s enterprise-grade tools to improve performance, management, and ease of use for the key Hadoop components. Unique features and continuous development relieved Intel of the ongoing burden of maintaining and extending the software. CDH also has a wider customer base.
Intel IT formed a team to plan and manage the Hadoop migration. This paper explains the following six best practices and the benefits to be gained by any business IT group performing its own Hadoop migration:
• Do comparative evaluation in a sandbox environment.
• Define the implementation strategy.
• Upgrade the Hadoop version.
• Split the hardware environment.
• Create a preproduction-to-production pipeline.
• Rebalance the data.