Alan Gates describes how the Apache Pig* high-level data flow programming language and execution framework makes it easy to create Apache MapReduce* applications. This overview by an expert from the Apache Hadoop* open-source community covers how the Pig platform fits into the Apache Hadoop framework; the value of Pig Latin, an easy-to-learn programming language that focuses on data flow; the difference between the Pig platform and the Apache Hive* data warehouse infrastructure; Pig limitations; and where development of the platform is headed. Part of the Intel® IT Center’s Apache Hadoop Community Spotlight series. Also listen to the podcast of the interview.
How businesses can use its versatility and scalability to mine answers through object relationships.
Introducing an automation tool for rapidly preparing data for analysis so scientists can speed mining.
Apache HDFS* overview.
Apache Pig* overview.
Linda Feldt highlights big data research—video