Alan Gates describes the incubator project HCatalog, an integration tool that enables data interoperability between the Apache Hadoop* framework and users of external systems. This overview by an expert from the Apache Hadoop open-source community covers how HCatalog works as a table management layer for the Hadoop* framework, how it integrates with enterprise data management tools outside the Hadoop framework through a REST interface, its statistical support, and next steps for the tool. Includes a description of the Apache Incubator* project process. Part of the Intel® IT Center’s Apache Hadoop Community Spotlight series. Also listen to the podcast of the interview.
How businesses can use its versatility and scalability to mine answers through object relationships.
Introducing an automation tool for rapidly preparing data for analysis so that scientists can speed up mining.
Apache HDFS* overview.
Apache Pig* overview.
Apache MapReduce overview.