As we have seen in the previous section, Hadoop is open stack community distribution with an integration of multiple components or interface layers. Many commercial vendors have built on the basic Hadoop platform and customized it to offer in the market, both as a hardware product platform and a service. We will discuss a few of the popular options as follows:
- Cloudera CDH Hadoop distribution
- Hortonworks Data Platform (HDP)
- MapR Hadoop distribution
- Amazon Elastic MapReduce
- IBM open platform
- Microsoft Azure's HDInsight--cloud-based Hadoop distribution
- Pivotal big data suite
The standard framework of Hadoop open source, with different layers and components, is presented as follows:
The Cloudera proprietary distribution framework is built on the Hadoop open source code, customizing the services as shown in the following topics. We will discuss the various components that make it a leading enterprise product.