Book Image

Cloudera Administration Handbook

By : Menon
Book Image

Cloudera Administration Handbook

By: Menon

Overview of this book

An easy-to-follow Apache Hadoop administrator’s guide filled with practical screenshots and explanations for each step and configuration. This book is great for administrators interested in setting up and managing a large Hadoop cluster. If you are an administrator, or want to be an administrator, and you are ready to build and maintain a production-level cluster running CDH5, then this book is for you.
Table of Contents (11 chapters)
10
Index

Installing CDH

With a good background knowledge of CDH and its components, let's go ahead and install CDH on a cluster. The remainder of this chapter is going to be hands-on, and we will try to cover everything that is needed to get a fully functional cluster that runs CDH5.

CDH can be installed using one of the following two methods:

  • Installation using the operating system's package manager (yum/rpm)
  • Installation using Cloudera Manager

In this section, we will cover installation using the operating system's package manager. Installation of CDH using Cloudera Manager will be covered in Chapter 5, Using Cloudera Manager.

For the installation, we will be using four servers that run CentOS 6.4 (64-bit) as the operating system.

The following diagram shows a simple four-node Hadoop cluster that runs MapReduce Version 1 (MRv1):

Installing CDH

By performing the following installation instructions, we will try to configure our cluster to look like the one shown in the preceding diagram:

  1. The first step...