Book Image

Building Hadoop Clusters [Video]

By : Sean Mikha
Book Image

Building Hadoop Clusters [Video]

By: Sean Mikha

Overview of this book

<p>Hadoop is an Apache top-level project that allows the distributed processing of large data sets across clusters of computers using simple programming models. It allows you to deliver a highly available service on top of a cluster of computers, each of which may be prone to failures. While Big Data and Hadoop have seen a massive surge in popularity over the last few years, many companies still struggle with trying to set up their own computing clusters.<br /><br />This video series will turn you from a faltering first-timer into a Hadoop pro through clear, concise descriptions that are easy to follow.<br /><br />We'll begin this course with an overview of Amazon's cloud service and its use. We'll then deploy Linux compute instances and you'll see how to connect your client machine to Linux hosts and configure your systems to run Hadoop. Finally, you'll install Hadoop, download data, and examine how to run a query.<br /><br />This video series will go beyond just Hadoop; it will cover everything you need to get your own clusters up and running. You will learn how to make network configuration changes as well as modify Linux services. After you've installed Hadoop, we'll then go over installing HUE—Hadoop's UI. Using HUE, you will learn how to download data to your Hadoop clusters, move it to HDFS, and finally query that data with Hive.<br /><br />Learn everything you need to deploy Hadoop clusters to the Cloud through these videos. You'll grasp all you need to know about handling large data sets over multiple nodes.</p> <h1>Style and Approach</h1> <p>Packt video courses are designed to cover the breadth of the topic in short, hands-on, task-based videos. Each course is divided into short manageable sections, so you can watch the whole thing or jump to the bit you need. The focus is on practical instructions and screencasts showing you how to get the job done.</p> <p>Packed with explanations for everything you'll need to set up, including simple systematic examples that will get you started with ease</p>
Table of Contents (8 chapters)
Chapter 4
Setting Up Network Connectivity and Access for Hadoop Clusters
Content Locked
Section 3
Gathering Network Details and Setting Up the HOSTS File
To install Hadoop properly, we will need to configure the network details on each node. - Determining how to obtain IP and FQDN information - Gathering details for each node in the cluster - Setting up the HOSTS file on each node in the cluster