Hadoop Administration and Cluster Management [Video]

By: Gurmukh Singh

Overview of this book

<p><span id="description" class="sugar_field">Hadoop is one of the most popular Big Data solutions for reliable and scalable distributed computing and storage. Administering your Hadoop cluster well is the key to exploiting its rich features and getting the most out of it. This course focuses on planning and deploying your cluster, monitoring its performance, and maintaining the health of your organization’s cluster infrastructure. It will help you understand the basics of Hadoop administration, with comprehensive coverage of various administrative tasks using the popular Apache Hadoop distribution.</span></p> <p>This video course starts by installing Apache Hadoop on a cluster and configuring the required services. You will also learn various cluster operations, such as validation and expanding and shrinking Hadoop services.</p> <p><span id="description" class="sugar_field">You will then move on to gain a better understanding of administrative tasks such as planning your cluster, monitoring, logging, security, troubleshooting, and best practices. Techniques to keep your Hadoop clusters highly available and reliable are also covered in this course. By the end of this course, you will have a thorough understanding of the concepts related to Hadoop administration.</span></p> <h2><span class="sugar_field">Style and Approach</span></h2> <p><span class="sugar_field"><span id="trade_selling_points_c" class="sugar_field">This course takes a practical approach and covers solutions to real-life problems that Hadoop administrators might encounter while administering a Hadoop cluster.</span></span></p>
Table of Contents (10 chapters)
Chapter 3
Hadoop Cluster Installation
Section 1
Planning Hadoop Services Placement
The aim of this video is to plan the layout of the various Hadoop services to improve availability and performance, and to achieve a balanced distribution of compute and memory across the cluster.
- Distribute the services across nodes
- Understand how to ensure that the failure of a single node does not cause service disruption
- Decide which components you need
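The placement goals above can be illustrated with a minimal sketch. It is not taken from the course: the host names and the layout in `PLACEMENT` are hypothetical, and the helper functions are invented here only to show how a proposed plan might be checked for services that would disappear if one node failed.

```python
from collections import defaultdict

# Hypothetical placement plan (an assumption for illustration only):
# host name -> Hadoop daemons planned for that host.
PLACEMENT = {
    "master1": ["NameNode", "JournalNode", "ZooKeeper"],
    "master2": ["NameNode", "JournalNode", "ZooKeeper", "ResourceManager"],
    "master3": ["ResourceManager", "JournalNode", "ZooKeeper"],
    "worker1": ["DataNode", "NodeManager"],
    "worker2": ["DataNode", "NodeManager"],
    "worker3": ["DataNode", "NodeManager"],
}


def hosts_by_service(placement):
    """Invert the plan: service name -> set of hosts that run it."""
    result = defaultdict(set)
    for host, services in placement.items():
        for service in services:
            result[service].add(host)
    return dict(result)


def single_points_of_failure(placement):
    """Return the services that are planned on exactly one host."""
    return sorted(
        service
        for service, hosts in hosts_by_service(placement).items()
        if len(hosts) == 1
    )


def services_lost_if_host_fails(placement, failed_host):
    """Return the services with no remaining instance after one host fails."""
    remaining = {h: s for h, s in placement.items() if h != failed_host}
    before = set(hosts_by_service(placement))
    after = set(hosts_by_service(remaining))
    return sorted(before - after)


if __name__ == "__main__":
    print("Single points of failure:", single_points_of_failure(PLACEMENT))
    for host in PLACEMENT:
        print(host, "->", services_lost_if_host_fails(PLACEMENT, host))
```

A check like this only covers the "no single node takes a service down" goal. Balancing compute and memory would need per-host capacity figures, which are not given here.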