A Hadoop cluster might have many jobs running on it at any given time, which makes it extremely important to monitor the cluster and make sure it is running well. Hadoop clusters are multi-tenant clusters, which means that multiple users with different use cases and data sizes run jobs on them. How do we make sure that each user or job gets the resources it is configured for on the cluster?
In this chapter, we will look at the checks related to MapReduce and its related components. The following topics will be covered:
MapReduce checks
JobTracker and related health checks
CPU utilization of MapReduce jobs
Memory utilization of MapReduce jobs
YARN component checks
Total cluster capacity in terms of memory and CPU
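As a taste of the capacity checks covered later, the YARN ResourceManager exposes cluster-wide totals over its REST API (the `/ws/v1/cluster/metrics` endpoint). The sketch below, with an illustrative response payload (the field names follow the YARN REST API; the values are made up), computes memory and CPU utilization as a percentage of total cluster capacity:

```python
import json

# Illustrative response shaped like the ResourceManager's
# /ws/v1/cluster/metrics endpoint; values are made up.
sample = json.loads("""
{"clusterMetrics": {
  "allocatedMB": 49152,
  "totalMB": 98304,
  "allocatedVirtualCores": 12,
  "totalVirtualCores": 32,
  "appsRunning": 4
}}
""")

def utilization(metrics):
    """Return memory and CPU utilization as percentages of capacity."""
    m = metrics["clusterMetrics"]
    return {
        "memory_pct": 100.0 * m["allocatedMB"] / m["totalMB"],
        "cpu_pct": 100.0 * m["allocatedVirtualCores"] / m["totalVirtualCores"],
    }

print(utilization(sample))
# → {'memory_pct': 50.0, 'cpu_pct': 37.5}
```

In a live cluster, the same dictionary would come from an HTTP GET against the ResourceManager's web address (port 8088 by default); the computation itself is unchanged.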