Book Image

Hadoop MapReduce v2 Cookbook - Second Edition: RAW

Book Image

Hadoop MapReduce v2 Cookbook - Second Edition: RAW

Overview of this book

Table of Contents (19 chapters)
Hadoop MapReduce v2 Cookbook Second Edition
Credits
About the Author
Acknowledgments
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Chapter 3. Hadoop Essentials – Configurations, Unit Tests, and Other APIs

In this chapter, we will cover:

  • Optimizing Hadoop YARN and MapReduce configurations for cluster deployments

  • Shared user Hadoop clusters – using Fair and Capacity schedulers

  • Setting classpath precedence to user-provided JARs

  • Speculative execution of straggling tasks

  • Unit testing Hadoop MapReduce applications using MRUnit

  • Integration testing Hadoop MapReduce applications using MiniYarnCluster

  • Adding a new DataNode

  • Decommissioning DataNodes

  • Using multiple disks/volumes and limiting HDFS disk usage

  • Setting the HDFS block size

  • Setting the file replication factor

  • Using the HDFS Java API