Book Image

Learning HBase

By : Shashwat Shriparv
Book Image

Learning HBase

By: Shashwat Shriparv

Overview of this book

Table of Contents (18 chapters)
Learning HBase
Credits
About the Author
Acknowledgments
About the Reviewers
www.PacktPub.com
Preface
Index

Hadoop optimization


If we have an optimized Hadoop cluster, a lot of problems are easily solved for other Hadoop ecosystem components on the cluster, for example HBase. So, now let's see some of the factors using which we can optimize Hadoop.

General optimization tips

These are some general optimization tips that will help us to optimize Hadoop:

  • Create a dedicated Hadoop/HBase user to run the daemons.

  • Try to use SSD for the NameNode metadata.

  • The NameNode metadata must be backed up periodically; it can be either on an hourly or daily basis. If it contains very valuable data, it must be backed up every 5 to 10 minutes. We can write crons to copy with the metadata directory.

  • We must have multiple metadata directories for NameNode, which can be specified using the parameters dfs.name.dir or dfs.namenode.name.dir; one can be located at local disk and another at network mount location (NFS). This provides redundancy of metadata, and robustness, in case of failure.

  • Set dfs.namenode.name.dir.restore to...