Learning HBase

By: Shashwat Shriparv

MapReduce and HBase


HBase supports MapReduce jobs that process data stored in HBase tables through the org.apache.hadoop.hbase.mapreduce package. The package provides HBase input and output formats that can be used in MapReduce jobs, a table-indexing MapReduce job, and a number of other MapReduce utilities, all built on top of the Hadoop MapReduce framework.
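To make the processing model concrete without requiring a cluster, here is a minimal plain-Java sketch of what a table-scanning MapReduce job does: a map step sees one row at a time and emits a key per cell, and a reduce step sums the emitted keys. It uses no HBase classes. The class name CellCountSketch, the helper row, and the sample table are hypothetical and exist only for illustration. In a real job the mapper would typically extend TableMapper from the org.apache.hadoop.hbase.mapreduce package and read its rows from a table scan instead of an in-memory map.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class CellCountSketch {

    /** Map step: emit one column-family name for every cell in the row. */
    public static List<String> map(String rowKey, Map<String, String> cells) {
        List<String> emitted = new ArrayList<>();
        for (String column : cells.keySet()) {
            // Assumption: every column name has the form "family:qualifier".
            emitted.add(column.substring(0, column.indexOf(':')));
        }
        return emitted;
    }

    /** Reduce step: turn the emitted keys into a cell count per column family. */
    public static Map<String, Long> reduce(List<String> emittedKeys) {
        Map<String, Long> counts = new TreeMap<>();
        for (String family : emittedKeys) {
            counts.merge(family, 1L, Long::sum);
        }
        return counts;
    }

    /** Runs both steps over an in-memory stand-in for a table scan. */
    public static Map<String, Long> countCells(Map<String, Map<String, String>> table) {
        List<String> shuffled = new ArrayList<>();
        for (Map.Entry<String, Map<String, String>> row : table.entrySet()) {
            shuffled.addAll(map(row.getKey(), row.getValue()));
        }
        return reduce(shuffled);
    }

    /** Hypothetical helper: builds one row from column/value pairs. */
    public static Map<String, String> row(String... columnValuePairs) {
        Map<String, String> cells = new TreeMap<>();
        for (int i = 0; i + 1 < columnValuePairs.length; i += 2) {
            cells.put(columnValuePairs[i], columnValuePairs[i + 1]);
        }
        return cells;
    }

    public static void main(String[] args) {
        Map<String, Map<String, String>> table = new TreeMap<>();
        table.put("row1", row("info:name", "a", "info:age", "1", "stats:hits", "7"));
        table.put("row2", row("info:name", "b"));
        System.out.println(countCells(table)); // prints {info=3, stats=1}
    }
}
```

The map and reduce steps are kept as separate methods so that the split mirrors the one a MapReduce framework imposes: the map step never sees more than one row, and the reduce step never sees the rows at all, only the emitted keys.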

The following is a list of MapReduce classes provided by HBase:

  • Import: This utility imports sequence files from HDFS that were written by the Export utility.

  • ImportTsv: This utility imports a tab-separated values (TSV) file using a MapReduce job.

  • CellCounter: This counts the number of cells in an HBase table using a MapReduce job.

  • CopyTable: This copies a table to a destination table, which can be on the same cluster or on a different HBase cluster.

  • Driver: This is the Driver class for MapReduce jobs in HBase.

  • Export: This exports or...