HBase supports writing MapReduce jobs for processing data from the HBase table using the org.apache.hadoop.hbase.mapreduce
package, which has lots of methods for the same. This also provides HBase MapReduce input and output formats that can be utilized in MapReduce jobs, a table indexing MapReduce job, and many other MapReduce utilities. It utilizes Hadoop MapReduce framework to do so.
The following is a list of MapReduce classes provided by HBase:
Import
: This utility is used to import sequence file from HDFS, which is exported by theexport
command.ImportTsv
: This utility is used to import the Tab-separated Value (TSV) file using the MapReduce task.CellCounter
: This counts the number of cells in the HBase table using the MapReduce job.CopyTable
: This is used to copy table from one HBase cluster to another HBase cluster. The destination can be the same cluster or another cluster.Driver
: This is theDriver
class for MapReduce jobs in HBase.