Now we will look at the HBase backup and restore methods. Any technology that stores data must be able to back it up and restore it in order to avoid data loss.
We will now discuss these methods in detail. In general, HBase backups fall into two kinds: offline and online. The following are the HBase backup methods, which we can choose from according to our requirements and setup:
Offline backup / full-shutdown backup
Use the hadoop distcp command
Online backup
Snapshots
Replication
Export
CopyTable
HTable API
Offline backup of HDFS data
Backup using a Mozilla tool
HDFS replication
Let's get started with offline backup.
This method involves a full shutdown of HBase followed by a backup of its data on the file system using the distcp command, which runs a MapReduce job. It copies data in parallel from one location to another, which can be a backup location on the same cluster or a separate backup cluster. This method is not recommended on a live cluster or a cluster that needs...
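As a rough illustration of the offline approach, the following minimal sketch builds the hadoop distcp command line and, optionally, runs it. It assumes HBase has already been shut down and that the hadoop binary is on the PATH. The helper names, host names, ports, and paths are hypothetical placeholders, not values taken from any real setup.

```python
import subprocess


def build_distcp_command(source_uri, backup_uri):
    """Return the argument list for copying source_uri to backup_uri with DistCp."""
    # Basic form of the tool: hadoop distcp <source> <destination>
    return ["hadoop", "distcp", source_uri, backup_uri]


def run_offline_backup(source_uri, backup_uri, dry_run=True):
    """Print the DistCp command (dry run) or execute it.

    Assumption: HBase is fully stopped before this is called, as the
    offline method requires.
    """
    cmd = build_distcp_command(source_uri, backup_uri)
    if dry_run:
        # Only show what would be executed; nothing is copied.
        print(" ".join(cmd))
    else:
        # Raises CalledProcessError if the copy job fails.
        subprocess.run(cmd, check=True)
    return cmd


if __name__ == "__main__":
    # Hypothetical source and backup locations.
    run_offline_backup(
        "hdfs://namenode:8020/hbase",
        "hdfs://backup-namenode:8020/hbase-backup",
    )
```

The dry-run default makes it possible to check the generated command before any data is copied; passing dry_run=False starts the actual copy job.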