Book Image

Learning HBase

By : Shashwat Shriparv
Book Image

Learning HBase

By: Shashwat Shriparv

Overview of this book

Table of Contents (18 chapters)
Learning HBase
Credits
About the Author
Acknowledgments
About the Reviewers
www.PacktPub.com
Preface
Index

HBase backup and restore techniques


Now, we will see the HBase backup and restore methods as it is very important for any technology to be able to restore and create a backup of the data to avoid data loss.

We will now discuss these methods in detail. There are two kinds of HBase methodologies in general. The following are HBase backup methods that we can choose according to our requirement and setup:

  • Offline backup / full-shutdown backup

    • Use the hadoop distcp command

  • Online backup

    • Snapshots

    • Replication

    • Export

    • CopyTable

    • HTable API

    • Offline backup of HDFS data

    • Backup using a Mozilla tool

    • HDFS replication

Let's get started with offline backup.

Offline backup / full-shutdown backup

This method includes full-shutdown backup of HBase on a file system, using the distcp command that runs the MapReduce task. It copies the parallel data from one location to another, which can be a backup location on the same cluster or another backup cluster. This method is not recommended on a live cluster or a cluster that needs...