Book Image

Apache Cassandra Essentials

By : Nitin Padalia
Book Image

Apache Cassandra Essentials

By: Nitin Padalia

Overview of this book

Apache Cassandra Essentials takes you step-by-step from from the basics of installation to advanced installation options and database design techniques. It gives you all the information you need to effectively design a well distributed and high performance database. You’ll get to know about the steps that are performed by a Cassandra node when you execute a read/write query, which is essential to properly maintain of a Cassandra cluster and to debug any issues. Next, you’ll discover how to integrate a Cassandra driver in your applications and perform read/write operations. Finally, you’ll learn about the various tools provided by Cassandra for serviceability aspects such as logging, metrics, backup, and recovery.
Table of Contents (14 chapters)
Apache Cassandra Essentials
About the Author
About the Reviewers

Taking backup of a Casandra cluster

Cassandra takes backup of data in the form of snapshots of SSTables. While a node is online, we can take snapshots of data stored in data files of the Cassandra data directory. While taking snapshots, we can specify whether we want to take a snapshot of all data/keyspaces, a specific keyspace, or a specific column family. These snapshots can then be moved to another location for backup, or we can leave them at the default location. Snapshots are taken node wide, and all data is contained in the snapshot that is written before a snapshot is triggered. A node's snapshot may not be consistent with another replica node. However, when snapshots of all nodes are restored, data eventually becomes consistent.

Once we have taken snapshot of all nodes, we can configure Cassandra to take incremental snapshots. Incremental backup will start an automatic snapshot trigger whenever an SSTable is flushed.


A snapshot comprises only the data of the column family and doesn...