Book Image

Apache Cassandra Essentials

By : Nitin Padalia
Book Image

Apache Cassandra Essentials

By: Nitin Padalia

Overview of this book

Apache Cassandra Essentials takes you step-by-step from from the basics of installation to advanced installation options and database design techniques. It gives you all the information you need to effectively design a well distributed and high performance database. You’ll get to know about the steps that are performed by a Cassandra node when you execute a read/write query, which is essential to properly maintain of a Cassandra cluster and to debug any issues. Next, you’ll discover how to integrate a Cassandra driver in your applications and perform read/write operations. Finally, you’ll learn about the various tools provided by Cassandra for serviceability aspects such as logging, metrics, backup, and recovery.
Table of Contents (14 chapters)
Apache Cassandra Essentials
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Replication


Cassandra is able to automatically keep multiple copies of data on multiple nodes, hence providing no single point of failure. The number of copies it will create is configurable and determined by a configuration option called a replication_factor. While defining a keyspace, we mention its replication_factor function. This configuration option is applied to all column families of that keyspace. Let's suppose that column family users, which we discussed in the section Data distribution, are part of the keyspace called cassandrademodb; and replication_factor of the keyspace is 3, then every row of the column family users will have three replicas copied to three different nodes. All replicas are the same and there is no primary or secondary replica. First, replica placement is decided by the partitioner and subsequent replicas are placed on consecutives nodes in clockwise order. Cassandra tries not to have two replicas on the same rack while doing so for rack-aware strategies, using...