Book Image

Cassandra High Performance Cookbook

By : Edward Capriolo
Book Image

Cassandra High Performance Cookbook

By: Edward Capriolo

Overview of this book

<p>Apache Cassandra is a fault-tolerant, distributed data store which offers linear scalability allowing it to be a storage platform for large high volume websites. <br /><br />This book provides detailed recipes that describe how to use the features of Cassandra and improve its performance. Recipes cover topics ranging from setting up Cassandra for the first time to complex multiple data center installations. The recipe format presents the information in a concise actionable form.<br /><br />The book describes in detail how features of Cassandra can be tuned and what the possible effects of tuning can be. Recipes include how to access data stored in Cassandra and use third party tools to help you out. The book also describes how to monitor and do capacity planning to ensure it is performing at a high level. Towards the end, it takes you through the use of libraries and third party applications with Cassandra and Cassandra integration with Hadoop.</p>
Table of Contents (20 chapters)
Cassandra High Performance Cookbook
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Nodetool Repair: When to use anti-entropy repair


Anti-Entropy Repair, also called Anti-Entropy Service or AES, is a process where nodes compare their data and ensure data is replicated properly and up-to-date. This recipe explains how to do an anti-entropy repair and the conditions for which it should be ran.

How to do it...

Use nodetool repair against a node to be repaired.

$<cassandra_home>/bin/nodetool -h 127.0.0.1 -p 8001 repair

How it works...

Anti-entropy repair is intensive for disk, CPU, and network resources. It is optimal to run this at times of low traffic. It can create excess copies of data on your nodes. If the storage on nodes grows significantly as a result of AES, use nodetool compact. Major compaction should remove duplicate data that resulted from the repair.

There's more...

Anti-entropy-repair should be run on a schedule that matches equal to or lower than the setting for gc_grace_seconds. There are also other situations when this operation should be run.

Raising the...