Cassandra High Performance Cookbook

By: Edward Capriolo

Overview of this book

Apache Cassandra is a fault-tolerant, distributed data store that offers linear scalability, which makes it suitable as a storage platform for large, high-volume websites.

This book provides detailed recipes that describe how to use the features of Cassandra and improve its performance. Recipes cover topics ranging from setting up Cassandra for the first time to complex multiple data center installations. The recipe format presents the information in a concise, actionable form.

The book describes in detail how features of Cassandra can be tuned and what the possible effects of tuning can be. Recipes include how to access data stored in Cassandra and how to use third-party tools to help you out. The book also describes how to monitor a cluster and plan its capacity to ensure it performs at a high level. Towards the end, it takes you through the use of libraries and third-party applications with Cassandra, as well as Cassandra integration with Hadoop.

Choosing an operating system and distribution


The choice of operating system has a large effect on how well software performs, and several practical considerations should also guide the decision. This recipe lists the points to weigh before choosing an operating system and distribution for Cassandra.

How to do it...

Find an operating system and distribution that has:

  • A supported Java Virtual Machine

  • Java Native Access (JNA) support

  • File system hard links

  • Package support for Cassandra

  • A large community of users and developers
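Some of the items above can be probed on a candidate machine. The following is a minimal sketch in Python, not a tool that ships with Cassandra. The function names and report keys are hypothetical, and it only checks whether `java`, `rpm`, or `dpkg` can be found on the PATH and whether a hard link can be created in a given directory. It says nothing about whether a JVM is supported or how large the community is.

```python
import os
import platform
import shutil
import tempfile


def supports_hard_links(directory):
    """Return True if a hard link can be created inside `directory`."""
    with tempfile.TemporaryDirectory(dir=directory) as tmp:
        source = os.path.join(tmp, "probe")
        link = os.path.join(tmp, "probe.link")
        with open(source, "w") as handle:
            handle.write("x")
        try:
            os.link(source, link)
        except (OSError, NotImplementedError, AttributeError):
            # The platform or file system refused to create the link.
            return False
        return os.path.samefile(source, link)


def checklist(data_directory):
    """Collect a few facts related to the list above (keys are illustrative)."""
    return {
        "os": platform.system(),
        "architecture": platform.machine(),
        "java_on_path": shutil.which("java") is not None,
        "hard_links": supports_hard_links(data_directory),
        "rpm_or_deb_tooling": any(shutil.which(tool) for tool in ("rpm", "dpkg")),
    }


if __name__ == "__main__":
    # The temporary directory stands in for the intended data directory.
    for item, value in checklist(tempfile.gettempdir()).items():
        print(f"{item}: {value}")
```

Pointing `checklist` at the directory intended for Cassandra data, rather than the temporary directory, gives a more relevant hard-link result, because support depends on the file system that holds that directory.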

How it works...

Cassandra runs on Java. Oracle's JVM supports Linux, Solaris, and Windows. Other JVM implementations and ports exist for other operating systems. However, the licensing and maturity of these other JVMs vary.

Java Native Access (JNA) is a library that allows a Java application to call native system libraries directly. Several features of Cassandra use it to avoid swapping, create snapshot files, and optimize performance.
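To see why file system hard links matter for snapshot files, consider the sketch below. It is an illustration in Python using only the standard library, not Cassandra's own code path, and the file and directory names are hypothetical. A hard link gives the same on-disk data a second name without copying it, so the data stays readable through the snapshot name even after the original name is removed.

```python
import os
import tempfile


def snapshot(data_file, snapshot_dir):
    """Hard-link `data_file` into `snapshot_dir` and return the new path."""
    os.makedirs(snapshot_dir, exist_ok=True)
    target = os.path.join(snapshot_dir, os.path.basename(data_file))
    os.link(data_file, target)  # a second name for the same data, no copy
    return target


def demo():
    with tempfile.TemporaryDirectory() as root:
        data_file = os.path.join(root, "example-data.db")  # hypothetical name
        with open(data_file, "wb") as handle:
            handle.write(b"immutable data file contents")

        linked = snapshot(data_file, os.path.join(root, "snapshots"))
        same_file = os.path.samefile(data_file, linked)
        link_count = os.stat(data_file).st_nlink

        os.remove(data_file)  # drop the original name
        with open(linked, "rb") as handle:
            survived = handle.read()
        return same_file, link_count, survived


if __name__ == "__main__":
    print(demo())
```

Because no bytes are copied, a snapshot taken this way costs almost no time or extra disk space when it is created. On a file system without hard links, the same operation would need a full copy.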

Cassandra has support for RPM and DEB package formats. These packages make it easy to install and...