Cassandra High Performance Cookbook

By: Edward Capriolo

Overview of this book

Apache Cassandra is a fault-tolerant, distributed data store that offers linear scalability, making it a suitable storage platform for large, high-volume websites.

This book provides detailed recipes that describe how to use the features of Cassandra and improve its performance. Recipes cover topics ranging from setting up Cassandra for the first time to complex multiple data center installations. The recipe format presents the information in a concise, actionable form.

The book describes in detail how features of Cassandra can be tuned and what the possible effects of tuning can be. Recipes include how to access data stored in Cassandra and how to use third-party tools to help you out. The book also describes how to monitor Cassandra and do capacity planning to ensure it is performing at a high level. Towards the end, it takes you through the use of libraries and third-party applications with Cassandra, and Cassandra integration with Hadoop.
Table of Contents (20 chapters)
Cassandra High Performance Cookbook
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

About the Reviewers

Vineet Daniel is a Systems Architect who has worked at various startups and managed high-traffic web applications. He has over eight years of experience in software development, managing servers and cloud infrastructure, and leading teams. This experience has given him a solid grounding in optimization, high availability, and scalability. He loves to use Linux commands and has a never-ending appetite for penetration testing. Find him at www.vineetdaniel.me or @vineetdaniel.

Matthew Tovbin received a BSc degree in computer science from the Hadassah Academic College, Jerusalem, Israel, in 2005. He was a software engineer in the Intelligence Corps of the Israel Defense Forces (IDF) from 2005 to 2008, working on a variety of military IT systems, and later a software engineer and team leader at a web-based startup named AnyClip, working to make the dream of "Find any moment from any film, instantly" come true.

Currently, Matthew is an MSc student in the Department of Computer Science, Hadassah Academic College, and a data infrastructure software engineer at Conduit.

His experience covers the architecture, design, and development of high-performance distributed web and data analysis systems. His background includes a wide range of programming languages (including Java and C#), search engines (including Lucene), databases, and NoSQL distributed data stores.

Matthew's research interests include search engines, distributed computing, image processing, computer vision, and machine learning.

Jing Song has been working in the IT industry as an engineer for more than 12 years since graduating. She enjoys solving problems and learning about new technologies in the computer science space. Her interests and experience span multiple tiers, from web frontend GUI to middleware, and from middleware to backend SQL RDBMS and NoSQL data storage. In the last five years, she has mainly focused on enterprise application performance and cloud computing.

Jing currently works for Apple as a tech lead with the Enterprise Technology Service group, leading various Java applications from design, to implementation, to performance tuning. She was one of the contributors to the internal private cloud application last year. Her team has built proofs of concept (POCs) with most of the NoSQL candidates, for example, Cassandra, CouchDB, MongoDB, Redis, Voldemort, and Memcached, as well as EC2, EMC Atmos, and so on.