Apache Cassandra Essentials

By: Nitin Padalia

Overview of this book

Apache Cassandra Essentials takes you step by step from the basics of installation to advanced installation options and database design techniques. It gives you all the information you need to design a well-distributed, high-performance database. You'll learn about the steps a Cassandra node performs when you execute a read/write query, which is essential for properly maintaining a Cassandra cluster and debugging any issues. Next, you'll discover how to integrate a Cassandra driver into your applications and perform read/write operations. Finally, you'll learn about the various tools Cassandra provides for serviceability aspects such as logging, metrics, backup, and recovery.
Table of Contents (14 chapters)
Apache Cassandra Essentials
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Summary


Cassandra keyspaces and column families differ from the databases and tables of a regular SQL system. All the data for a given partition key lies completely on a node, and a partition can hold multiple logical rows, each uniquely identified by the primary key. A column family with one or more clustering columns in its primary key can have multiple logical rows per partition, which is why such column families are said to have wide rows. Within a partition, rows are sorted by the clustering columns. When reading records, we can search only by the partition key or the clustering columns. Columns with a secondary index can also be used to filter the results. By default, Cassandra refuses to run queries that involve filtering. Lightweight transactions (LWTs) can help to serialize concurrent operations, at some cost in performance. Multiple related DML statements can be grouped together in a BATCH statement. Default batch statements are atomic and are costlier in terms of performance. The UNLOGGED batches could be used if atomicity isn't required and performance is more...
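
The wide-row layout described above can be pictured with a small in-memory model. This is only a toy sketch in Python, not how Cassandra stores data: the class `WideRowTable`, its methods, and the `allow_filtering` flag are hypothetical names chosen for illustration. It assumes a table with a single partition key and a single clustering column.

```python
from bisect import insort
from collections import defaultdict


class WideRowTable:
    """Toy model (hypothetical, for illustration only) of a table with
    one partition key and one clustering column."""

    def __init__(self):
        # partition key -> list of (clustering value, data),
        # kept sorted by clustering value
        self._partitions = defaultdict(list)

    def insert(self, partition_key, clustering_value, data):
        rows = self._partitions[partition_key]
        # A write with the same full primary key replaces the existing row.
        rows[:] = [r for r in rows if r[0] != clustering_value]
        insort(rows, (clustering_value, data))

    def select(self, partition_key=None, allow_filtering=False):
        if partition_key is not None:
            # One partition only: rows come back in clustering order.
            return list(self._partitions.get(partition_key, []))
        if not allow_filtering:
            # Mimics the default refusal to run a query that would have
            # to scan and filter every partition.
            raise ValueError("query needs a partition key or allow_filtering=True")
        return [row for rows in self._partitions.values() for row in rows]


table = WideRowTable()
table.insert("alice", 3, "third")
table.insert("alice", 1, "first")
table.insert("bob", 2, "second")
rows_for_alice = table.select("alice")  # sorted by the clustering value
```

Each key of `_partitions` stands for one partition, and the sorted list under it stands for the logical rows of that partition. Calling `select()` with no partition key raises an error unless `allow_filtering=True` is passed, which loosely mirrors the default restriction on filtering queries.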