Book Image

Mastering Apache Cassandra 3.x - Third Edition

By : Aaron Ploetz, Tejaswi Malepati, Nishant Neeraj
Book Image

Mastering Apache Cassandra 3.x - Third Edition

By: Aaron Ploetz, Tejaswi Malepati, Nishant Neeraj

Overview of this book

With ever-increasing rates of data creation, the demand for storing data fast and reliably becomes a need. Apache Cassandra is the perfect choice for building fault-tolerant and scalable databases. Mastering Apache Cassandra 3.x teaches you how to build and architect your clusters, configure and work with your nodes, and program in a high-throughput environment, helping you understand the power of Cassandra as per the new features. Once you’ve covered a brief recap of the basics, you’ll move on to deploying and monitoring a production setup and optimizing and integrating it with other software. You’ll work with the advanced features of CQL and the new storage engine in order to understand how they function on the server-side. You’ll explore the integration and interaction of Cassandra components, followed by discovering features such as token allocation algorithm, CQL3, vnodes, lightweight transactions, and data modelling in detail. Last but not least you will get to grips with Apache Spark. By the end of this book, you’ll be able to analyse big data, and build and manage high-performance databases for your application.
Table of Contents (12 chapters)

What this book covers

Chapter 1, Quick Start, walks the reader through getting started with Apache Cassandra. As the title suggests, explanations will be brief in favor of guiding the reader toward quickly standing up an Apache Cassandra single-node cluster.

Chapter 2, Cassandra Architecture, covers the ideas and theories behind how Apache Cassandra works. These concepts will be useful going forward, as an understanding of Cassandra's inner workings can help in building high-performing data models.

Chapter 3, Effective CQL, introduces the reader to the CQL. It describes building appropriate data models and how to leverage CQL to get the most out of your cluster.

Chapter 4, Configuring a Cluster, details the configuration files and settings that go into building an Apache Cassandra Cluster. In addition, this chapter also describes the effects that some of the settings have, and how they can be used to keep your cluster running well.

Chapter 5, Performance Tuning, discusses the extra settings, configurations, and design considerations that can help to improve performance or mitigate issues.

Chapter 6, Managing a Cluster, goes into detail when describing the nodetool utility, and how it can be used for operations on an Apache Cassandra cluster. Adding and removing nodes is covered, as well as taking and restoring from backups.

Chapter 7, Monitoring, describes how to integrate a technology stack that provides a window into an Apache Cassandra cluster's history and performance metrics.

Chapter 8, Application Development, takes the reader through design considerations around coding Java applications to work with an Apache Cassandra cluster.

Chapter 9, Integration with Apache Spark, talks about installing and using Apache Spark in order to analyze and discover value in your data.

Appendix A, References, In this chapter you will find links present for various references present throughout the book.