Book Image

Cassandra Design Patterns - Second Edition

By : Rajanarayanan Thottuvaikkatumana
Book Image

Cassandra Design Patterns - Second Edition

By: Rajanarayanan Thottuvaikkatumana

Overview of this book

If you are new to Cassandra but well-versed in RDBMS modeling and design, then it is natural to model data in the same way in Cassandra, resulting in poorly performing applications and losing the real purpose of Cassandra. If you want to learn to make the most of Cassandra, this book is for you. This book starts with strategies to integrate Cassandra with other legacy data stores and progresses to the ways in which a migration from RDBMS to Cassandra can be accomplished. The journey continues with ideas to migrate data from cache solutions to Cassandra. With this, the stage is set and the book moves on to some of the most commonly seen problems in applications when dealing with consistency, availability, and partition tolerance guarantees. Cassandra is exceptionally good at dealing with temporal data and patterns such as the time-series pattern and log pattern, which are covered next. Many NoSQL data stores fail miserably when a huge amount of data is read for analytical purposes, but Cassandra is different in this regard. Keeping analytical needs in mind, you’ll walk through different and interesting design patterns. No theoretical discussions are complete without a good set of use cases to which the knowledge gained can be applied, so the book concludes with a set of use cases you can apply the patterns you’ve learned.
Table of Contents (15 chapters)

Transformation pattern


Data transformation is the heart of data processing, which in turn supports the data analysis. Any framework that is built to perform data processing should have excellent data transformation capabilities in addition to seamless connectivity with various distributed and scalable data stores. These frameworks are built for specific data processing needs of the organizations by themselves, and they consist of a set of commonly available tools used in a specific way to achieve the data processing goals. Cassandra, in conjunction with Spark and a preferred programming language such as Scala, Java, Python, or R, can be used to perform very effective data transformation and data processing. When choosing the programming language, it is better to choose a functional programming language. In functional programming languages, functions are first-class citizens, and they can be used just like any other data type to pass it as parameters to functions and return functions from...