Seven NoSQL Databases in a Week

Seven NoSQL Databases in a Week

By : Sudarshan Kadambi, Xun (Brian) Wu

Buy this Book

Seven NoSQL Databases in a Week

By: Sudarshan Kadambi, Xun (Brian) Wu

Buy this Book

Overview of this book

This is the golden age of open source NoSQL databases. With enterprises having to work with large amounts of unstructured data and moving away from expensive monolithic architecture, the adoption of NoSQL databases is rapidly increasing. Being familiar with the popular NoSQL databases and knowing how to use them is a must for budding DBAs and developers. This book introduces you to the different types of NoSQL databases and gets you started with seven of the most popular NoSQL databases used by enterprises today. We start off with a brief overview of what NoSQL databases are, followed by an explanation of why and when to use them. The book then covers the seven most popular databases in each of these categories: MongoDB, Amazon DynamoDB, Redis, HBase, Cassandra, In?uxDB, and Neo4j. The book doesn't go into too much detail about each database but teaches you enough to get started with them. By the end of this book, you will have a thorough understanding of the different NoSQL databases and their functionalities, empowering you to select and use the right database according to your needs.

Title Page

Dedication

Packt Upsell

Contributors

Preface

Free Chapter

Introduction to NoSQL Databases

Consistency versus availability

ACID guarantees

Hash versus range partition

In-place updates versus appends

Row versus column versus column-family storage models

Strongly versus loosely enforced schemas

Summary

MongoDB

Installing of MongoDB

MongoDB data types

Data models in MongoDB

Introduction to MongoDB indexing

Replication

Sharding

Storing large data in MongoDB

Summary

Neo4j

What is Neo4j?

How does Neo4j work?

Features of Neo4j

Evaluating your use case

Neo4j anti-patterns

Neo4j hardware selection, installation, and configuration

Using Neo4j

Tips for success

Summary

References

Redis

Introduction to Redis

What are the key features of Redis?

Appropriate use cases for Redis

Data modeling and application design with Redis

Redis anti-patterns

Redis setup, installation, and configuration

Using Redis

Tips for success

Summary

References

Cassandra

Introduction to Cassandra

What problems does Cassandra solve?

What are the key features of Cassandra?

Appropriate use cases for Cassandra

Cassandra anti-patterns

Cassandra hardware selection, installation, and configuration

Summary

HBase

Logical and physical data models

Interacting with HBase – the HBase shell

Interacting with HBase – the HBase Client API

Advanced topics

Summary

DynamoDB

The difference between SQL and DynamoDB

Setting up DynamoDB

DynamoDB data types and terminology

Data models and CRUD operations in DynamoDB

Limitations of DynamoDB

Best practices

Summary

InfluxDB

Introduction to InfluxDB

Installation and configuration

Query language and API

InfluxDB ecosystem

InfluxDB operations

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

In-place updates versus appends

Another key difference between database systems is how they handle updates to the physical records stored on disk.

Relational databases, such as MySQL, maintain a variety of structures in both the memory and disk, where writes from in-flight transactions and writes from completed transactions are persisted. Once the transaction has been committed, the physical record on disk for a given key is updated to reflect that. On the other hand, many NoSQL databases, such as HBase and Cassandra, are variants of what is called a log-structured merge (LSM) database.

In such an LSM database, updates aren't applied to the record at transaction commit. Instead, updates are applied in memory. Once the memory structure gets full, the contents of the memory are flushed to the disk. This means that updates to a single record can be fragmented and located within separate flush files that are created over time. This means that when there is a read for that record, you need to read in fragments of the record from the different flush files and merge the fragments in reverse time order in order to construct the latest snapshot of the given record. We will discuss the mechanics of how an LSM database works in the Chapter 6, HBase.

Seven NoSQL Databases in a Week

By : Sudarshan Kadambi, Xun (Brian) Wu

Seven NoSQL Databases in a Week

By: Sudarshan Kadambi, Xun (Brian) Wu

Overview of this book

Related Content you might be interested in

Current Title:

Seven NoSQL Databases in a Week

Mastering Apache Cassandra 3.x

Learning Apache Cassandra

HBase High Performance Cookbook

In-place updates versus appends