Book Image

Mastering MongoDB 6.x - Third Edition

By : Alex Giamas

Book Image

Mastering MongoDB 6.x - Third Edition

By: Alex Giamas

Overview of this book

MongoDB is a leading non-relational database. This book covers all the major features of MongoDB including the latest version 6. MongoDB 6.x adds many new features and expands on existing ones such as aggregation, indexing, replication, sharding and MongoDB Atlas tools. Some of the MongoDB Atlas tools that you will master include Atlas dedicated clusters and Serverless, Atlas Search, Charts, Realm Application Services/Sync, Compass, Cloud Manager and Data Lake. By getting hands-on working with code using realistic use cases, you will master the art of modeling, shaping and querying your data and become the MongoDB oracle for the business. You will focus on broadly used and niche areas such as optimizing queries, configuring large-scale clusters, configuring your cluster for high performance and availability and many more. Later, you will become proficient in auditing, monitoring, and securing your clusters using a structured and organized approach. By the end of this book, you will have grasped all the practical understanding needed to design, develop, administer and scale MongoDB-based database applications both on premises and on the cloud.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Share Your Thoughts

Part 1 – Basic MongoDB – Design Goals and Architecture

Part 1 – Basic MongoDB – Design Goals and Architecture

Free Chapter

Chapter 1: MongoDB – A Database for the Modern Web

Chapter 1: MongoDB – A Database for the Modern Web

Technical requirements

The evolution of SQL and NoSQL

MongoDB for SQL developers

MongoDB for NoSQL developers

MongoDB’s key characteristics and use cases

MongoDB configuration and best practices

Reference documentation and further reading

Chapter 2: Schema Design and Data Modeling

Chapter 2: Schema Design and Data Modeling

Technical requirements

Relational schema design

Modeling data for atomic operations

Modeling relationships

Modeling data for keyword searches

Modeling data for Internet of Things

Connecting to MongoDB

Part 2 – Querying Effectively

Part 2 – Querying Effectively

Chapter 3: MongoDB CRUD Operations

Chapter 3: MongoDB CRUD Operations

Technical requirements

CRUD using the shell

The new mongosh shell

MongoDB Stable API

Chapter 4: Auditing

Chapter 4: Auditing

Technical requirements

Auditing and logging differences

Audit events and format

Audit setup in MongoDB Atlas

Audit case study

Chapter 5: Advanced Querying

Chapter 5: Advanced Querying

Technical requirements

MongoDB CRUD operations

Queryable encryption

Chapter 6: Multi-Document ACID Transactions

Chapter 6: Multi-Document ACID Transactions

Technical requirements

Transactions background

Exploring ACID properties

E-commerce using MongoDB

Chapter 7: Aggregation

Chapter 7: Aggregation

Technical requirements

Why aggregation?

Aggregation options

Aggregation operators

Time series collections

Optimizing aggregation pipelines

Aggregation use case

Chapter 8: Indexing

Chapter 8: Indexing

Index internals

Building and managing indexes

Using indexes efficiently

Further reading

Part 3 – Administration and Data Management

Part 3 – Administration and Data Management

Chapter 9: Monitoring, Backup, and Security

Chapter 9: Monitoring, Backup, and Security

Technical requirements

Monitoring clusters

Cluster backups

Securing our clusters

Chapter 10: Managing Storage Engines

Chapter 10: Managing Storage Engines

Pluggable storage engines

Locking in MongoDB

Further reading

Chapter 11: MongoDB Tooling

Chapter 11: MongoDB Tooling

Technical requirements

Introduction to MongoDB tools

MongoDB Kubernetes Operator

MongoDB Atlas Serverless

MongoDB Compass

MongoDB Cloud Manager

Chapter 12: Harnessing Big Data with MongoDB

Chapter 12: Harnessing Big Data with MongoDB

Technical requirements

What is big data?

Big data use case with servers on-premises

MongoDB Atlas Data Lake

Further reading

Part 4 – Scaling and High Availability

Part 4 – Scaling and High Availability

Chapter 13: Mastering Replication

Chapter 13: Mastering Replication

Technical requirements

An architectural overview

How do elections work?

What is the use case for a replica set?

Setting up a replica set

Connecting to a replica set

Replica set administration

Cloud options for a replica set

Replica set limitations

Chapter 14: Mastering Sharding

Chapter 14: Mastering Sharding

Technical requirements

Why do we need sharding?

Architectural overview

Setting up sharding

Sharding administration and monitoring

Querying sharded data

Sharding recovery

Further reading

Chapter 15: Fault Tolerance and High Availability

Chapter 15: Fault Tolerance and High Availability

Application design

Elevating operations

Boosting security

Further reading

Index

Other Books You May Enjoy

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Why aggregation?

The aggregation framework was introduced by MongoDB in version 2.2 (version 2.1 in the development branch). It serves as an alternative to both the MapReduce framework, which is deprecated as of version 5.0, and querying the database directly.

Using the aggregation framework, we can perform GROUP BY operations in the server. Therefore, we can project only the fields that are needed in the result set. Using the $match and $project operators, we can reduce the amount of data passed through the pipeline, resulting in faster data processing.

Self-joins—that is, joining data within the same collection—can also be performed using the aggregation framework, as we will see in our use case.

When comparing the aggregation framework to simply using the queries available via the shell or various other drivers, it is important to remember that there is a use case for both.

For selection and projection queries, it’s almost always better to use...