MongoDB Cookbook - Second Edition

By: Amol Nayak

Overview of this book

MongoDB is a high-performance, feature-rich NoSQL database that forms the backbone of the systems powering many different organizations, so it is easy to see why it is the most popular NoSQL database on the market. Packed with features that have become essential to software professionals of all kinds, and incredibly easy to use, this cookbook contains solutions to the everyday challenges of MongoDB, along with guidance on effective techniques to extend your skills and capabilities.

This book starts by showing you how to initialize the server in three different modes with various configurations. You will then be introduced to programming language drivers in both Java and Python; a new feature in MongoDB 3 is the ability to connect to a single node using Python, set to make MongoDB even more popular with anyone working with Python. You will then learn a range of further topics, including advanced query operations, monitoring and backup using MMS, and some very useful administration recipes such as SCRAM-SHA-1 authentication. Beyond that, you will also find recipes on cloud deployment, including guidance on how to work with Docker containers alongside MongoDB, integrating the database with Hadoop, and tips for improving developer productivity.

Created as both an accessible tutorial and an easy-to-use resource to have on hand whenever you need to solve a problem, MongoDB Cookbook will help you handle everything from administration to automation with MongoDB more effectively than ever before.
Table of Contents (17 chapters)
MongoDB Cookbook Second Edition
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Preface
Index

Running MapReduce jobs on Hadoop using streaming


In the previous recipe, we implemented a simple MapReduce job using Hadoop's Java API. The use case was the same as in the recipes in Chapter 3, Programming Language Drivers, where we implemented MapReduce using the Mongo client APIs in Python and Java. In this recipe, we will use Hadoop streaming to implement MapReduce jobs.

Hadoop streaming works by communicating with external mapper and reducer processes over stdin and stdout. You can get more information on Hadoop streaming and how it works at http://hadoop.apache.org/docs/r1.2.1/streaming.html.
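To make the stdin/stdout contract concrete, here is a minimal sketch of a streaming mapper and reducer pair in Python. It is not the book's implementation; it assumes, for illustration, input records that each carry a state name (echoing the per-state counting use case from Chapter 3) and shows the tab-separated key/value protocol Hadoop streaming expects. The same script can serve as both stages by passing "map" or "reduce" as an argument.

```python
import sys

# Hadoop streaming pipes raw input lines to the mapper's stdin and expects
# "key<TAB>value" lines on stdout. Hadoop then sorts all mapper output by
# key and pipes it, still one pair per line, to the reducer's stdin.

def mapper(lines):
    """Emit a count of 1 for each non-empty input record (a state name here)."""
    for line in lines:
        state = line.strip()
        if state:
            yield f"{state}\t1"

def reducer(pairs):
    """Sum the counts per key; streaming guarantees input is sorted by key."""
    current_key, total = None, 0
    for pair in pairs:
        key, value = pair.rstrip("\n").split("\t", 1)
        if key != current_key:
            if current_key is not None:
                yield f"{current_key}\t{total}"
            current_key, total = key, 0
        total += int(value)
    if current_key is not None:
        yield f"{current_key}\t{total}"

if __name__ == "__main__":
    # Run as:  script.py map   (mapper stage)
    #          script.py reduce (reducer stage)
    stage = reducer if len(sys.argv) > 1 and sys.argv[1] == "reduce" else mapper
    for out in stage(sys.stdin):
        print(out)
```

Because each stage only reads stdin and writes stdout, you can test the whole pipeline locally with a shell pipe (for example, `cat input.txt | python script.py map | sort | python script.py reduce`) before submitting it to Hadoop.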

Getting ready

Refer to the Executing our first sample MapReduce job using the mongo-hadoop connector recipe in this chapter to see how to set up Hadoop for development purposes and build the mongo-hadoop project using Gradle. As far as the Python libraries are concerned, we will install the required library from source; however, you can use pip (Python's package manager) to set it up if you do not wish to build...