Book Image

Getting Started with RethinkDB

By : Gianluca Tiepolo
Book Image

Getting Started with RethinkDB

By: Gianluca Tiepolo

Overview of this book

RethinkDB is a high-performance document-oriented database with a unique set of features. This increasingly popular NoSQL database is used to develop real-time web applications and, together with Node.js, it can be used to easily deploy them to the cloud with very little difficulty. Getting Started with RethinkDB is designed to get you working with RethinkDB as quickly as possible. Starting with the installation and configuration process, you will learn how to start importing data into the database and run simple queries using the intuitive ReQL query language. After successfully running a few simple queries, you will be introduced to other topics such as clustering and sharding. You will get to know how to set up a cluster of RethinkDB nodes and spread database load across multiple machines. We will then move on to advanced queries and optimization techniques. You will discover how to work with RethinkDB from a Node.js environment and find out all about deployment techniques. Finally, we’ll finish by working on a fully-fledged example that uses the Node.js framework and advanced features such as Changefeeds to develop a real-time web application.
Table of Contents (15 chapters)
Getting Started with RethinkDB
Credits
About the Author
Acknowledgement
About the Reviewer
www.PacktPub.com
Preface
Index

Bulk data import


In the following section, we'll be talking a lot about indexing and advanced queries; however, it doesn't make sense to run advanced queries on a database that contains just one or two documents! Sometimes, you need to load lots of data into RethinkDB to use as sample data. Such data can include names, numbers, zip codes, locations, and so on.

To simulate a real-world scenario, we're going to import a big dataset into RethinkDB that includes some fake data (name, surname, e-mail, and age) for 30,000 people. This dataset is included in the data.json file that you can find in the code folder that accompanies this book.

RethinkDB includes a bulk loader that can be run from a shell; it is designed to import huge quantities of data into a particular database table on the server.

The import utility can load data from files in these formats:

  • CSV: In this file format, also known as comma-separated values, each line within the file represents a document, and each field within one single...