Book Image

MySQL 8 for Big Data

By : Shabbir Challawala, Chintan Mehta, Kandarp Patel, Jaydip Lakhatariya
Book Image

MySQL 8 for Big Data

By: Shabbir Challawala, Chintan Mehta, Kandarp Patel, Jaydip Lakhatariya

Overview of this book

With organizations handling large amounts of data on a regular basis, MySQL has become a popular solution to handle this structured Big Data. In this book, you will see how DBAs can use MySQL 8 to handle billions of records, and load and retrieve data with performance comparable or superior to commercial DB solutions with higher costs. Many organizations today depend on MySQL for their websites and a Big Data solution for their data archiving, storage, and analysis needs. However, integrating them can be challenging. This book will show you how to implement a successful Big Data strategy with Apache Hadoop and MySQL 8. It will cover real-time use case scenario to explain integration and achieve Big Data solutions using technologies such as Apache Hadoop, Apache Sqoop, and MySQL Applier. Also, the book includes case studies on Apache Sqoop and real-time event processing. By the end of this book, you will know how to efficiently use MySQL 8 to manage data for your Big Data applications.
Table of Contents (17 chapters)
Title Page
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Customer Feedback
Preface

NoSQL versus SQL


Let's see the major differences between NoSQL and SQL databases:

NoSQL

SQL

NoSQL is referred to as a non-relational and distributed database system.

SQL is referred to as relational database system.

NoSQL is horizontally scalable.

SQL is vertically scalable.

NoSQL does not use a structured query language. It has an unstructured query language, which varies from database to database.

SQL uses the structured query language to manipulate the data.

NoSQL does not have a fixed or predefined schema.

SQL has a predefined static database.

NoSQL stores data in key/value pairs.

SQL stores data in tabular format.

For complex relational queries, NoSQL is not suitable.

SQL is best suited for complex relational queries.

Preferable to handle big data.

Preferable to handle relational data.

NoSQL stores data in the form of collections where data can be duplicated and stored in a single entity. Hence, reading/writing on a single entity is easier and faster.

SQL stores data in a normalized way and breaks down...