Book Image

MySQL 8 for Big Data

By : Shabbir Challawala, Chintan Mehta, Kandarp Patel, Jaydip Lakhatariya
Book Image

MySQL 8 for Big Data

By: Shabbir Challawala, Chintan Mehta, Kandarp Patel, Jaydip Lakhatariya

Overview of this book

With organizations handling large amounts of data on a regular basis, MySQL has become a popular solution to handle this structured Big Data. In this book, you will see how DBAs can use MySQL 8 to handle billions of records, and load and retrieve data with performance comparable or superior to commercial DB solutions with higher costs. Many organizations today depend on MySQL for their websites and a Big Data solution for their data archiving, storage, and analysis needs. However, integrating them can be challenging. This book will show you how to implement a successful Big Data strategy with Apache Hadoop and MySQL 8. It will cover real-time use case scenario to explain integration and achieve Big Data solutions using technologies such as Apache Hadoop, Apache Sqoop, and MySQL Applier. Also, the book includes case studies on Apache Sqoop and real-time event processing. By the end of this book, you will know how to efficiently use MySQL 8 to manage data for your Big Data applications.
Table of Contents (17 chapters)
Title Page
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Customer Feedback
Preface

Importing unstructured data to Hadoop HDFS from MySQL


Using Sqoop, we can transfer data from relational database to Hadoop HDFS. As Sqoop uses Java Database Connectivity (JDBC) driver for connecting with the source, it can be used with any relational database having support of JDBC connection strings. In the previous section, we downloaded and configured Sqoop's MySQL connector, so now let's see how to connect with MySQL databases from Sqoop and transfer the data to HDFS.

Sqoop import for fetching data from MySQL 8

To understand Sqoop's import process, let's create a database and table in MySQL 8, which we will use throughout the chapter for demonstrating examples:

Sqoop provides import command to import data from relational database to HDFS. Following are generic commands used for importing data using Sqoop:

sqoop import (generic-args) (import-args)
sqoop-import (generic-args) (import-args)
  • generic-args are common parameter for export such as providing JDBC connection string, JDBC driver name...