Book Image

MySQL 8 for Big Data

By : Shabbir Challawala, Chintan Mehta, Kandarp Patel, Jaydip Lakhatariya
Book Image

MySQL 8 for Big Data

By: Shabbir Challawala, Chintan Mehta, Kandarp Patel, Jaydip Lakhatariya

Overview of this book

With organizations handling large amounts of data on a regular basis, MySQL has become a popular solution to handle this structured Big Data. In this book, you will see how DBAs can use MySQL 8 to handle billions of records, and load and retrieve data with performance comparable or superior to commercial DB solutions with higher costs. Many organizations today depend on MySQL for their websites and a Big Data solution for their data archiving, storage, and analysis needs. However, integrating them can be challenging. This book will show you how to implement a successful Big Data strategy with Apache Hadoop and MySQL 8. It will cover real-time use case scenario to explain integration and achieve Big Data solutions using technologies such as Apache Hadoop, Apache Sqoop, and MySQL Applier. Also, the book includes case studies on Apache Sqoop and real-time event processing. By the end of this book, you will know how to efficiently use MySQL 8 to manage data for your Big Data applications.
Table of Contents (17 chapters)
Title Page
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Customer Feedback
Preface

Preface

With the organizations handling large amounts of data on a regular basis, MySQL has become a popular solution to handle this Structured Big Data. In this book, you will see how Database Administrators (DAs) can use MySQL to handle billions of records and load and retrieve data with performance comparable or superior to commercial DB solutions with higher costs.

Many organizations today depend on MySQL for their websites, and Big Data solutions for their data archiving, storage, and analysis needs. However, integrating them can be challenging. This book will show how to implement a successful Big Data strategy with Apache Hadoop and MySQL 8. It will cover real-time use case scenarios to explain integration and achieving Big Data solutions using different technologies such as Apache Hadoop, Apache Sqoop, and MySQL Applier.

The book will have discussion on topics such as features of MySQL 8, best practices for using MySQL 8, and NoSQL APIs provided by MySQL 8, and will also have a use case on using MySQL 8 for managing Big Data. By the end of this book, you will learn how to efficiently use MySQL 8 to manage data for your Big Data applications.

What this book covers

Chapter 1, Introduction to Big Data and MySQL 8, provides an overview of Big Data and MySQL 8, their importance, and life cycle of big data. It covers the basic idea of Big Data and its trends in the current market. Along with that, it also explains the benefits of using MySQL, takes us through the steps to install MySQL 8, and acquaints us with newly introduced features in MySQL 8.

Chapter 2, Data Query Techniques in MySQL 8, covers the basics of querying data on MySQL 8 and how to join or aggregate data set in it.

Chapter 3, Indexing your data for High-Performing Queries, explains about indexing in MySQL 8, introduces the different types of indexing available in MySQL, and shows how to do indexing for faster performance on large quantities of data.

Chapter 4, Using Memcached with MySQL 8, provides an overview of Memcached with MySQL and informs us of the various advantages of using it. It covers the Memcached installation steps, replication configuration, and various Memcached APIs in different programming languages.

Chapter 5, Partitioning High Volume Data, explains how high-volume data can be partitioned in MySQL 8 using different partitioning methods. It covers the various types of partitioning that we can implement in MySQL 8 and their use with Big Data.

Chapter 6, Replication for building highly available solutions, explains implementing group replication in MySQL 8. Chapter talks about how large data can be scaled and replicating of data can be faster using different techniques of replication.

Chapter 7, MySQL 8 Best Practices, covers the best practices of using MySQL 8 for Big Data. It has all the different kinds of dos and don'ts for using MySQL 8.

Chapter 8, NoSQL API for Integrating with Big Data Solutions, explains integration of NoSQL API for acquiring data. It also explains NoSQL and its various APIs in different programming languages for connecting NoSQL with MySQL.

Chapter 9, Case Study: Part I - Apache Sqoop for Exchanging Data between MySQL and Hadoop, explains how bulk data can be efficiently transferred between Hadoop and MySQL using Apache Sqoop.

Chapter 10, Case Study: Part II - Realtime event processing using MySQL applier, explains real-time integration of MySQL with Hadoop, and reading binary log events as soon as they are committed and writing them into a file in HDFS.

What you need for this book

This book will guide you through the installation of all the tools that you need to follow the examples. You will need to install the following software to effectively run the code samples present in this book:

  • MySQL 8.0.3
  • Hadoop 2.8.1
  • Apache Sqoop 1.4.6

Who this book is for

This book is intended for MySQL database administrators and Big Data professionals looking to integrate MySQL and Hadoop to implement a high performance Big Data solution. Some previous experience with MySQL will be helpful.

Conventions

In this book, you will find a number of text styles that distinguish between different kinds of information. Here are some examples of these styles and explanations of their meanings.

Code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles are shown as follows: "We can include other contexts through the use of the include directive."

A block of code is set as follows:

[default]
exten => s,1,Dial(Zap/1|30)
exten => s,2,Voicemail(u100)
exten => s,102,Voicemail(b100)
exten => i,1,Voicemail(s0)

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

[default]
exten => s,1,Dial(Zap/1|30)
exten => s,2,Voicemail(u100)
exten => s,102,Voicemail(b100)
exten => i,1,Voicemail(s0)

Any command-line input or output is written as follows:

# cp /usr/src/asterisk-addons/configs/cdr_mysql.conf.sample /etc/asterisk/cdr_mysql.conf

New terms and important words are shown in bold. Words that you see on the screen, for example, in menus or dialog boxes, appear in the text like this: "Clicking the Next button moves you to the next screen".

Note

Warnings or important notes appear in a box like this.

Note

Tips and tricks appear like this.

Reader feedback

Feedback from our readers is always welcome. Let us know what you think about this book--what you liked or disliked. Reader feedback is important for us as it helps us develop titles that you will really get the most out of.

To send us general feedback, simply email us at [email protected] and mention the book's title in the subject of your message.

If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide at www.packtpub.com/authors.

Customer support

Now that you are the proud owner of a Packt book, we have a number of things to help you get the most from your purchase.

Downloading the example code

You can download the example code files for this book from your account at http://www.packtpub.com. If you purchased this book elsewhere, you can visit http://www.packtpub.com/support and register to have the files emailed directly to you.

You can download the code files by following these steps:

  1. Log in or register to our website using your email address and password.
  2. Hover the mouse pointer on the SUPPORT tab at the top.
  3. Click on Code Downloads & Errata.

 

  1. Enter the name of the book in the Search box.
  2. Select the book for which you're looking to download the code files.
  3. Choose from the drop-down menu where you purchased this book from.
  4. Click on Code Download.

You can also download the code files by clicking on the Code Files button on the book's webpage at the Packt Publishing website. This page can be accessed by entering the book's name in the Search box. Note that you need to be logged in to your Packt account.

Once the file is downloaded, make sure that you unzip or extract the folder using the latest version of:

  • WinRAR / 7-Zip for Windows
  • Zipeg / iZip / UnRarX for Mac
  • 7-Zip / PeaZip for Linux

The code bundle for the book is also hosted on GitHub at https://github.com/PacktPublishing/MySQL-8-for-Big-Data. We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Downloading the color images of this book

We also provide you with a PDF file that has color images of the screenshots/diagrams used in this book. The color images will help you better understand the changes in the output. You can download this file from https://www.packtpub.com/sites/default/files/downloads/MySQL8forBigData_ColorImages.pdf.

Errata

Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you find a mistake in one of our books, maybe a mistake in the text or the code, we would be grateful if you could report this to us. By doing so, you can save other readers from frustration and help us improve subsequent versions of this book. If you find any errata, report them by visiting http://www.packtpub.com/submit-errata, selecting your book, clicking on the Errata Submission Form link, and entering the details of your errata. Once your errata are verified, your submission will be accepted and the errata will be uploaded to our website or added to any list of existing errata under the Errata section of that title.

To view the previously submitted errata, go to https://www.packtpub.com/books/content/support and enter the name of the book in the search field. The required information will appear under the Errata section.

Piracy

Piracy of copyrighted material on the Internet is an ongoing problem across all media. At Packt, we take the protection of our copyright and licenses very seriously. If you come across any illegal copies of our works in any form on the Internet, do provide us with the location address or the website name immediately, so that we can pursue a remedy.

Contact us at [email protected] with a link to the suspected pirated material.

We appreciate your help in protecting our authors and our ability to bring you valuable content.

Questions

If you have a problem with any aspect of this book, you can contact us at [email protected], and we will do our best to address the problem.