Chapter 9. Case study: Part I - Apache Sqoop for exchanging data between MySQL and Hadoop
In the previous chapter, we learnt about different techniques of using NoSQL API for MySQL. We went through examples of PHP, JSON, and Java to connect and consume data using NoSQL API. In this chapter, we will learn how we can use Apache Sqoop and Hadoop in Big Data life cycle for processing unstructured data to structured data which can be easily manipulated by relational databases such as MySQL.
Below are the topics we are going to cover in this chapter:
- Case study for log analysis
- Apache Sqoop overview
- Integrating Apache Sqoop with MySQL and Hadoop
- Importing unstructured data to Hadoop HDFS from MySQL
- Loading structured data to MySQL using Apache Sqoop
In Chapter 1, Introduction to Big Data and MySQL 8, we learnt how MySQL fits into the life cycle of Big Data. Let's say we are creating a people sentiment analysis system based on tweets and Facebook posts made by the users. As we already know, there are...