Hadoop MapReduce v2 Cookbook - Second Edition: RAW

Chapter 7. Hadoop Ecosystem II – Pig, HBase, Mahout, and Sqoop

In this chapter, we will cover the following topics:

  • Getting started with Apache Pig

  • Joining two datasets using Pig

  • Accessing a Hive table data in Pig using HCatalog

  • Getting started with Apache HBase

  • Data random access using Java client APIs

  • Running MapReduce jobs on HBase

  • Using Hive to insert data into HBase tables

  • Getting started with Apache Mahout

  • Running K-means with Mahout

  • Importing data to HDFS from a relational database using Apache Sqoop

  • Exporting data from HDFS to a relational database using Apache Sqoop