HDInsight Essentials - Second Edition

By: Rajesh Nadipalli

Chapter 5. Ingest and Organize Data Lake

In this chapter, we will look at how to ingest data into the newly created Data Lake and organize it so that it is effective and useful. The topics covered in this chapter are as follows:

  • End-to-end Data Lake solution

  • Ingest data using HDFS commands

  • Ingest data to Azure Blob using Azure PowerShell

  • Ingest data using CloudXplorer

  • Using Sqoop to move data from an RDBMS to the cluster

  • Organizing your data in HDFS

  • Managing metadata using HCatalog