In this chapter, we will look at how to ingest and organize data to the newly created Data Lake to make it effective and useful. The topics covered in this chapter are as follows:
End-to-end Data Lake solution
Ingest data using HDFS commands
Ingest data to Azure Blob using Azure PowerShell
Ingest data using CloudXplorer
Using Sqoop to move data from RDBMS to cluster
Organizing your data in HDFS
Managing metadata using HCatalog