Microsoft Azure is a flexible cloud platform that enables enterprises of any size to quickly rent resources on demand to build, deploy, and manage a wide range of IT applications. HDInsight is a 100 percent Apache Hadoop-based cloud service that leverages the Azure platform.
In this chapter, we will discuss how to use the HDInsight service on the Azure cloud to build a Data Lake that can scale to petabytes on demand. We will cover the following topics:
Registering for an Azure account
Provisioning an HDInsight cluster
HDInsight Management dashboard
Exploring a cluster using the remote desktop
Deleting the HDInsight cluster
HDInsight Emulator for development