HDInsight Essentials - Second Edition

By: Rajesh Nadipalli
Chapter 2. Enterprise Data Lake using HDInsight

Current IT architecture uses an Enterprise Data Warehouse (EDW) as the centralized repository that feeds several business data marts to drive business intelligence and data mining systems. With the advent of smart connected devices and social media that generate petabytes of data, these relational EDWs cannot scale to meet business needs. This chapter discusses how to build a modern data architecture that extends the EDW with the Hadoop ecosystem.

In this chapter, we will cover the following topics:

  • Enterprise Data Warehouse architecture

  • Next-generation Hadoop-based Data Lake architecture

  • The journey to your Data Lake dream

  • Tools and technology in the Hadoop ecosystem

  • Use case powered by Microsoft HDInsight