Current IT architecture uses a Enterprise Data Warehouse (EDW) as the centralized repository that feeds several business data marts to drive business intelligence and data mining systems. With the advent of smart connected devices and social media that generate petabytes of data, these current relational EDWs are not able to scale and meet the business needs. This chapter will discuss how to build a modern data architecture that extends the EDW with the Hadoop ecosystem.
In this chapter, we will cover the following topics:
Enterprise Data Warehouse architecture
Next generation Hadoop-based Data Lake architecture
The journey to your Data Lake dream
Tools and technology in the Hadoop ecosystem
Use case powered by Microsoft HDInsight