In this chapter, we will learn to build big data applications. We will analyze a traditional end-to-end data workflow life cycle and then, along similar lines, build a big data application step by step. We will cover the big data process: discovery, ingestion, visualization, and governance. The emphasis will be on the Spark platform and data science prediction models. The application of DevOps to the various phases of big data will be explored in subsequent chapters. This chapter covers the following topics:
- Traditional data platforms
- Big data platform core principles
- Big data life cycle:
  - Data discovery
  - Data quality
  - Data ingestion
  - Data analytics
  - Spark platform
  - Data visualization
  - Data governance
- Building enterprise applications
- Data science: prediction models