Book Image

Learning Hadoop 2

By : Gerald Turkington, GABRIELE MODENA
Book Image

Learning Hadoop 2

By: Gerald Turkington, GABRIELE MODENA

Overview of this book

Table of Contents (18 chapters)
Learning Hadoop 2
About the Authors
About the Reviewers

Chapter 8. Data Lifecycle Management

Our previous chapters were quite technology focused, describing particular tools or techniques and how they can be used. In this and the next chapter, we are going to take a more top-down approach whereby we will describe a problem space you are likely to encounter and then explore how to address it. In particular, we'll cover the following topics:

  • What we mean by the term data life cycle management

  • Why data life cycle management is something to think about

  • The categories of tools that can be used to address the problem

  • How to use these tools to build the first half of a Twitter sentiment analysis pipeline