Book Image

Programming MapReduce with Scalding

By : Antonios Chalkiopoulos
Book Image

Programming MapReduce with Scalding

By: Antonios Chalkiopoulos

Overview of this book

Table of Contents (16 chapters)
Programming MapReduce with Scalding
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Chapter 4. Intermediate Examples

This chapter goes through a real implementation in Scalding of non-trivial applications using the operations presented in the previous chapter. We will go through the data analysis, design, implementation, and optimization of data-transformation jobs for the following:

  • Logfile analysis

  • Ad targeting

Analyzing logfiles that have been stored for some time is a usual starting application of a new Hadoop team in an organization. The type of value to extract from the logfiles depends on the use case. As an example, we will use a case where we will need to think a lot about how to manage the data.

Another example of Ad targeting will make us look at how to structure and store the data to allow us to run daily jobs. It will involve input from data scientists and deep analysis of customer behavior to recommend personalized advertisements.