Book Image

Programming MapReduce with Scalding

By : Antonios Chalkiopoulos
Book Image

Programming MapReduce with Scalding

By: Antonios Chalkiopoulos

Overview of this book

Table of Contents (16 chapters)
Programming MapReduce with Scalding
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Summary


Scalding provides a number of ways to implement and execute machine learning algorithms. As presented, we can manipulate pipes, use the Matrix API or algebird, and interoperate with existing libraries such as Mahout.

The majority of ML jobs originate as Big Data ETL jobs that reduce to a smaller data space. The final result usually needs some form of post-processing, and it is then stored in an external source. Scalding provides great interoperability with external systems, and it is thus one of the most suitable technologies to solve such problems.