Clojure Data Analysis Cookbook - Second Edition

By: Eric Richard Rochester

Chapter 5. Distributed Data Processing with Cascalog

In this chapter, we will cover the following recipes:

  • Initializing Cascalog and Hadoop for distributed processing

  • Querying data with Cascalog

  • Distributing data with Apache HDFS

  • Parsing CSV files with Cascalog

  • Executing complex queries with Cascalog

  • Aggregating data with Cascalog

  • Defining new Cascalog operators

  • Composing Cascalog queries

  • Transforming data with Cascalog