Learning Cascading

Putting it all together


This example shows how to put together a simple Cascading application that takes its input from two CSV (comma-separated values) files and joins them on their common fields.

Here is the problem: in some organizations, employees are also published authors. We would like to compile a list of employees who are authors, and add department information to the list.

We have two CSV files. The first contains each author's name, organization, journal title, and a keyword (the topic of the published work). The second contains department names and employee names. The desired result is the list of employees who are also authors, with their organization, department, journal title, and keyword. We will build the application step by step.
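Before turning to the Cascading code, it may help to see the intended result as a plain-Java inner join. The sketch below is not Cascading code and is not the book's implementation. It assumes the author's name and the employee's name are the common field, that the first file has the columns name, organization, journal title, keyword, and that the second has the columns department, employee name. The class name, method name, and sample rows are hypothetical.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: a plain-Java inner join, not the Cascading pipeline.
class AuthorJoinSketch {

    // Joins author rows (name, organization, journal, keyword) with
    // department rows (department, employee name) on the person's name.
    // Assumed column order; simple comma splitting, no quoted fields.
    static List<String> join(List<String> authorLines, List<String> departmentLines) {
        // Index the department file by employee name for fast lookup.
        Map<String, String> departmentByEmployee = new HashMap<>();
        for (String line : departmentLines) {
            String[] fields = line.split(",", -1);
            departmentByEmployee.put(fields[1].trim(), fields[0].trim());
        }

        List<String> result = new ArrayList<>();
        for (String line : authorLines) {
            String[] fields = line.split(",", -1);
            String name = fields[0].trim();
            String department = departmentByEmployee.get(name);
            // Inner join: keep only authors who also appear as employees.
            if (department != null) {
                result.add(String.join(",",
                        name, fields[1].trim(), department,
                        fields[2].trim(), fields[3].trim()));
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Hypothetical sample rows, made up for this sketch.
        List<String> authors = List.of(
                "Ann Lee,Acme,Journal of Data,hadoop",
                "Bob Ray,Acme,Data Review,joins");
        List<String> departments = List.of(
                "Engineering,Ann Lee",
                "Sales,Cy Doe");
        join(authors, departments).forEach(System.out::println);
    }
}
```

Only authors whose names also appear in the department file survive the join, and each output row carries name, organization, department, journal title, and keyword. The Cascading application built in the following steps expresses the same kind of join as a pipe assembly.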

  1. Import the libraries that the program needs:

    import cascading.property.AppProps;
    import cascading.scheme.Scheme;
    import cascading.scheme.local.TextDelimited;
    import cascading.scheme...