Learning Cascading

Building the workflow


We are finished with the components of our workflow and now assume the role of the process architect. Keeping in mind the high-level task given to us by our process owner, as well as our functional decomposition diagram, we are going to assemble the workflow for this project using flows and then assemble those flows into a cascade.
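To make the idea of assembling flows into a cascade more concrete, here is a minimal sketch in plain Java. It does not use the Cascading API; in Cascading itself this role is played by `Flow` objects joined through a `CascadeConnector`. The `FlowSpec` class, the `executionOrder` method, the flow names, and the paths are all hypothetical and exist only for illustration. The sketch assumes that a flow can run once every flow that writes one of its inputs has been scheduled, which is the kind of dependency ordering a cascade is meant to take care of for us.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Iterator;
import java.util.List;
import java.util.Set;

public class CascadeSketch {

    /** Hypothetical stand-in for a flow: a name, the paths it reads, and the paths it writes. */
    static final class FlowSpec {
        final String name;
        final List<String> sources;
        final List<String> sinks;

        FlowSpec(String name, List<String> sources, List<String> sinks) {
            this.name = name;
            this.sources = sources;
            this.sinks = sinks;
        }
    }

    /** Returns flow names so that every flow comes after the flows that write its inputs. */
    static List<String> executionOrder(List<FlowSpec> flows) {
        // Paths written by some flow in this set; every other path is an external input.
        Set<String> produced = new HashSet<>();
        for (FlowSpec flow : flows) {
            produced.addAll(flow.sinks);
        }

        Set<String> available = new HashSet<>();          // outputs of flows already scheduled
        List<FlowSpec> pending = new ArrayList<>(flows);  // flows still waiting for their inputs
        List<String> ordered = new ArrayList<>();

        while (!pending.isEmpty()) {
            boolean progressed = false;
            for (Iterator<FlowSpec> it = pending.iterator(); it.hasNext(); ) {
                FlowSpec flow = it.next();
                if (isReady(flow, produced, available)) {
                    ordered.add(flow.name);
                    available.addAll(flow.sinks);
                    it.remove();
                    progressed = true;
                }
            }
            if (!progressed) {
                // No pending flow can start, so the remaining flows wait on each other.
                throw new IllegalStateException("Flows depend on each other in a cycle");
            }
        }
        return ordered;
    }

    /** A flow is ready when none of its sources is still waiting to be written. */
    private static boolean isReady(FlowSpec flow, Set<String> produced, Set<String> available) {
        for (String source : flow.sources) {
            if (produced.contains(source) && !available.contains(source)) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Hypothetical flows, deliberately listed out of order.
        List<FlowSpec> flows = Arrays.asList(
            new FlowSpec("summarize", Arrays.asList("out/entities"), Arrays.asList("out/report")),
            new FlowSpec("countWords", Arrays.asList("in/docs"), Arrays.asList("out/counts")),
            new FlowSpec("extractEntities", Arrays.asList("in/docs"), Arrays.asList("out/entities")));
        System.out.println(executionOrder(flows));
    }
}
```

In this sketch, `countWords` and `extractEntities` read only external input and do not depend on each other, so they could run in parallel, while `summarize` has to wait for the output of `extractEntities`. This mirrors the mix of sequential and parallel processing that we want the cascade to handle.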

There is more to building the workflow, however. We also need to come up with an efficient test plan for the project and, since we are developing in local mode, we need to port the project to a Hadoop cluster. Here is an in-depth decomposition of our project:

Figure 8.5 - Final project decomposition

Building flows

The following steps describe the process of creating flows for both sequential and parallel processing of the tasks within our project.

  1. There is no need for us to save tokenized text as an individual output, but we do want to save extracted named entities with a corresponding document name and sentence number. So, our first flow will include two...