Learning Cascading

By: Covert, Victoria Loewengart

Overview of this book

This book is intended for software developers, system architects and analysts, big data project managers, and data scientists who wish to deploy big data solutions using the Cascading framework. You must have a basic understanding of the big data paradigm and should be familiar with Java development techniques.

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

1. The Big Data Core Technology Stack

Reviewing Hadoop

MapReduce execution framework

The Cascading framework

Summary

2. Cascading Basics in Detail

Understanding common Cascading themes

Understanding how Cascading represents records

Understanding how Cascading controls data flow

Putting it all together

Summary

3. Understanding Custom Operations

Understanding operations

Summary

4. Creating Custom Operations

Writing custom operations

Identifying common use cases for custom operations

Summary

5. Code Reuse and Integration

Creating and using subassemblies

Using cascades

Dynamically controlling flows

Integrating external components

Summary

6. Testing a Cascading Application

Debugging a Cascading application

Testing strategies

Summary

7. Optimizing the Performance of a Cascading Application

Optimizing performance

Summary

8. Creating a Real-world Application in Cascading

Project description – Business Intelligence case study on monitoring the competition

Project scope – understanding requirements

Defining the project – the Cascading development methodology

Building the workflow

Next steps

Summary

9. Planning for Future Growth

Finding online resources

Using other Cascading tools

Custom taps

Cascading serializers

Java open source mock frameworks

Summary

A. Downloadable Software

Contents

Installing and using

Index

Chapter 1. The Big Data Core Technology Stack

This chapter introduces the concepts of big data and the core technologies that comprise it. This knowledge is the foundation for learning Cascading. Cascading provides a comprehensive framework that eases many of the verbose and mundane tasks associated with writing Hadoop jobs and, in the future, many other types of big data jobs. However, as with all complex subjects, you need to understand how Hadoop works to fully understand how best to use Cascading.
