Implementing StreamGroupings in Apache Storm | Learning Apache Storm for Big Data Processing

Learning Apache Storm for Big Data Processing

By: Prashant Nair

Overview of this book

Apache Storm is a distributed real-time processing engine. Created by Nathan Marz at BackType and later open sourced under the Apache License 2.0, it is a scalable, fault-tolerant engine for processing massive volumes of unbounded streams. This course shows how simple yet efficient Apache Storm is for real-time processing. You will start with data processing types, then move on to Apache Storm and its features. You'll learn the core concepts of Apache Storm, such as spouts, bolts, topologies, and stream groupings, and set up Apache Storm in single-node and multi-node configurations. You'll also explore how fault-tolerant Apache Storm is. Taking this course will kick-start your experience with Apache Storm: you'll create a scalable, fault-tolerant, real-time processing application while building a strong foundation in the real-time processing paradigm and Apache Storm. All the code and supporting files for the course can be found here: https://github.com/PacktPublishing/Learning-Apache-Storm-for-Big-Data-Processing

Introducing Real-time Processing

The Course Overview

Understanding Lambda Architecture

Big Data Processing Types

What Is Apache Storm?

When to Use Apache Storm?

Apache Storm Concepts

Topology

Tuples

Spouts and Bolts

Streams and StreamGrouping

Setting Up Your Apache Storm Development Environment

Introduction – Prerequisites and System Requirements

Installing Java and Setting Environment Variables

Installing and Configuring Eclipse

Building Apache Storm Project Using Maven

Building Apache Storm Project Using External JAR Configuration

Creating Our First Storm Topology

Understanding the Problem Statement

Developing Spout Class to Emit the Data

Develop a Bolt Class to Perform Calculation

Develop a Bolt Class to Print Result in Console

Developing Topology Class

Executing Our Application in Eclipse

Setting Up Apache Storm as a Single-Node Cluster

Understanding Storm Daemons

Prerequisites

Setting Up Zookeeper in Standalone Mode

Install and Configure Apache Storm in Single-Node

Deploy NumSquareTopology in Cluster

Explore Storm UI and Understand Essential Features

Setting Up Apache Storm in Multi-Node Cluster

Setting Up Zookeeper in Multi-Node Mode

Setting Up Apache Storm in Multi-Node Cluster

Implementing StreamGroupings in Apache Storm

Introduction

Implementing ShuffleGrouping

Implementing FieldGrouping

Implementing AllGrouping

Implementing CustomGrouping

Implementing DirectGrouping

Integrating Hadoop with Apache Storm

Introduction

Writing a HDFS Bolt

Understanding and Implementing Tridents in Apache Storm

Introduction

Building Topology Using Trident

Understand and Implement Map, Filter, and Aggregate Function

Windowing Operations

Joining Stream Tuples in Storm
