Book Image

Learning Apache Storm for Big Data Processing [Video]

By : Prashant Nair
Book Image

Learning Apache Storm for Big Data Processing [Video]

By: Prashant Nair

Overview of this book

Apache Storm is a distributed real-time processing engine. Created by Nathanmarz for Backtype and later open sourced under Apache License 2, it's a scalable and a fault-tolerant engine used to process a massive number of unbounded streams. In this course you will see how simple yet efficient Apache Storm is when it comes to real-time processing. In the course, you will learn about data processing types followed by Apache Storm and its features. You'll learn the core concepts of Apache Storm such as spouts, bolts, topology, and stream grouping, and set up Apache Storm in single-node and multi-node configurations. Also you'll explore how fault-tolerant Apache Storm is. Taking this course will kick-start your experience with Apache Storm; you'll create a scalable, fault-tolerant, real-time processing application while setting a strong base for the fundamentals of the real-time processing paradigm and Apache Storm. All the code and supporting files for the course can be found here- https://github.com/PacktPublishing/Learning-Apache-Storm-for-Big-Data-Processing
Table of Contents (9 chapters)
8
Integrating Hadoop with Apache Storm
Chapter 7
Implementing StreamGroupings in Apache Storm
Content Locked
Section 6
Implementing DirectGrouping
In this video, we will Understand and implement DirectGrouping. - Implement DirectGrouping