Mastering Hadoop

By: Sandeep Karanth

Apache Storm


Apache Storm is one of the most popular open source streaming engines and provides real-time analytics on unbounded streams of data. It is a distributed framework that runs across multiple nodes, providing both fault tolerance and horizontal scalability. Another primary feature of Apache Storm is guaranteed event processing: every event that enters the system is processed, and none is lost. Apache Storm applications can be written in any programming language of the developer's choice, making Storm extremely attractive for low-latency analytics.

MapReduce provides the Map and Reduce function primitives to build batch applications. Similarly, Storm provides its own set of primitives to support real-time analytics. Without a framework such as Apache Storm, writing real-time applications involves a lot of complexity: the developer must add and maintain processing queues to guarantee processing of all events in the system....
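
To make that complexity concrete, the following is a minimal sketch of the kind of hand-rolled processing queue an application might need without a streaming framework. It is not Storm code and does not use any Storm API. The class `ReliableEventQueue`, its methods, and the retry limit are hypothetical names chosen for illustration. The sketch is single-threaded and in-memory, and it assumes that each event string is unique, so it leaves out the distribution and fault tolerance that a real system would also have to handle.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Queue;
import java.util.function.Predicate;

/**
 * Hypothetical hand-rolled queue that redelivers an event until its handler
 * reports success. Illustration only; assumes event strings are unique.
 */
class ReliableEventQueue {
    private final Queue<String> pending = new ArrayDeque<>();
    private final Map<String, Integer> attempts = new HashMap<>();
    private final List<String> deadLetters = new ArrayList<>();

    /** Adds an event to the end of the queue. */
    void publish(String event) {
        pending.add(event);
    }

    /**
     * Delivers every pending event to the handler. An event whose handler
     * returns false or throws is put back on the queue, up to maxAttempts
     * deliveries; after that it is moved to the dead-letter list.
     * Returns the events that were processed successfully, in completion order.
     */
    List<String> drain(Predicate<String> handler, int maxAttempts) {
        List<String> processed = new ArrayList<>();
        while (!pending.isEmpty()) {
            String event = pending.poll();
            int deliveries = attempts.merge(event, 1, Integer::sum);
            boolean succeeded;
            try {
                succeeded = handler.test(event);
            } catch (RuntimeException e) {
                succeeded = false; // treat a crash in the handler as a failed delivery
            }
            if (succeeded) {
                processed.add(event);
                attempts.remove(event);
            } else if (deliveries < maxAttempts) {
                pending.add(event); // redeliver later
            } else {
                deadLetters.add(event); // give up, but keep a record of the event
                attempts.remove(event);
            }
        }
        return processed;
    }

    /** Events that exhausted their delivery attempts. */
    List<String> deadLetters() {
        return deadLetters;
    }

    public static void main(String[] args) {
        ReliableEventQueue queue = new ReliableEventQueue();
        queue.publish("click-1");
        queue.publish("click-2");
        // The handler here always succeeds; a real handler would do the analytics work.
        System.out.println(queue.drain(event -> true, 3));
    }
}
```

Even this toy version has to track delivery counts, redelivery, and abandoned events by hand, and it would still need persistence, concurrency control, and coordination across nodes before it could back a production system.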