Book Image

Storm Real-time Processing Cookbook

By : Quinton Anderson
Book Image

Storm Real-time Processing Cookbook

By: Quinton Anderson

Overview of this book

<p>Storm is a free and open source distributed real-time computation system. Storm makes it easy to reliably process unbounded streams of data, doing for real-time processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language, and is a lot of fun to use!<br />Storm Real Time Processing Cookbook will have basic to advanced recipes on Storm for real-time computation.<br /><br />The book begins with setting up the development environment and then teaches log stream processing. This will be followed by real-time payments workflow, distributed RPC, integrating it with other software such as Hadoop and Apache Camel, and more.</p>
Table of Contents (16 chapters)
Storm Real-time Processing Cookbook
Credits
About the Author
About the Reviewers
www.packtpub.com
Preface
Index

Introduction


We break away briefly from the TF-IDF thread to explore the polyglot capabilities of Storm. Polyglot development is becoming increasingly important, particularly in the open source world where mashups present a rapid path to delivery, regardless of the underlying technology. There is an increasing number of JVM-based languages that maintain binary-level compatibility with Java, such as Scala. In these cases, the Polyglot project is simply a composition of JAR files with appropriate levels of modularity. In the cases where the underlying execution environment isn't common, other approaches are required. There are many use cases, over and above convenient re-use, such as high-performance computing, where native implementations provide for greater levels of optimization or leverage of low-level hardware capabilities, such as the rich functionality of GPUs, in most modern PCs.

There are many approaches to integrating systems developed in incompatible languages, including messaging...