Storm Real-time Processing Cookbook

By: Quinton Anderson

Overview of this book

Storm is a free and open source distributed real-time computation system. Storm makes it easy to reliably process unbounded streams of data, doing for real-time processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language, and is a lot of fun to use!

Storm Real-time Processing Cookbook offers basic to advanced recipes for real-time computation with Storm.

The book begins with setting up the development environment and then teaches log stream processing. It then covers a real-time payments workflow, distributed RPC, integration with other software such as Hadoop and Apache Camel, and more.

Setting up system environments


Automating all your environments is critical, but the first step is to understand how to establish a stable environment from scratch for your particular solution. Once this process has been validated, it is easy to automate and add to the pipeline. In this recipe, we will manually establish a complete environment for Jenkins.

Getting ready

We will use the Random Forest topology from Chapter 7, Real-time Machine Learning, as the project that will be managed by our Jenkins CI. In order to build and deploy this project, there are a number of things that must be installed into our CI:

How to do it…

  1. Connect to your Ubuntu instance using vagrant ssh. Then issue the following commands to remove the existing OpenJDK packages and install the Oracle JDK:

    sudo apt-get purge openjdk-\*
    sudo mkdir...
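
Before continuing with the JDK installation, it can be useful to confirm that the purge actually removed every OpenJDK package. The following is a minimal sketch and not part of the original recipe: count_openjdk is a hypothetical helper name, and it assumes the package listing on stdin follows the usual dpkg -l layout, where installed packages are flagged with ii in the first column and the package name is in the second.

```shell
#!/bin/sh
# Hypothetical helper (an assumption, not from the book): counts the
# installed OpenJDK packages found in `dpkg -l`-style output read from stdin.
count_openjdk() {
    # Only rows whose status column is "ii" are fully installed packages;
    # the package name is the second whitespace-separated field.
    awk '$1 == "ii" && $2 ~ /^openjdk-/ { n++ } END { print n + 0 }'
}

# Usage on the CI instance (assumes a Debian/Ubuntu system with dpkg):
#   dpkg -l | count_openjdk
```

If the helper prints a number other than 0 after the purge, some OpenJDK packages are still installed and the purge command should be checked before the Oracle JDK is installed.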