Book Image

Building Python Real-Time Applications with Storm

By : Kartik Bhatnagar, Barry Hart
Book Image

Building Python Real-Time Applications with Storm

By: Kartik Bhatnagar, Barry Hart

Overview of this book

Big data is a trending concept that everyone wants to learn about. With its ability to process all kinds of data in real time, Storm is an important addition to your big data “bag of tricks.” At the same time, Python is one of the fastest-growing programming languages today. It has become a top choice for both data science and everyday application development. Together, Storm and Python enable you to build and deploy real-time big data applications quickly and easily. You will begin with some basic command tutorials to set up storm and learn about its configurations in detail. You will then go through the requirement scenarios to create a Storm cluster. Next, you’ll be provided with an overview of Petrel, followed by an example of Twitter topology and persistence using Redis and MongoDB. Finally, you will build a production-quality Storm topology using development best practices.
Table of Contents (14 chapters)

Running the topology – the Redis case

We have a few more small things to address before we run the topology:

  1. Copy the logconfig.ini file from the second example in Chapter 3, Introducing Petrel, to this topology's directory.

  2. Create a file called Petrel will package this script with the topology and run it at startup. This script installs the third-party Python libraries used by the topology. The file looks like this:

    pip install -U pip
    pip install nltk==3.0.1 oauthlib==0.7.2
  3. Create a file called manifest.txt with these two lines:

  4. Install the Redis server on a well-known node. All workers will store state here:

             sudo apt-get install redis-server
  5. Install the Python Redis client on all Storm worker machines:

             sudo apt-get install python-redis
  6. Before running the topology, let's review the list of files that we've created. Make sure you have created these files correctly:

    • topology.yaml




    • totalrankings...