Book Image

Storm Blueprints: Patterns for Distributed Real-time Computation

Book Image

Storm Blueprints: Patterns for Distributed Real-time Computation

Overview of this book

Table of Contents (17 chapters)
Storm Blueprints: Patterns for Distributed Real-time Computation
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Preface
Index

Examining our use case


To better understand both the Trident topologies, as well as using Storm with sensor data, we will implement a Trident topology that collects medical reports to identify the outbreak of a disease.

The topology will process diagnosis events that contain the following pieces of information:

Latitude

Longitude

Timestamp

Diagnosis Code (ICD9-CM)

39.9522

-75.1642

03/13/2013 at 3:30 PM

320.0 (Hemophilus meningitis)

40.3588

-75.6269

03/13/2013 at 3:50 PM

324.0 (Intracranial abscess)

Each event will include the Global Positioning System (GPS) coordinates of the occurrence. The latitude and longitude are specified in the decimal format. The event also contains the ICD9-CM code, which indicates the diagnosis and a timestamp for the event. A complete list of ICD-9-CM codes are available at:

http://www.icd9data.com/ .

To detect an outbreak, the system will count the occurrences of specific disease codes within a geographic location over a specified period of time. To...