Book Image

Practical Data Analysis - Second Edition

By : Hector Cuesta, Dr. Sampath Kumar
Book Image

Practical Data Analysis - Second Edition

By: Hector Cuesta, Dr. Sampath Kumar

Overview of this book

Beyond buzzwords like Big Data or Data Science, there are a great opportunities to innovate in many businesses using data analysis to get data-driven products. Data analysis involves asking many questions about data in order to discover insights and generate value for a product or a service. This book explains the basic data algorithms without the theoretical jargon, and you’ll get hands-on turning data into insights using machine learning techniques. We will perform data-driven innovation processing for several types of data such as text, Images, social network graphs, documents, and time series, showing you how to implement large data processing with MongoDB and Apache Spark.
Table of Contents (21 chapters)
Practical Data Analysis - Second Edition
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Preface

Introduction to epidemiology


We can define epidemiology as the study of the determinants and a distribution of health-related states. We will study how a pathogen, such as the common flu or the influenza AH1N1, is spread within a population. This is particularly important because an outbreak can cause severe human and economic losses, as with the Spanish flu of 1918, which killed 40 million people globally. Take a look at the following screenshot:

We can use the Center for Disease Control (CDC) data, which is freely available from their website. With these time series, we can perform statistical methods for descriptive epidemiology or causal inference. The CDC data is obtained using typical surveys and medical reports, providing real results.

We can use the dashboard for CDC Flu Trends and its data, which is freely available from the following link:

http://gis.cdc.gov/grasp/fluview/fluportaldashboard.html

Seasonal influenza (flu) data can be found at http://www.cdc.gov/flu/.

The epidemiology...