Book Image

Practical Data Analysis - Second Edition

By : Hector Cuesta, Dr. Sampath Kumar
Book Image

Practical Data Analysis - Second Edition

By: Hector Cuesta, Dr. Sampath Kumar

Overview of this book

Beyond buzzwords like Big Data or Data Science, there are a great opportunities to innovate in many businesses using data analysis to get data-driven products. Data analysis involves asking many questions about data in order to discover insights and generate value for a product or a service. This book explains the basic data algorithms without the theoretical jargon, and you’ll get hands-on turning data into insights using machine learning techniques. We will perform data-driven innovation processing for several types of data such as text, Images, social network graphs, documents, and time series, showing you how to implement large data processing with MongoDB and Apache Spark.
Table of Contents (21 chapters)
Practical Data Analysis - Second Edition
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Preface

Acquiring the Facebook graph


In Facebook, friends are represented by nodes and the relationship between friends is represented by links, but we can get a lot more information from it, such as gender, age, post list, likes, political affiliation, religion, and so on, and Facebook provides us with a complete HTTP-based API (Application Programming Interface) to work with its data. Follow this link for more information:

https://developers.facebook.com/

Another interesting option is the Stanford Large Network Dataset Collection, where we can find social network datasets well-formatted and anonymized for educational proposes. Follow this link for more information:

http://snap.stanford.edu/data/

Tip

Using anonymized data, it is possible to determine whether two users have the same affiliations, but not what their individual affiliations represent.

In this chapter, we will use a Facebook graph with 1,274 friends and 43,000 relationships between them. This will help us understand how friends from...