Book Image

Mastering Social Media Mining with R

Book Image

Mastering Social Media Mining with R

Overview of this book

With an increase in the number of users on the web, the content generated has increased substantially, bringing in the need to gain insights into the untapped gold mine that is social media data. For computational statistics, R has an advantage over other languages in providing readily-available data extraction and transformation packages, making it easier to carry out your ETL tasks. Along with this, its data visualization packages help users get a better understanding of the underlying data distributions while its range of "standard" statistical packages simplify analysis of the data. This book will teach you how powerful business cases are solved by applying machine learning techniques on social media data. You will learn about important and recent developments in the field of social media, along with a few advanced topics such as Open Authorization (OAuth). Through practical examples, you will access data from R using APIs of various social media sites such as Twitter, Facebook, Instagram, GitHub, Foursquare, LinkedIn, Blogger, and other networks. We will provide you with detailed explanations on the implementation of various use cases using R programming. With this handy guide, you will be ready to embark on your journey as an independent social media analyst.
Table of Contents (13 chapters)
Mastering Social Media Mining with R
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Preface
Index

GitHub package installation and authentication


We will use GitHub API Version 3 for accessing the data. All the API access that happens is over HTTPS and it is accessed from the domain api.github.com or yourdomain.com/api/v3 for the enterprise accounts. All the data received will be in the JSON format.

Now that you know how to create a GitHub app, we will see how to proceed further towards accessing the data from R. In order to connect to the GitHub app, we need to install the rgithub package and other dependent packages like devtools. The devtools package is required so that we can install the latest rgithub package directly from the source.

require(devtools)
install_github("cscheid/rgithub")
library(github)

After the installation and loading the required packages in R, we can use the client ID and the client secret that were generated to authenticate the access. In the following example, we are first passing the client ID and the client secret to a variable and then using the function interactive...