Book Image

RStudio for R Statistical Computing Cookbook

By : Andrea Cirillo
Book Image

RStudio for R Statistical Computing Cookbook

By: Andrea Cirillo

Overview of this book

The requirement of handling complex datasets, performing unprecedented statistical analysis, and providing real-time visualizations to businesses has concerned statisticians and analysts across the globe. RStudio is a useful and powerful tool for statistical analysis that harnesses the power of R for computational statistics, visualization, and data science, in an integrated development environment. This book is a collection of recipes that will help you learn and understand RStudio features so that you can effectively perform statistical analysis and reporting, code editing, and R development. The first few chapters will teach you how to set up your own data analysis project in RStudio, acquire data from different data sources, and manipulate and clean data for analysis and visualization purposes. You'll get hands-on with various data visualization methods using ggplot2, and you will create interactive and multidimensional visualizations with D3.js. Additional recipes will help you optimize your code; implement various statistical models to manage large datasets; perform text analysis and predictive analysis; and master time series analysis, machine learning, forecasting; and so on. In the final few chapters, you'll learn how to create reports from your analytical application with the full range of static and dynamic reporting tools that are available in RStudio so that you can effectively communicate results and even transform them into interactive web applications.
Table of Contents (15 chapters)
RStudio for R Statistical Computing Cookbook
Credits
About the Author
About the Reviewer
www.PacktPub.com
Preface
Index

Showing communities in a network with the linkcomm package


The linkcomm package is an R package developed with the main aim of letting you discover and study communities that exist within your network. These communities are discovered by applying an algorithm derived from the paper Link communities reveal multiscale complexity in networks by Ahn Y.Y., Bagrow J.P., and Lehmann.

Getting ready

In order to use linkcomm functionalities, we first need to install and load the linkcomm package:

install.packages("linkcomm")
library(linkcomm)

As a sample dataset, we will use the lesmiserables hedge list, provided in the linkcomm package. This dataset basically shows relations hips between characters in Victor Hugo's novel Les Misérables.

You can get a sense of the dataset by running str() on it:

> str(lesmiserables)
'data.frame':  254 obs. of  2 variables:
 $ V1: Factor w/ 73 levels "Anzelma","Babet",..: 61 49 55 55 21 33 12 23 20 62 ...
$ V2: Factor w/ 49 levels "Babet","Bahorel",..: 42 42 42 36 42...