Book Image

R Data Visualization Recipes

By : Vitor Bianchi Lanzetta
Book Image

R Data Visualization Recipes

By: Vitor Bianchi Lanzetta

Overview of this book

R is an open source language for data analysis and graphics that allows users to load various packages for effective and better data interpretation. Its popularity has soared in recent years because of its powerful capabilities when it comes to turning different kinds of data into intuitive visualization solutions. This book is an update to our earlier R data visualization cookbook with 100 percent fresh content and covering all the cutting edge R data visualization tools. This book is packed with practical recipes, designed to provide you with all the guidance needed to get to grips with data visualization using R. It starts off with the basics of ggplot2, ggvis, and plotly visualization packages, along with an introduction to creating maps and customizing them, before progressively taking you through various ggplot2 extensions, such as ggforce, ggrepel, and gganimate. Using real-world datasets, you will analyze and visualize your data as histograms, bar graphs, and scatterplots, and customize your plots with various themes and coloring options. The book also covers advanced visualization aspects such as creating interactive dashboards using Shiny By the end of the book, you will be equipped with key techniques to create impressive data visualizations with professional efficiency and precision.
Table of Contents (19 chapters)
Title Page
Credits
About the Author
About the Reviewer
www.PacktPub.com
Customer Feedback
Preface

Crafting faceted histograms


The previous recipe had a discrete variable displayed by the x axis. In order to turn ourselves toward a single continuous variable, one option is to draw histograms. Facets are also available for histograms and are a good way to investigate how a variable is distributed across some categories.

For this recipe, we will dig into how the hourly wages are distributed among men and women, married and single. Data comes from the 1976 Current Population Survey. Data was collected by Henry Farber and Justin M. Shea in 1988. Both categories are represented by binaries so this recipe brings a new way to label the facets. The requirements department lies ahead.

Getting ready

In order to make it happen, we need the wooldridge::wage1 data frame, which is another way to say, "wooldridge package has to be installed". Following code takes care of that:

> if( !require(wooldridge)){ install.packages('wooldridge')}

As is usual for this chapter, the packages ggplot2 and plotly are...