Chapter 2: Loading and Exploring Datasets

Book Overview & Buying
Table Of Contents

Data Wrangling with R

By : Gustavo Santos

4.9 (7)

Buy this Book

Data Wrangling with R

4.9 (7)

By: Gustavo Santos

Buy this Book

Overview of this book

In this information era, where large volumes of data are being generated every day, companies want to get a better grip on it to perform more efficiently than before. This is where skillful data analysts and data scientists come into play, wrangling and exploring data to generate valuable business insights. In order to do that, you’ll need plenty of tools that enable you to extract the most useful knowledge from data. Data Wrangling with R will help you to gain a deep understanding of ways to wrangle and prepare datasets for exploration, analysis, and modeling. This data book enables you to get your data ready for more optimized analyses, develop your first data model, and perform effective data visualization. The book begins by teaching you how to load and explore datasets. Then, you’ll get to grips with the modern concepts and tools of data wrangling. As data wrangling and visualization are intrinsically connected, you’ll go over best practices to plot data and extract insights from it. The chapters are designed in a way to help you learn all about modeling, as you will go through the construction of a data science project from end to end, and become familiar with the built-in RStudio, including an application built with Shiny dashboards. By the end of this book, you’ll have learned how to create your first data model and build an application with Shiny in R.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Conventions used

Get in touch

Share Your Thoughts

Download a free PDF copy of this book

Part 1: Load and Explore Data

Free Chapter

Chapter 1: Fundamentals of Data Wrangling

What is data wrangling?

Why data wrangling?

The key steps of data wrangling

Summary

Exercises

Further reading

Chapter 2: Loading and Exploring Datasets

Technical requirements

How to load files to RStudio

Tibbles versus Data Frames

Saving files

A workflow for data exploration

Basic Web Scraping

Summary

Exercises

Further reading

Chapter 3: Basic Data Visualization

Technical requirements

Data visualization

Creating single-variable plots

Creating two-variable plots

Working with multiple variables

Summary

Exercises

Further reading

Part 2: Data Wrangling

Chapter 4: Working with Strings

Introduction to stringr

Working with regular expressions

Creating frequency data summaries in R

Text mining

Factors

Summary

Exercises

Further reading

Chapter 5: Working with Numbers

Technical requirements

Numbers in vectors, matrices, and data frames

Math operations with variables

Descriptive statistics

Summary

Exercises

Further reading

Chapter 6: Working with Date and Time Objects

Technical requirements

Introduction to date and time

Date and time with lubridate

Date and time using regular expressions (regexps)

Practicing

Summary

Exercises

Further reading

Chapter 7: Transformations with Base R

Technical requirements

The dataset

Slicing and filtering

Grouping and summarizing

Replacing and filling

Arranging

Creating new variables

Binding

Using data.table

Summary

Exercises

Further reading

Chapter 8: Transformations with Tidyverse Libraries

Technical requirements

What is tidy data

Slicing and filtering

Grouping and summarizing data

Replacing and filling data

Arranging data

Creating new variables

Joining datasets

Reshaping a table

Do more with tidyverse

Summary

Exercises

Further reading

Chapter 9: Exploratory Data Analysis

Technical requirements

Loading the dataset to RStudio

Understanding the data

Treating missing data

Exploring and visualizing the data

Analysis report

Summary

Exercises

Further reading

Part 3: Data Visualization

Chapter 10: Introduction to ggplot2

Technical requirements

The grammar of graphics

The basic syntax of ggplot2

Plot types

Summary

Exercises

Further reading

Chapter 11: Enhanced Visualizations with ggplot2

Technical requirements

Facet grids

Map plots

Time series plots

3D plots

Adding interactivity to graphics

Summary

Exercises

Further reading

Chapter 12: Other Data Visualization Options

Technical requirements

Plotting graphics in Microsoft Power BI using R

Preparing data for plotting

Creating word clouds in RStudio

Summary

Exercises

Further reading

Part 4: Modeling

Chapter 13: Building a Model with R

Technical requirements

Machine learning concepts

Understanding the project

Preparing data for modeling in R

Exploring the data with a few visualizations

Selecting the best variables

Modeling

Summary

Exercises

Further reading

Chapter 14: Build an Application with Shiny in R

Technical requirements

Learning the basics of Shiny

Creating an application

Deploying the application on the web

Summary

Exercises

Further reading

Conclusion

References

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Download a free PDF copy of this book

Data Wrangling with R

By : Gustavo Santos

Data Wrangling with R

By: Gustavo Santos

Overview of this book

Summary

Confirmation

Buy this book with your credits?

Submit Your Feedback

Create a Free Account To Continue Reading

Sign in to activate your 7-day free access