Learning R Programming

By: Kun Ren

Overview of this book

R is a high-level functional language and one of the must-know tools for data science and statistics. Powerful but complex, R can be challenging for beginners and those unfamiliar with its unique behaviors. Learning R Programming is the solution: an easy and practical way to learn R and develop a broad and consistent understanding of the language. Through hands-on examples, you'll discover powerful R tools and best practices that will give you a deeper understanding of working with data. You'll get to grips with R's data structures and data processing techniques, as well as the most popular R packages to boost your productivity from the outset. Start with the basics of R, then dive deep into the programming techniques and paradigms to make your R code excel. Advance quickly to a deeper understanding of R's behavior as you learn common tasks including data analysis, databases, web scraping, high-performance computing, and writing documents. By the end of the book, you'll be a confident R programmer adept at solving problems with the right techniques.

Analysing HTML code and extracting data

In the previous sections, we learned the basics of HTML, CSS, and XPath. To scrape real-world web pages, the problem now becomes a question of writing the proper CSS or XPath selectors. In this section, we introduce some simple ways to figure out working selectors.

Suppose we want to scrape all available R packages at https://cran.rstudio.com/web/packages/available_packages_by_name.html. The web page looks simple. To figure out the selector expression, right-click on the table and select Inspect Element in the context menu, which should be available in most modern web browsers.

The inspector panel then opens and shows the underlying HTML of the web page. In Firefox and Chrome, the selected node is highlighted so it can be located more easily.

The HTML contains a single <table> element, so we can use the selector table to select it directly and then call html_table() to extract it as a data frame:

page <- read_html("https://cran.rstudio.com/web/packages/available_packages_by_name...
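The same technique can be tried without depending on the live CRAN page. The following is a minimal sketch, assuming the rvest package is installed. The inline HTML string and the package names in it (pkgA, pkgB) are hypothetical stand-ins, not the real CRAN listing. It shows the table being selected once with a CSS selector and once with an equivalent XPath expression, then converted with html_table():

```r
library(rvest)

# Hypothetical miniature page standing in for the CRAN listing;
# it contains exactly one <table>, like the page described above.
html <- '
<html><body>
<table>
  <tr><th>Package</th><th>Title</th></tr>
  <tr><td>pkgA</td><td>First example package</td></tr>
  <tr><td>pkgB</td><td>Second example package</td></tr>
</table>
</body></html>'

page <- read_html(html)

# Select the table with a CSS selector and convert it to a data frame
pkg_table <- html_table(html_node(page, "table"))

# The same node can be reached with an XPath expression
pkg_table_xpath <- html_table(html_node(page, xpath = "//table"))

print(pkg_table)
```

Because the page has only one table, the bare selector table is enough. On a page with several tables, a more specific selector found through the inspector (for example, one based on a class or id attribute) would likely be needed.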