The dataset usually contains a large number of variables that are not relevant for every type of analysis. Working with the entire dataset consumes more memory, and it is recommended that you use only the smaller number of variables for the analysis that is required to achieve the task. Taking the smaller number of variables from the entire dataset is usually known as subsetting, but when the term subset has been used, the user could interpret this in two ways: subset of dataset with a smaller number of variables and also subset by taking fewer rows from the entire dataset. In the dplyr library, these two aspects are covered by the select() and filter() verbs. In this recipe, you will subset a dataset by taking only a handful of variables by using the select() verb.
-
Book Overview & Buying
-
Table Of Contents
-
Feedback & Rating
Modern R Programming Cookbook
By :
Modern R Programming Cookbook
By:
Overview of this book
R is a powerful tool for statistics, graphics, and statistical programming. It is used by tens of thousands of people daily to perform serious statistical analyses. It is a free, open source system whose implementation is the collective accomplishment of many intelligent, hard-working people. There are more than 2,000 available add-ons, and R is a serious rival to all commercial statistical packages. The objective of this book is to show how to work with different programming aspects of R. The emerging R developers and data science could have very good programming knowledge but might have limited understanding about R syntax and semantics. Our book will be a platform develop practical solution out of real world problem in scalable fashion and with very good understanding. You will work with various versions of R libraries that are essential for scalable data science solutions. You will learn to work with Input / Output issues when working with relatively larger dataset. At the end of this book readers will also learn how to work with databases from within R and also what and how meta programming helps in developing applications.
Table of Contents (10 chapters)
Preface
Installing and Configuring R and its Libraries
Data Structures in R
Writing Customized Functions
Conditional and Iterative Operations
R Objects and Classes
Querying, Filtering, and Summarizing
R for Text Processing
R and Databases
Customer Reviews