Book Image

Modern R Programming Cookbook

By : Jaynal Abedin
Book Image

Modern R Programming Cookbook

By: Jaynal Abedin

Overview of this book

R is a powerful tool for statistics, graphics, and statistical programming. It is used by tens of thousands of people daily to perform serious statistical analyses. It is a free, open source system whose implementation is the collective accomplishment of many intelligent, hard-working people. There are more than 2,000 available add-ons, and R is a serious rival to all commercial statistical packages. The objective of this book is to show how to work with different programming aspects of R. The emerging R developers and data science could have very good programming knowledge but might have limited understanding about R syntax and semantics. Our book will be a platform develop practical solution out of real world problem in scalable fashion and with very good understanding. You will work with various versions of R libraries that are essential for scalable data science solutions. You will learn to work with Input / Output issues when working with relatively larger dataset. At the end of this book readers will also learn how to work with databases from within R and also what and how meta programming helps in developing applications.
Table of Contents (10 chapters)

Importing unstructured text data from a plain text file

In some cases, it could happen that your source text data has been stored in a plain text (.txt) file. In this type of situation, if you want to do any kind of text analytics, you have to import plain text data into the R environment. In this recipe, you will import plain text data from a .txt file and store it into an object of class text.

Getting ready

Suppose you have stored a text file containing a newspaper article or several abstracts related to a particular topic. In this example, you will use a text file that contains 10 abstracts retrieved from PubMed by doing a literature search with the keyword term "Deep Learning". The filename is deapLearning.txt...