Book Image

R Data Visualization Cookbook

Book Image

R Data Visualization Cookbook

Overview of this book

Table of Contents (17 chapters)
R Data Visualization Cookbook
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Constructing a word cloud from a document


In the previous recipe, we studied a quick and easy way to generate a word cloud. In this recipe, we will learn how to create a word cloud using an entire document, such as a transcript of the complete inaugural speech by President Obama. We will also learn how to process the text and structure it using the text mining package.

Getting ready

To generate a word cloud and structure our data, we will use the following packages:

  • wordcloud

  • tm

How to do it…

We will start this recipe by installing and loading the required packages in R using the install.packages() and library() functions:

install.packages(c("wordcloud","tm"))
library(tm)
library(wordcloud)

The readLines() function allows us to read the file in R that contains our text. The obama.txt file should be saved in our current R directory:

file = readLines("obama.txt")

The text in our file is not well structured. The text file consists of punctuation, numbers, and stop words, which need to be cleaned...