The plyrmr
package provides common operations (as found in plyr
or reshape2
) for users to easily perform data manipulation through the MapReduce framework. In this recipe, we will look at how to install plyrmr
on the Hadoop system.
Ensure that you have completed the previous recipe by starting the Cloudera QuickStart VM and connecting the VM to the internet. Also, you need to have the rmr2
package installed beforehand.
Perform the following steps to install plyrmr
on the Hadoop system:
- First, you should install
libxml2-devel
andcurl-devel
in the Linux shell:
$ yum install libxml2-devel$ sudo yum install curl-devel
- You can then access R and install the dependent packages:
$ sudo R> Install.packages(c(" Rcurl", "httr"), dependencies = TRUE> Install.packages("devtools", dependencies = TRUE)> library(devtools)> install_github("pryr", "hadley")> install.packages(c(" R.methodsS3", "hydroPSO"), dependencies = TRUE)> q()
- Next, you...