In this section, we give a concise overview of the R packages available for handling massive data and illustrate some of them.
A popular approach to dealing with bigger datasets is to use SQL, a separate language for querying databases. Learning another language may not be difficult, but because our focus is on R, it is worth noting that the R community has developed specialized packages for working with large datasets from within R. These contributed packages provide interfaces between R and relational database management systems (RDBMS) such as MySQL (RMySQL), PostgreSQL (RPgSQL), and Oracle (ROracle). To get the full benefit of these packages, we have to install the corresponding third-party database software. One of the most popular of these packages is RMySQL, which allows us to make connections between R and a MySQL server.
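As a rough illustration of such a connection, the following minimal sketch uses the DBI interface together with the RMySQL driver. It assumes that both packages are installed and that a MySQL server is reachable. The host, database name, user name, environment variable, and table name are hypothetical placeholders, and the helper functions `open_mysql()`, `build_head_query()`, and `fetch_head()` are introduced here only for illustration.

```r
# Sketch only: all connection details below are hypothetical placeholders.

# Open a connection to a MySQL server through the RMySQL driver.
open_mysql <- function(dbname, user, host = "localhost",
                       password = Sys.getenv("MYSQL_PASSWORD")) {
  DBI::dbConnect(RMySQL::MySQL(),
                 dbname = dbname, user = user,
                 host = host, password = password)
}

# Build a query that returns only the first n rows of a table.
# The table name is pasted in as-is, so pass trusted names only.
build_head_query <- function(table, n = 10) {
  sprintf("SELECT * FROM %s LIMIT %d", table, as.integer(n))
}

# Run the query on the server and return the result as a data frame,
# so only the rows we ask for are loaded into R's memory.
fetch_head <- function(con, table, n = 10) {
  DBI::dbGetQuery(con, build_head_query(table, n))
}

# Example usage (requires a running MySQL server):
# con <- open_mysql(dbname = "mydb", user = "r_user")
# head_rows <- fetch_head(con, "mytable", n = 10)
# DBI::dbDisconnect(con)
```

The point of the design is that the filtering happens in the database: R receives only the query result rather than the full table.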
MySQL, a mid-size, multi-platform RDBMS, is a popular...