Book Image

R: Mining spatial, text, web, and social media data

By : Nathan H. Danneman, Richard Heimann, Pradeepta Mishra, Bater Makhabel
Book Image

R: Mining spatial, text, web, and social media data

By: Nathan H. Danneman, Richard Heimann, Pradeepta Mishra, Bater Makhabel

Overview of this book

Data mining is the first step to understanding data and making sense of heaps of data. Properly mined data forms the basis of all data analysis and computing performed on it. This learning path will take you from the very basics of data mining to advanced data mining techniques, and will end up with a specialized branch of data mining—social media mining. You will learn how to manipulate data with R using code snippets and how to mine frequent patterns, association, and correlation while working with R programs. You will discover how to write code for various predication models, stream data, and time-series data. You will also be introduced to solutions written in R based on R Hadoop projects. Now that you are comfortable with data mining with R, you will move on to implementing your knowledge with the help of end-to-end data mining projects. You will learn how to apply different mining concepts to various statistical and data applications in a wide range of fields. At this stage, you will be able to complete complex data mining cases and handle any issues you might encounter during projects. After this, you will gain hands-on experience of generating insights from social media data. You will get detailed instructions on how to obtain, process, and analyze a variety of socially-generated data while providing a theoretical background to accurately interpret your findings. You will be shown R code and examples of data that can be used as a springboard as you get the chance to undertake your own analyses of business, social, or political data. This Learning Path combines some of the best that Packt has to offer in one complete, curated package. It includes content from the following Packt products: ? Learning Data Mining with R by Bater Makhabel ? R Data Mining Blueprints by Pradeepta Mishra ? Social Media Mining with R by Nathan Danneman and Richard Heimann
Table of Contents (6 chapters)

Appendix A. Algorithms and Data Structures

Here is a list of algorithms related to association rules mining; it is only a small portion of the available algorithms, but it has proved to be effective:

Approach

Dataset

Sequential pattern mining

Sequential rule mining

Frequent itemset mining

Association rule mining

Apriori

Transaction

  

Yes

 

AprioriTid

Transaction

  

Yes

 

DHP (Direct Hashing and Pruning)

Transaction

  

Yes

 

FDM (Fast Distributed Mining of association rules)

Transaction

  

Yes

 

GSP (Generalized Sequential Patterns)

Sequence

Yes

   

DIC

Transaction

  

Yes

 

Pincer Search (the Pincer-search algorithm)

Transaction

  

Yes

 

CARMA (Continuous Association Rule Mining Algorithm)

Transaction

  

Yes

 

CHARM (Closed Association Rule Mining)

Transaction

  

Yes (closed)

 

Depth-project

Transaction

  

Yes (maximal)

 

Eclat

Transaction

  

Yes

 

SPAD

Sequence

Yes

   

SPAM

Sequence

Yes

   

Diffset

Transaction

  

Yes

 

FP-growth

Transaction

  

Yes

FP-growth

DSM-FI (Data Stream Mining for Frequent Itemsets)

Transaction

  

Yes

 

PRICES

Transaction

  

Yes

 

PrefixSpan

Sequence

Yes

   

Sporadic Rules

Transaction

   

Yes

IGB

Transaction

   

Yes

GenMax

Transaction

  

Yes (maximal)

 

FPMax (Frequent Maximal Item Set)

Transaction

  

Yes

 

FHARM (Fuzzy Healthy Association Rule Mining)

Transaction

  

Yes

 

H-Mine

Transaction

  

Yes

 

FHSAR

Transaction

   

Yes

Reverse Apriori

Transaction

  

Yes (maximal)

 

DTFIM

Transaction

  

Yes

 

GIT tree

Transaction

  

Yes

 

Scaling Apriori

Transaction

  

Yes

 

CMRules

Sequence

 

Yes

  

Minimum effort

Transaction

  

Yes (maximal)

 

TopSeqRules

Sequence

 

Yes

  

FPG ARM

Transaction

  

Yes

 

TNR

Transaction

   

Yes

ClaSP

Sequence

Yes (closed)