Book Image

R for Data Science

By : Dan Toomey
Book Image

R for Data Science

By: Dan Toomey

Overview of this book

Table of Contents (19 chapters)

Chapter 2. Data Mining Sequences

Data mining is frequently used to detect sequences or patterns in data. In this chapter, we are looking for the data to follow a pattern where one event or series of events predicts another data point in a consistent manner.

This chapter describes the different ways to find patterns in your dataset:

  • Patterns to look for

  • Find patterns in data

  • Constraints

We can find patterns in many large datasets. This can range across a number of areas, such as population mix changes, frequency of cell phone use, deterioration of highways, accidents due to age, and so on. It really feels like there are many patterns and sequences just waiting to be discovered.

We can find these patterns using a number of tools in R programming. Most patterns are limited in their extent by constraints, such as time over which the sequence will be meaningful.