CHAPTER 11
Data Mining Models in SQL
Data mining is the process of finding meaningful patterns in large quantities of data. Traditionally, the subject is introduced through statistics and statistical modeling. This chapter takes an alternative approach that introduces data mining concepts using databases. This perspective presents the key concepts while sidestepping the rigor of theoretical statistics, focusing instead on the most important practical aspect: data.
The next two chapters extend the discussion that this chapter begins. Chapter 12 covers linear regression, a more traditional starting point for modeling and data mining. Chapter 13 focuses on data preparation, often the most challenging part of a data mining endeavor.
Earlier chapters have already shown some powerful techniques implemented using SQL. Snobs may feel that data mining is more advanced than mere SQL queries. This sentiment downplays the importance of data manipulation, which lies at the heart of even the most...