Book Image

Data Science with SQL Server Quick Start Guide

By : Dejan Sarka
Book Image

Data Science with SQL Server Quick Start Guide

By: Dejan Sarka

Overview of this book

SQL Server only started to fully support data science with its two most recent editions. If you are a professional from both worlds, SQL Server and data science, and interested in using SQL Server and Machine Learning (ML) Services for your projects, then this is the ideal book for you. This book is the ideal introduction to data science with Microsoft SQL Server and In-Database ML Services. It covers all stages of a data science project, from businessand data understanding,through data overview, data preparation, modeling and using algorithms, model evaluation, and deployment. You will learn to use the engines and languages that come with SQL Server, including ML Services with R and Python languages and Transact-SQL. You will also learn how to choose which algorithm to use for which task, and learn the working of each algorithm.
Table of Contents (15 chapters)
Title Page
Copyright and Credits
Packt Upsell
Contributors
Preface
Index

Discovering associations between continuous and discrete variables


The last possibility left for discovering and measuring the strength of associations is dependencies between continuous and discrete variables. Let me start by an example. In the dataset I use, the dbo.vTargetMail view from the AdventureWorksDW2017 demo database, I have the variables that show the occupation and the income of each person. You would expect that there is some association between these two variables—some occupations have higher mean and median income, some lower. However, there could be a surprise hidden in the data. Imagine that somebody would mark their occupation as skilled manual for an excellent basketball player, for an NBA star. By comparing the mean income over occupation, you could wrongly conclude that you need to go for a skilled manual job, to have the highest possible income. But the difference in mean income between skilled manual and other occupation comes in this case from the variability within...