Book Image

Data Science with SQL Server Quick Start Guide

By : Dejan Sarka
Book Image

Data Science with SQL Server Quick Start Guide

By: Dejan Sarka

Overview of this book

SQL Server only started to fully support data science with its two most recent editions. If you are a professional from both worlds, SQL Server and data science, and interested in using SQL Server and Machine Learning (ML) Services for your projects, then this is the ideal book for you. This book is the ideal introduction to data science with Microsoft SQL Server and In-Database ML Services. It covers all stages of a data science project, from businessand data understanding,through data overview, data preparation, modeling and using algorithms, model evaluation, and deployment. You will learn to use the engines and languages that come with SQL Server, including ML Services with R and Python languages and Transact-SQL. You will also learn how to choose which algorithm to use for which task, and learn the working of each algorithm.
Table of Contents (15 chapters)
Title Page
Copyright and Credits
Packt Upsell
Contributors
Preface
Index

Chapter 6. Intermediate Statistics and Graphs

In Chapter 4, Data Overview, we analyzed a single variable. We finished the chapter by showing possible associations between pairs of variables graphically. In this chapter, I will briefly explain the statistics behind, and then develop the code to measure possible associations between, two variables. I will also include more graphical examples.

A very important concept in statistics is the null hypothesis. This is where you start your analysis from; you suppose that there is no association between two variables. An example of a question could be Is the commute distance to work associated with occupation? The null hypothesis here is there is no association between commute distance and occupation. With statistical analysis, you try either to prove or to reject the null hypothesis. However, you can never be 100% sure of the outcome; therefore, you prove or reject the null hypothesis with some probability.

In this chapter, we will deal with the following...