SQL Server 2017 Machine Learning Services with R

By: Julie Koesmarno, Tomaž Kaštrun

Overview of this book

R Services was one of the most anticipated features in SQL Server 2016; it was improved significantly and rebranded as Machine Learning Services in SQL Server 2017. Prior to SQL Server 2016, many developers and data scientists were already using R to connect to SQL Server for additional data analysis, superseding SSAS Data Mining or additional CLR programming functions, but they did so in siloed environments that left a lot to be desired. With R integrated within SQL Server 2017, these developers and data scientists can now benefit from an integrated, efficient, and more streamlined analytics environment. This book gives you foundational knowledge and insights to help you understand SQL Server 2017 Machine Learning Services with R. First and foremost, the book provides practical examples of how to implement, use, and understand SQL Server and R integration in corporate environments, along with explanations and the underlying motivations. It covers installing Machine Learning Services; maintaining, deploying, and managing code; and monitoring your services. Delving more deeply into predictive modeling and the RevoScaleR package, this book also provides insights into operationalizing code and exploring and visualizing data. To complete the journey, this book covers the new features in SQL Server 2017 and how they are compatible with R, amplifying their combined power.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Introduction to R and SQL Server

Using R prior to SQL Server 2016

Microsoft's commitment to the open source R language

Boosting analytics with SQL Server R integration

Summary

Overview of Microsoft Machine Learning Server and SQL Server

Analytical barriers

The Microsoft Machine Learning R Server platform

The Microsoft Machine Learning R Services architecture

Summary

Managing Machine Learning Services for SQL Server 2017 and R

Minimum requirements

Choosing the edition

Configuring the environment and installing R Tools for Visual Studio (RTVS)

Security

Package information

Summary

Data Exploration and Data Visualization

Understanding SQL and R data types

Data exploration and data munging

Data visualization in R

Integrating R code in reports and visualizations

Summary

RevoScaleR Package

Overcoming R language limitations

Scalable and distributive computational environments

Functions for data preparation

Variable creation and data transformation

Variable creation and recoding

Dataset subsetting

Dataset merging

Functions for descriptive statistics

Functions for statistical tests and sampling

Summary

Predictive Modeling

Data modeling

Advanced predictive algorithms and analytics

Deploying and using predictive solutions

Performing predictions with R Services in the SQL Server database

Summary

Operationalizing R Code

Integrating an existing R model

Fast batch prediction

Integrating the R model for fast batch prediction

Managing roles and permissions for workloads

Tools

Integrating R workloads and prediction operations beyond SQL Server

Summary

Deploying, Managing, and Monitoring Database Solutions containing R Code

Integrating R into the SQL Server Database lifecycle workflow

Prerequisites for this chapter

Using version control

Setting up continuous integration

Setting up continuous delivery

Monitoring the accuracy of the productionized model

Summary

Machine Learning Services with R for DBAs

Gathering relevant data

Exploring and analyzing data

Creating a baseline and workloads, and replaying

Creating predictions with R - disk usage

Summary

R and SQL Server 2016/2017 Features Extended

Built-in JSON capabilities

Accessing external data sources using PolyBase

High performance using ColumnStore and In-Memory OLTP

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Creating predictions with R - disk usage

Predictions involve spotting unplanned and unwanted activities or unusual system behavior, especially when compared to the baseline. Raising a red flag only when behavior departs from that baseline results in fewer false positives.
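The idea of comparing observed behavior to a baseline can be sketched as follows. This is a minimal illustration in Python, not the book's R code; the function name `flag_anomalies`, the threshold `k`, and all sample values are hypothetical and chosen only for demonstration.

```python
from statistics import mean, stdev

def flag_anomalies(baseline, observed, k=3.0):
    """Return (index, value) pairs from `observed` that lie more than
    k baseline standard deviations away from the baseline mean."""
    mu = mean(baseline)
    sigma = stdev(baseline)  # sample standard deviation of the baseline
    return [(i, x) for i, x in enumerate(observed) if abs(x - mu) > k * sigma]

# Hypothetical measurements (for example, MB written per interval).
baseline_mb = [100, 102, 98, 101, 99, 100]
observed_mb = [101, 99, 140, 100]

flagged = flag_anomalies(baseline_mb, observed_mb)
print(flagged)
```

A wider threshold (a larger `k`) raises fewer flags; the right value depends on how noisy the baseline is.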

In addition, disk-size problems come up again and again. To address them, we will demonstrate database growth, store the collected data, and then run predictions against it so that, in the end, we can predict when a DBA can expect disk space problems.
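One simple way to turn collected size measurements into such a prediction is to fit a straight line to the observations and extrapolate to the capacity limit. The sketch below is written in Python rather than the book's R, assumes roughly linear growth, and uses made-up numbers; `fit_line`, `days_until_full`, and `capacity_mb` are hypothetical names introduced here.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def days_until_full(days, used_mb, capacity_mb):
    """Estimate days remaining after the last observation until usage
    reaches capacity_mb. Returns None when usage is not growing."""
    slope, intercept = fit_line(days, used_mb)
    if slope <= 0:
        return None
    day_full = (capacity_mb - intercept) / slope
    return max(0.0, day_full - days[-1])

# Hypothetical daily measurements of used space in MB.
days = [0, 1, 2, 3, 4]
used_mb = [2.0, 2.5, 3.0, 3.5, 4.0]

remaining = days_until_full(days, used_mb, capacity_mb=8.0)
print(remaining)
```

Real growth is rarely this regular, so an estimate like this is best treated as an early warning rather than an exact date.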

To illustrate this scenario, I will create a small 8 MB database with no possibility of growth, along with two tables. One, DataPack_Info_SMALL, will serve as the baseline. The other, DataPack_Info_LARGE, will serve as a so-called everyday log, where everything is stored to capture unexpected cases or undesired behavior.

First, create a database...
