Introducing Microsoft SQL Server 2019

By : Kellyn Gorman, Allan Hirt, Dave Noderer, Mitchell Pearson, James Rowland-Jones, Dustin Ryan, Arun Sirpal, Buck Woody

Introducing Microsoft SQL Server 2019

By: Kellyn Gorman, Allan Hirt, Dave Noderer, Mitchell Pearson, James Rowland-Jones, Dustin Ryan, Arun Sirpal, Buck Woody

Overview of this book

Microsoft SQL Server comes equipped with industry-leading features and the best online transaction processing capabilities. If you are looking to work with data processing and management, getting up to speed with Microsoft Server 2019 is key. Introducing SQL Server 2019 takes you through the latest features in SQL Server 2019 and their importance. You will learn to unlock faster querying speeds and understand how to leverage the new and improved security features to build robust data management solutions. Further chapters will assist you with integrating, managing, and analyzing all data, including relational, NoSQL, and unstructured big data using SQL Server 2019. Dedicated sections in the book will also demonstrate how you can use SQL Server 2019 to leverage data processing platforms, such as Apache Hadoop and Spark, and containerization technologies like Docker and Kubernetes to control your data and efficiently monitor it. By the end of this book, you'll be well versed with all the features of Microsoft SQL Server 2019 and understand how to use them confidently to build robust data management solutions.

Preface

About Microsoft SQL Server 2019

1. Optimizing for performance, scalability and real‑time insights

Hybrid transactional and analytical processing (HTAP)

Clustered Columnstore Indexes

Disk-based tables versus memory-optimized tables

In-memory OLTP

Planning data migration to memory-optimized tables

Natively compiled stored procedures

TempDB enhancements

Intelligent Query Processing

Hybrid Buffer Pool

Query Store

Automatic tuning

Lightweight query profiling

Columnstore stats in DBCC CLONEDATABASE

Estimate compression for Columnstore Indexes

Troubleshooting page resource waits

Free Chapter

2. Enterprise Security

SQL Data Discovery and Classification

SQL Vulnerability Assessment

Always Encrypted

Confidential computing with secure enclaves

3. High Availability and Disaster Recovery

SQL Server availability feature overview

What About Database Mirroring and Replication?

Availability improvements in SQL Server 2019

Windows Server 2019 availability enhancements

4. Hybrid Features – SQL Server and Microsoft Azure

Backup to URL

The storage account

SQL Server data files in Azure

File-snapshot backups

Extending on-premises Availability Groups to Azure

Replication to Azure SQL Database

Transactional replication

5. SQL Server 2019 on Linux

2019 platform support

Why move databases to SQL Server on Linux?

Machine Learning Services on Linux

Kubernetes

Working with Docker and Linux

Change data capture

Hybrid Buffer Pool and PMEM

Distributed Transaction Coordinator on Linux

Replication

SQL Server tools

Command-line query tools for SQL in Linux

Enhanced focus on scripting

The SQL DBA in the Linux world

Windows Subsystem for Linux

6. SQL Server 2019 in Containers and Kubernetes

Why containers matter

Container technical fundamentals

Deploying an SQL Server container using Docker

Customizing SQL Server containers

Availability for SQL Server containers

7. Data Virtualization

Data integration challenges

Introducing data virtualization

Data virtualization use cases

Contrasting data virtualization and data movement

Data virtualization in SQL Server 2019

Secure data access

External data sources

External file formats

PolyBase external tables

Installing PolyBase in SQL Server 2019

Installing PolyBase as a scale-out group

Bringing it all together: your first data virtualization query

8. Machine Learning Services Extensibility Framework

Machine learning overview

SQL Server 2019 Machine Learning Services architecture and components

Machine learning using the Machine Learning Services extensibility framework

Java and machine learning in SQL Server

Machine learning using the PREDICT T-SQL command

Machine learning using the sp_rxPredict stored procedure

Libraries and packages for machine learning

Management

Using the team data science process with Machine Learning Services

9. SQL Server 2019 Big Data Clusters

Big data overview

Applying scale-out architectures to SQL Server

SQL Server 2019 big data cluster components

Installation and configuration

Programming SQL Server 2019 big data clusters

Management and monitoring

Security

10. Enhancing the Developer Experience

SQL Graph Database

Java language extensions

JSON

UTF-8 support

Temporal tables

Spatial data types

11. Data Warehousing

Extract-transform-load solutions with SQL Server Integration Services

Clustered Columnstore Indexes

Partitioning

Online index management

Creating and maintaining statistics

Statistics for columnstore indexes

Modern data warehouse patterns in Azure

Introduction to Azure SQL Data Warehouse

Best practices for working with Azure SQL Data Warehouse

Using Azure Data Factory

Copying data to Azure SQL Data Warehouse

Hosting SSIS packages in ADF

Azure Data Lake Storage

Azure Databricks

Working with streaming data in Azure Stream Analytics

Analyzing data by using Power BI – and introduction to Power BI

12. Analysis Services

Introduction to tabular models

Introduction to multidimensional models

Enhancements in tabular mode

Introducing DAX

Writing DAX queries

Using variables in DAX

Introduction to Azure Analysis Services

13. Power BI Report Server

SSRS versus Power BI Report Server

Report content types

Migrating existing paginated reports to Power BI Report Server

Exploring new capabilities

Managing parameter layouts

Developing KPIs

Publishing reports

Managing report access and security

Publishing mobile reports

Viewing reports in modern browsers

Viewing reports on mobile devices

Exploring Power BI reports

Automating report delivery with subscriptions

Pinning report items to the Power BI service

14. Modernization to the Azure Cloud

The SQL data platform in Azure

Deployment of a managed instance in Azure

Migrating SQL Server to Managed Instance

Creating an Azure VM from the Azure portal

Storage options for VMs

Diagnostics and advanced options

Creating a SQL Server 2019 VM from the command line in Azure

Security for SQL Server on an Azure VM

SQL Server IaaS agent extension

Disaster Recovery environment in the cloud

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Clustered Columnstore Indexes

Clustered Columnstore indexes can make a dramatic difference and are the technology used to optimize real-time analytics. They can achieve an order of magnitude performance gain over a normal row table, a dramatic compression of the data, and minimize interference with real-time transaction processing.

A columnstore has rows and columns, but the data is stored in a column format.

A rowgroup is a set of rows that are compressed into a columnstore format — a maximum of a million rows (1,048,576).

There are an optimum number of rows in a rowgroup that are stored column-wise, and this represents a trade-off between large overhead, if there are too few rows, and an inability to perform in-memory operations if the rows are too big.

Each row consists of column segments, each of which represents a column from the compressed row.

Columnstore is illustrated in Figure 1.1, showing how to load data into a non-clustered columnstore index:

Figure 1.1: Loading data into a non-clustered columnstore index

A clustered columnstore index is how the columnstore table segments are stored in physical media. For performance reasons, and to avoid fragmenting the data, the columnstore index may store some data in a deltastore and a list of the IDs of deleted rows. All deltastore operations are handled by the system and not visible directly to the user. Deltastore and columnstore data is combined when queried.

A delta rowgroup is used to store columnstore indexes until there are enough to store in the columnstore. Once the maximum number of rows is reached, the delta rowgroup is closed, and a background process detects, compresses, and writes the delta rowgroup into the columnstore.

There may be more than one delta rowgroup. All delta rowgroups are described as the deltastore. While loading data, anything less than 102,400 rows will be kept in the deltastore until they group to the maximum size and are written to the columnstore.

Batch mode execution is used during a query to process multiple rows at once.

Loading a clustered columnstore index and the deltastore are shown in Figure 1.2.

Figure 1.2: Loading a clustered columnstore index

Further information can be found here: https://docs.microsoft.com/en-us/sql/relational-databases/indexes/get-started-with-columnstore-for-real-time-operational-analytics?view=sql-server-2017.

Adding Clustered Columnstore Indexes to memory-optimized tables

When using a memory-optimized table, add a non-clustered columnstore index. A clustered columnstore index is especially useful for running analytics on a transactional table.

A clustered columnstore index can be added to an existing memory-optimized table, as shown in the following code snippet:

-- Add a clustered columnstore index to a memory-optimized table
ALTER TABLE MyMemOpttable 
ADD INDEX MyMemOpt_ColIndex clustered columnstore

Introducing Microsoft SQL Server 2019

By : Kellyn Gorman, Allan Hirt, Dave Noderer, Mitchell Pearson, James Rowland-Jones, Dustin Ryan, Arun Sirpal, Buck Woody

Introducing Microsoft SQL Server 2019

By: Kellyn Gorman, Allan Hirt, Dave Noderer, Mitchell Pearson, James Rowland-Jones, Dustin Ryan, Arun Sirpal, Buck Woody

Overview of this book

Related Content you might be interested in

Current Title:

Introducing Microsoft SQL Server 2019

SQL Server on Azure Virtual Machines

SQL Server 2019 Administrator's Guide

SQL Server 2017 Administrator's Guide

Clustered Columnstore Indexes

Figure 1.1: Loading data into a non-clustered columnstore index

Figure 1.2: Loading a clustered columnstore index

Adding Clustered Columnstore Indexes to memory-optimized tables