Sign In Start Free Trial
Account

Add to playlist

Create a Playlist

Modal Close icon
You need to login to use this feature.
  • Book Overview & Buying Business Intelligence with Databricks SQL
  • Table Of Contents Toc
Business Intelligence with Databricks SQL

Business Intelligence with Databricks SQL

By : Vihag Gupta
4.6 (13)
close
close
Business Intelligence with Databricks SQL

Business Intelligence with Databricks SQL

4.6 (13)
By: Vihag Gupta

Overview of this book

In this new era of data platform system design, data lakes and data warehouses are giving way to the lakehouse – a new type of data platform system that aims to unify all data analytics into a single platform. Databricks, with its Databricks SQL product suite, is the hottest lakehouse platform out there, harnessing the power of Apache Spark™, Delta Lake, and other innovations to enable data warehousing capabilities on the lakehouse with data lake economics. This book is a comprehensive hands-on guide that helps you explore all the advanced features, use cases, and technology components of Databricks SQL. You’ll start with the lakehouse architecture fundamentals and understand how Databricks SQL fits into it. The book then shows you how to use the platform, from exploring data, executing queries, building reports, and using dashboards through to learning the administrative aspects of the lakehouse – data security, governance, and management of the computational power of the lakehouse. You’ll also delve into the core technology enablers of Databricks SQL – Delta Lake and Photon. Finally, you’ll get hands-on with advanced SQL commands for ingesting data and maintaining the lakehouse. By the end of this book, you’ll have mastered Databricks SQL and be able to deploy and deliver fast, scalable business intelligence on the lakehouse.
Table of Contents (21 chapters)
close
close
1
Part 1: Databricks SQL on the Lakehouse
9
Part 2: Internals of Databricks SQL
13
Part 3: Databricks SQL Commands
16
Part 4: TPC-DS, Experiments, and Frequently Asked Questions

The Photon Engine

In this chapter, we will turn our attention back to SQL Warehouses. This time, however, we will focus on the query engine running on SQL Warehouses. The query engine is known as Photon Engine. We will begin by learning about Photon Engine and its place in the Apache Spark framework. Going ahead, we will understand the core engineering philosophy of Photon Engine. Finally, we will go through its limitations and the roadmap to overcome them.

I do want to highlight that you don’t need to learn the details of Photon Engine to work with Databricks SQL. This chapter is intended for those who are interested in how Databricks SQL achieves record-beating query performances on the Lakehouse setup with open source storage formats.

In this chapter, we will cover the following topics:

  • Understanding Photon Engine
  • Understanding vectorization
  • Discussing the Photon product roadmap
CONTINUE READING
83
Tech Concepts
36
Programming languages
73
Tech Tools
Icon Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.
Icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Icon 50+ new titles added per month and exclusive early access to books as they are being written.
Business Intelligence with Databricks SQL
notes
bookmark Notes and Bookmarks search Search in title playlist Add to playlist download Download options font-size Font size

Change the font size

margin-width Margin width

Change margin width

day-mode Day/Sepia/Night Modes

Change background colour

Close icon Search
Country selected

Close icon Your notes and bookmarks

Confirmation

Modal Close icon
claim successful

Buy this book with your credits?

Modal Close icon
Are you sure you want to buy this book with one of your credits?
Close
YES, BUY

Submit Your Feedback

Modal Close icon
Modal Close icon
Modal Close icon