Book Image

Learn PostgreSQL

By : Luca Ferrari, Enrico Pirozzi

Book Image

Learn PostgreSQL

By: Luca Ferrari, Enrico Pirozzi

Overview of this book

PostgreSQL is one of the fastest-growing open source object-relational database management systems (DBMS) in the world. As well as being easy to use, it’s scalable and highly efficient. In this book, you’ll explore PostgreSQL 12 and 13 and learn how to build database solutions using it. Complete with hands-on tutorials, this guide will teach you how to achieve the right database design required for a reliable environment. You'll learn how to install and configure a PostgreSQL server and even manage users and connections. The book then progresses to key concepts of relational databases, before taking you through the Data Definition Language (DDL) and commonly used DDL commands. To build on your skills, you’ll understand how to interact with the live cluster, create database objects, and use tools to connect to the live cluster. You’ll then get to grips with creating tables, building indexes, and designing your database schema. Later, you'll explore the Data Manipulation Language (DML) and server-side programming capabilities of PostgreSQL using PL/pgSQL, before learning how to monitor, test, and troubleshoot your database application to ensure high-performance and reliability. By the end of this book, you'll be well-versed with the Postgres database and be able to set up your own PostgreSQL instance and use it to build robust solutions.

Preface

Who this book is for

What this book covers

To get the most out of this book

Section 1: Getting Started

Section 1: Getting Started

Free Chapter

Introduction to PostgreSQL

Introduction to PostgreSQL

Technical requirements

PostgreSQL at a glance

Exploring PostgreSQL terminology

Installing PostgreSQL 12 or higher

Getting to Know Your Cluster

Getting to Know Your Cluster

Technical requirements

Managing your cluster

Connecting to the cluster

Exploring the disk layout of PGDATA

Exploring configuration files and parameters

Managing Users and Connections

Managing Users and Connections

Introduction to users and groups

Managing incoming connections at the role level

Section 2: Interacting with the Database

Section 2: Interacting with the Database

Basic Statements

Basic Statements

Technical requirements

Setting up our developing environment

Creating and managing databases

Managing tables

Understanding basic table manipulation statements

Advanced Statements

Advanced Statements

Exploring the SELECT statement

Window Functions

Window Functions

Using basic statement window functions

Using advanced statement window functions

Server-Side Programming

Server-Side Programming

Exploring data types

Exploring functions and languages

Triggers and Rules

Triggers and Rules

Exploring rules in PostgreSQL

Managing triggers in PostgreSQL

Partitioning

Exploring partitioning using inheritance

Exploring declarative partitioning

Section 3: Administering the Cluster

Section 3: Administering the Cluster

Users, Roles, and Database Security

Users, Roles, and Database Security

Understanding roles

Access control lists

Granting and revoking permissions

Row-level security

Role password encryption

SSL connections

Transactions, MVCC, WALs, and Checkpoints

Transactions, MVCC, WALs, and Checkpoints

Technical requirements

Introducing transactions

Transaction isolation levels

Explaining MVCC

How PostgreSQL handles persistency and consistency: WALs

Extending the Database - the Extension Ecosystem

Extending the Database - the Extension Ecosystem

Introducing extensions

Managing extensions

Exploring the PGXN client

Installing extensions

Creating your own extension

Indexes and Performance Optimization

Indexes and Performance Optimization

Technical requirements

Execution of a statement

The EXPLAIN statement

An example of query tuning

ANALYZE and how to update statistics

Logging and Auditing

Logging and Auditing

Technical requirements

Introduction to logging

Extracting information from logs – PgBadger

Implementing auditing

Backup and Restore

Backup and Restore

Technical requirements

Introducing various types of backups and restores

Exploring logical backups

Exploring physical backups

Further reading

Configuration and Monitoring

Configuration and Monitoring

Technical requirements

Cluster configuration

Monitoring the cluster

Advanced statistics with pg_stat_statements

Further Reading

Section 4: Replication

Section 4: Replication

Physical Replication

Physical Replication

Exploring basic concepts

Learning WAL archiving and PITR

Managing streaming replication

Logical Replication

Logical Replication

Understanding basic concepts

Exploring logical replication setup

Section 5: The PostegreSQL Ecosystem

Section 5: The PostegreSQL Ecosystem

Useful Tools and Extensions

Useful Tools and Extensions

Exploring the pg_trgm extension

Using foreign data wrappers and the postgres_fdw extension

Exploring the btree_gin extension

Managing the pgbackrest tool

Toward PostgreSQL 13

Toward PostgreSQL 13

Introducing PostgreSQL 13's new features

Upgrading to PostgreSQL 13

Other Books You May Enjoy

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

VACUUM

In the previous sections, you have learned how PostgreSQL exploits MVCC to store different versions of the same data (tuples) that different transactions can perceive depending on their active snapshot. However, keeping different versions of the same tuples requires extra space with regard to the last active version, and this space could fill your storage sooner or later. To prevent that, and reclaim storage space, PostgreSQL provides an internal tool named vacuum, the aim of which is to analyze stored tuple versions and remove the ones that are no longer perceivable.

Remember: a tuple is not perceivable when there are no more active transactions that can reference the version, which means having the tuple version within their snapshot.

Vacuum can be an I/O-intensive operation since it must reclaim no more used disk space, and therefore can be an invasive operation. For that reason, you are not supposed to run vacuum very frequently and PostgreSQL also provides a background job...