PostgreSQL High Performance Cookbook

By: Chitij Chauhan, Dinesh Kumar


Grouping


In this recipe, we will discuss the optimizer node types that can be chosen for the GROUP BY operation.

Getting ready

As we discussed in the previous recipe on aggregate operations, grouping is performed based on the list of group key columns. The PostgreSQL optimizer chooses a hash aggregate when it estimates that enough memory is available; otherwise, it falls back to a group aggregate. Unlike a hash aggregate, a group aggregate needs its input data to be sorted. If the group columns are already covered by a sorted index, the group aggregate may be chosen over the hash aggregate, as it reduces memory usage.
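
As a quick illustration of this behavior (a sketch only, assuming you are connected to the benchmarksql database used in these recipes; the plan actually chosen still depends on your data, statistics, and PostgreSQL version), you can lower or raise work_mem for the current session and re-run EXPLAIN to see which aggregate strategy the planner picks:

     benchmarksql=# SET work_mem = '64kB';    -- smallest allowed value; a hash table is unlikely to fit
     benchmarksql=# EXPLAIN SELECT COUNT(*), c_city FROM
               bmsql_customer GROUP BY c_city;
     benchmarksql=# SET work_mem = '64MB';    -- roomier setting; a hash aggregate becomes viable
     benchmarksql=# EXPLAIN SELECT COUNT(*), c_city FROM
               bmsql_customer GROUP BY c_city;
     benchmarksql=# RESET work_mem;           -- restore the session to the server default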

How to do it…

  1. To demonstrate the group aggregate, let's run the following query in the benchmarksql database to get the count of customers grouped by their city:

     benchmarksql=# EXPLAIN SELECT COUNT(*), c_city FROM
               bmsql_customer GROUP BY c_city;
                                                QUERY PLAN                                             
    ------------------------------------...
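
Following on from the sorted-index note in the Getting ready section, giving the planner a pre-sorted path on the grouping column can make a group aggregate attractive without an explicit sort step. The statements below are only an illustrative sketch (the index name is ours, not part of the recipe), and building the index on a large table costs time and disk space:

     benchmarksql=# CREATE INDEX bmsql_customer_c_city_idx
               ON bmsql_customer (c_city);
     benchmarksql=# EXPLAIN SELECT COUNT(*), c_city FROM
               bmsql_customer GROUP BY c_city;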