Python Data Visualization Cookbook (Second Edition)
Overview of this book

Python Data Visualization Cookbook takes the reader from installing and setting up a Python environment for data manipulation and visualization all the way to 3D animations using Python libraries. Readers will benefit from over 60 precise and reproducible recipes that build an understanding of data concepts and the building blocks for later, sometimes more advanced, topics. The book starts by showing how to set up matplotlib and the related libraries required for most of the book, before moving on to lesser-used diagrams and charts such as Gantt charts and Sankey diagrams. It progresses from simple plots and charts to more advanced ones so the material stays easy to follow, and later chapters cover 3D diagrams and animations. Maps are irreplaceable for displaying geospatial data, so the book also shows how to build them. The last chapter explains how to incorporate matplotlib into different environments, such as the LaTeX writing system, and how to create Gantt charts using Python.

Generating controlled random datasets


In this recipe, we will show different ways of generating random number sequences and word sequences. Some of the examples use standard Python modules, and others use NumPy/SciPy functions.
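
As a quick sketch of the two approaches (illustrative code under arbitrary choices of seed, lengths, and alphabet, not the book's recipe listing), the following generates a random number sequence and a random word sequence with the standard library, and then equivalent number sequences with NumPy:

import random
import string

import numpy as np

# Standard library: fix the seed so the output is reproducible
random.seed(42)
uniform_floats = [random.random() for _ in range(10)]          # ten floats in [0, 1)
words = ["".join(random.choice(string.ascii_lowercase) for _ in range(5))
         for _ in range(3)]                                    # three 5-letter "words"

# NumPy: the same idea, vectorized
np.random.seed(42)
uniform_array = np.random.uniform(0.0, 1.0, size=10)           # ten floats in [0, 1)
normal_array = np.random.normal(loc=0.0, scale=1.0, size=10)   # ten Gaussian samples

print(uniform_floats)
print(words)
print(uniform_array)
print(normal_array)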

We will go through some statistics terminology but will explain every term, so you don't have to have a statistical reference book with you while reading this recipe.

We generate artificial datasets using common Python modules. By doing so, we can explore distributions, variance, sampling, and similar statistical concepts. More importantly, we can use this fake data to check whether our statistical method is capable of recovering the model we are looking for: because we know the model in advance, we can verify the method by applying it to data generated from that model. In real life, we don't have that luxury, and there is always a degree of uncertainty that we must accept, which leaves room for error.
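
A minimal sketch of this idea follows; the normal distribution, its parameters, and the sample size are arbitrary choices for illustration, not values taken from the book:

import numpy as np

np.random.seed(0)                        # reproducible "artificial" dataset

true_mean, true_std = 5.0, 2.0           # the model we know in advance
sample = np.random.normal(loc=true_mean, scale=true_std, size=10000)

# Apply the "statistical method" (here, simple sample estimators) to the fake
# data and check that it recovers the parameters we built into the model.
estimated_mean = sample.mean()
estimated_std = sample.std(ddof=1)       # unbiased sample standard deviation

print("true mean %.2f, estimated mean %.2f" % (true_mean, estimated_mean))
print("true std  %.2f, estimated std  %.2f" % (true_std, estimated_std))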

Getting ready

We don't need anything new...