Book Image

Hands-On Data Visualization with Bokeh

By : Kevin Jolly
Book Image

Hands-On Data Visualization with Bokeh

By: Kevin Jolly

Overview of this book

Adding a layer of interactivity to your plots and converting these plots into applications hold immense value in the field of data science. The standard approach to adding interactivity would be to use paid software such as Tableau, but the Bokeh package in Python offers users a way to create both interactive and visually aesthetic plots for free. This book gets you up to speed with Bokeh - a popular Python library for interactive data visualization. The book starts out by helping you understand how Bokeh works internally and how you can set up and install the package in your local machine. You then use a real world data set which uses stock data from Kaggle to create interactive and visually stunning plots. You will also learn how to leverage Bokeh using some advanced concepts such as plotting with spatial and geo data. Finally you will use all the concepts that you have learned in the previous chapters to create your very own Bokeh application from scratch. By the end of the book you will be able to create your very own Bokeh application. You will have gone through a step by step process that starts with understanding what Bokeh actually is and ends with building your very own Bokeh application filled with interactive and visually aesthetic plots.
Table of Contents (10 chapters)

Creating plots using pandas DataFrames

Most of the data that you will work with will be available in the CSV or Excel formats and thus you will inevitably convert them into a pandas DataFrame in order to work with them effectively. Bokeh extends its functionality to help us build interactive yet meaningful plots using a pandas DataFrame in Python. In this section, we will construct scatter plots and time series plots using a pandas DataFrame.

For this section, we will be using a popular dataset about the stock market found on Kaggle that can be accessed via this link: Kaggle S&P 500 stock data (https://www.kaggle.com/camnugent/sandp500/data).

As a first step, let's load the dataset into Jupyter Notebook. We can do this using the code shown here:

#Importing the required packages

import pandas as pd

#Read in the data

df = pd.read_csv('all_stocks_5yr.csv')

#Filtering...