Time Series Analysis with Python Cookbook

By: Tarek A. Atwan

Overview of this book

Time series data is everywhere, available at a high frequency and volume. It is complex and can contain noise, irregularities, and multiple patterns, so it is crucial to be well versed in the techniques this book covers for data preparation, analysis, and forecasting. The book starts with ingesting time series data from various sources and formats, whether in private cloud storage, relational databases, non-relational databases, or specialized time series databases such as InfluxDB. Next, you'll learn strategies for handling missing data, dealing with time zones and custom business days, and detecting anomalies using intuitive statistical methods, followed by more advanced unsupervised ML models. The book also explores forecasting using classical statistical models such as Holt-Winters, SARIMA, and VAR. The recipes present practical techniques for handling non-stationary data, using power transforms, ACF and PACF plots, and decomposing time series data with multiple seasonal patterns. Later, you'll work with ML and DL models using TensorFlow and PyTorch. Finally, you'll learn how to evaluate, compare, and optimize models using the recipes covered in the book.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Chapter 1: Getting Started with Time Series Analysis

Technical requirements

Development environment setup

Installing Python libraries

Installing JupyterLab and JupyterLab extensions

Chapter 2: Reading Time Series Data from Files

Technical requirements

Reading data from CSVs and other delimited files

Reading data from an Excel file

Reading data from URLs

Reading data from a SAS dataset

Chapter 3: Reading Time Series Data from Databases

Technical requirements

Reading data from a relational database

Reading data from Snowflake

Reading data from a document database (MongoDB)

Reading third-party financial data using APIs

Reading data from a time series database (InfluxDB)

Chapter 4: Persisting Time Series Data to Files

Technical requirements

Serializing time series data with pickle

Writing to CSV and other delimited files

Writing data to an Excel file

Storing data to S3

Chapter 5: Persisting Time Series Data to Databases

Technical requirements

Writing time series data to a relational database (PostgreSQL and MySQL)

Writing time series data to MongoDB

Writing time series data to InfluxDB

Writing time series data to Snowflake

Chapter 6: Working with Date and Time in Python

Technical requirements

Working with DatetimeIndex

Providing a format argument to DateTime

Working with Unix epoch timestamps

Working with time deltas

Converting DateTime with time zone information

Working with date offsets

Working with custom business days

Chapter 7: Handling Missing Data

Technical requirements

Understanding missing data

Performing data quality checks

Handling missing data with univariate imputation using pandas

Handling missing data with univariate imputation using scikit-learn

Handling missing data with multivariate imputation

Handling missing data with interpolation

Chapter 8: Outlier Detection Using Statistical Methods

Technical requirements

Understanding outliers

Resampling time series data

Detecting outliers using visualizations

Detecting outliers using the Tukey method

Detecting outliers using a z-score

Detecting outliers using a modified z-score

Chapter 9: Exploratory Data Analysis and Diagnosis

Technical requirements

Plotting time series data using pandas

Plotting time series data with interactive visualizations using hvPlot

Decomposing time series data

Detecting time series stationarity

Applying power transformations

Testing for autocorrelation in time series data

Chapter 10: Building Univariate Time Series Models Using Statistical Methods

Technical requirements

Plotting ACF and PACF

Forecasting univariate time series data with exponential smoothing

Forecasting univariate time series data with non-seasonal ARIMA

Forecasting univariate time series data with seasonal ARIMA

Chapter 11: Additional Statistical Modeling Techniques for Time Series

Technical requirements

Forecasting time series data using auto_arima

Forecasting time series data using Facebook Prophet

Forecasting multivariate time series data using VAR

Evaluating vector autoregressive (VAR) models

Forecasting volatility in financial time series data with GARCH

Chapter 12: Forecasting Using Supervised Machine Learning

Technical requirements

Understanding supervised machine learning

Preparing time series data for supervised learning

One-step forecasting using linear regression models with scikit-learn

Multi-step forecasting using linear regression models with scikit-learn

Forecasting using non-linear models with sktime

Optimizing a forecasting model with hyperparameter tuning

Forecasting with exogenous variables and ensemble learning

Chapter 13: Deep Learning for Time Series Forecasting

Technical requirements

Understanding artificial neural networks

Forecasting with an RNN using Keras

Forecasting with LSTM using Keras

Forecasting with a GRU using Keras

Forecasting with an RNN using PyTorch

Forecasting with LSTM using PyTorch

Forecasting with a GRU using PyTorch

Chapter 14: Outlier Detection Using Unsupervised Machine Learning

Technical requirements

Detecting outliers using KNN

Detecting outliers using LOF

Detecting outliers using iForest

Detecting outliers using One-Class Support Vector Machine (OCSVM)

Detecting outliers using COPOD

Detecting outliers with PyCaret

Chapter 15: Advanced Techniques for Complex Time Series

Technical requirements

Understanding state-space models

Decomposing time series with multiple seasonal patterns using MSTL

Forecasting with multiple seasonal patterns using the Unobserved Components Model (UCM)

Forecasting time series with multiple seasonal patterns using Prophet

Forecasting time series with multiple seasonal patterns using NeuralProphet

Index

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

To get the most out of this book

You should be comfortable coding in Python, with some familiarity with Matplotlib, NumPy, and pandas; a working knowledge of the language will help you follow the key concepts covered in this book. The book covers a wide variety of libraries, and the first chapter shows you how to create different virtual environments for Python development. It is recommended, but not required, to install Anaconda, Miniconda, or Miniforge. Throughout the chapters, you will see installation instructions using either pip or Conda.

Alternatively, you can use Colab, and all you need is a browser.

Software/hardware covered in the book	Operating system requirements
Python 3.8/3.9+	Windows, macOS, or Linux
JupyterLab or the Jupyter Notebook	Windows, macOS, or Linux
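
Before starting, it can help to confirm that your environment matches the requirements above. The following is a minimal sketch, not part of the book's code: the function name check_environment and the constants MIN_PYTHON and PREREQUISITES are hypothetical, and the minimum version of 3.8 is an assumption taken from the "Python 3.8/3.9+" row of the table. It uses only the standard library, so it runs even if the prerequisite libraries are not yet installed.

```python
import importlib.util
import sys

# Assumed minimum, taken from the "Python 3.8/3.9+" row of the table.
MIN_PYTHON = (3, 8)
# Libraries the text says you should already be familiar with.
PREREQUISITES = ("matplotlib", "numpy", "pandas")


def check_environment(min_python=MIN_PYTHON, packages=PREREQUISITES):
    """Report whether the interpreter and prerequisite packages are available."""
    return {
        "python_version": tuple(sys.version_info[:3]),
        "python_ok": sys.version_info[:2] >= min_python,
        # find_spec returns None when a top-level package is not installed.
        "packages": {
            name: importlib.util.find_spec(name) is not None
            for name in packages
        },
    }


if __name__ == "__main__":
    report = check_environment()
    version = ".".join(str(part) for part in report["python_version"])
    print(f"Python {version}:", "OK" if report["python_ok"] else "too old")
    for name, installed in report["packages"].items():
        print(f"{name}:", "installed" if installed else "missing")
```

Any package reported as missing can then be installed with pip or Conda inside the virtual environment you are using.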

In Chapter 3, Reading Time Series Data from Databases, and Chapter 5, Persisting Time Series Data to Databases, you will be working with different databases, including PostgreSQL, MySQL, InfluxDB, and MongoDB. If you do not have access to such databases, you can install them locally on your machine, or use Docker and pull the appropriate image from Docker Hub (https://hub.docker.com) with docker pull; for example, docker pull influxdb downloads the InfluxDB image. You can download Docker from the official page here: https://docs.docker.com/get-docker/.

Alternatively, you can explore hosted services such as Aiven (https://aiven.io), which offers a 30-day trial and supports PostgreSQL, MySQL, and InfluxDB. For the recipes using AWS Redshift and Snowflake, you will need a subscription. You can sign up for the AWS free tier here: https://aws.amazon.com/free. You can sign up for a 30-day Snowflake trial here: https://signup.snowflake.com.

Similarly, in Chapter 2, Reading Time Series Data from Files, and Chapter 4, Persisting Time Series Data to Files, you will learn how to read data from and write data to AWS S3 buckets. This requires an AWS service subscription, and the usage should be covered under the free tier. For a list of all services covered under the free tier, you can visit the official page here: https://aws.amazon.com/free.

If you are using the digital version of this book, we advise you to type the code yourself or access the code from the book's GitHub repository (a link is available in the next section). Doing so will help you avoid any potential errors related to the copying and pasting of code.

To get the most value out of this book, it is important that you keep experimenting with the recipes using different time series data. Throughout the recipes, you will see a recurring theme in which multiple time series datasets are used. This is done deliberately so that you can observe how the results vary across different data. You are encouraged to continue with that theme on your own.

If you are looking for datasets beyond those provided in the GitHub repository, you can check out the following links:

https://ourworldindata.org
https://www.kaggle.com/datasets?search=time+series
https://github.com/numenta/NAB (specific to anomaly and outlier detection)
https://fred.stlouisfed.org
https://datasetsearch.research.google.com
