Gated recurrent unit (GRU)
In 2014, Cho et al. proposed another RNN variant with a much simpler structure than the LSTM, called the gated recurrent unit (GRU). The intuition is similar: a set of gates regulates the information flowing through the unit, but a GRU eliminates the separate long-term memory component and uses just the hidden state to propagate information. So, instead of the memory cell serving as the gradient highway, the hidden state itself becomes the “gradient highway.” Keeping the same notation convention we used in the previous section, let’s look at the updated equations for a GRU.
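As a rough sketch of this mechanism before we get to the equations, here is one GRU step in NumPy, assuming the standard Cho et al. formulation. The parameter names (`Wz`, `Uz`, and so on) are illustrative, not taken from the text:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_cell(x_t, h_prev, params):
    """One GRU time step. `params` maps illustrative names to weights/biases."""
    Wz, Uz, bz = params["Wz"], params["Uz"], params["bz"]
    Wr, Ur, br = params["Wr"], params["Ur"], params["br"]
    Wh, Uh, bh = params["Wh"], params["Uh"], params["bh"]

    z = sigmoid(Wz @ x_t + Uz @ h_prev + bz)              # update gate
    r = sigmoid(Wr @ x_t + Ur @ h_prev + br)              # reset gate
    h_tilde = np.tanh(Wh @ x_t + Uh @ (r * h_prev) + bh)  # candidate state
    # Interpolate between the old hidden state and the candidate:
    # no separate cell state, so h itself carries the gradient.
    return (1 - z) * h_prev + z * h_tilde
```

Note that the final line blends the previous hidden state with the candidate state directly, which is exactly how the hidden state replaces the LSTM's cell state as the path for long-range information.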
While we had three gates in an LSTM, we only have two in a GRU: