Book Image

Codeless Time Series Analysis with KNIME

By : KNIME AG, Corey Weisinger, Maarit Widmann, Daniele Tonini
Book Image

Codeless Time Series Analysis with KNIME

By: KNIME AG, Corey Weisinger, Maarit Widmann, Daniele Tonini

Overview of this book

This book will take you on a practical journey, teaching you how to implement solutions for many use cases involving time series analysis techniques. This learning journey is organized in a crescendo of difficulty, starting from the easiest yet effective techniques applied to weather forecasting, then introducing ARIMA and its variations, moving on to machine learning for audio signal classification, training deep learning architectures to predict glucose levels and electrical energy demand, and ending with an approach to anomaly detection in IoT. There’s no time series analysis book without a solution for stock price predictions and you’ll find this use case at the end of the book, together with a few more demand prediction use cases that rely on the integration of KNIME Analytics Platform and other external tools. By the end of this time series book, you’ll have learned about popular time series analysis techniques and algorithms, KNIME Analytics Platform, its time series extension, and how to apply both to common use cases.
Table of Contents (20 chapters)
1
Part 1: Time Series Basics and KNIME Analytics Platform
7
Part 2: Building and Deploying a Forecasting Model
14
Part 3: Forecasting on Mixed Platforms

Building the deployment application

In this section, we introduce a deployment application that predicts the hourly trip counts on one day. We selected to predict the trip counts on July 1, 2018. We perform both static and dynamic deployment as introduced in the following subsections:

  1. Predicting the trip count in the next hour (static deployment)
  2. Predicting the trip count in the next 24 hours (dynamic deployment)

Predicting the trip count in the next hour

The goal of this subsection is to predict the demand for taxis in the first hour on July 1, 2018, so at 00:00.

The required seed data for the prediction contains the 24 past values representing the hourly trip counts on June 30, 2018, and in addition, the hour of the day (0) and the day of the week (1 since July 1, 2008, was a Sunday). We access the seed data already pre-processed as a Parquet file. The workflow containing the static deployment (accessible via https://kni.me/w/wl2B4B5LNvj0Z4K9) is shown in...