Book Image

Codeless Time Series Analysis with KNIME

By : KNIME AG, Corey Weisinger, Maarit Widmann, Daniele Tonini
Book Image

Codeless Time Series Analysis with KNIME

By: KNIME AG, Corey Weisinger, Maarit Widmann, Daniele Tonini

Overview of this book

This book will take you on a practical journey, teaching you how to implement solutions for many use cases involving time series analysis techniques. This learning journey is organized in a crescendo of difficulty, starting from the easiest yet effective techniques applied to weather forecasting, then introducing ARIMA and its variations, moving on to machine learning for audio signal classification, training deep learning architectures to predict glucose levels and electrical energy demand, and ending with an approach to anomaly detection in IoT. There’s no time series analysis book without a solution for stock price predictions and you’ll find this use case at the end of the book, together with a few more demand prediction use cases that rely on the integration of KNIME Analytics Platform and other external tools. By the end of this time series book, you’ll have learned about popular time series analysis techniques and algorithms, KNIME Analytics Platform, its time series extension, and how to apply both to common use cases.
Table of Contents (20 chapters)
1
Part 1: Time Series Basics and KNIME Analytics Platform
7
Part 2: Building and Deploying a Forecasting Model
14
Part 3: Forecasting on Mixed Platforms

Training an H2O model from within KNIME

In this section, we will introduce the forecasting part of the application. We'll show how to train, optimize, evaluate, and deploy a stock price prediction model on H2O.

We selected the linear regression model as the stock price prediction model, which predicts the adjusted closing price based on the prices on the previous days. We can train the model as an H2O generalized linear model with Gaussian as the distribution family and Identity as the link function. This means we assume that the prices are approximately normally distributed and linearly related to the combination of the previous day's prices, as indicated by the following regression equation:

Formula 14.1

Where is the target variable, is the vector of the previous day's prices, is the vector of regression coefficients, and is the normally distributed residual term.

If we assumed a non-linear relationship between the target and predictor...