Book Image

Time Series Indexing

By : Mihalis Tsoukalos
Book Image

Time Series Indexing

By: Mihalis Tsoukalos

Overview of this book

Time series are everywhere, ranging from financial data and system metrics to weather stations and medical records. Being able to access, search, and compare time series data quickly is essential, and this comprehensive guide enables you to do just that by helping you explore SAX representation and the most effective time series index, iSAX. The book begins by teaching you about the implementation of SAX representation in Python as well as the iSAX index, along with the required theory sourced from academic research papers. The chapters are filled with figures and plots to help you follow the presented topics and understand key concepts easily. But what makes this book really great is that it contains the right amount of knowledge about time series indexing using the right amount of theory and practice so that you can work with time series and develop time series indexes successfully. Additionally, the presented code can be easily ported to any other modern programming language, such as Swift, Java, C, C++, Ruby, Kotlin, Go, Rust, and JavaScript. By the end of this book, you'll have learned how to harness the power of iSAX and SAX representation to efficiently index and analyze time series data and will be equipped to develop your own time series indexes and effectively work with time series data.
Table of Contents (11 chapters)

Exercises

Try to do the following exercises:

  • Create a new Anaconda environment.
  • List the installed packages of an Anaconda environment.
  • Delete an existing Anaconda environment.
  • Create a new synthetic dataset with 1,000 values from -10 to +10.
  • Create a new synthetic dataset with 100,000 values from 0 to +10.
  • Write a Python script that reads a plain text file line by line.
  • Write a Python script that reads a plain text file and prints it word by word. Why is this more difficult than printing a file line by line?
  • Write a Python script that reads the same plain text file multiple times, and time that operation. The number of times the file is read as well as the file path should be given as command-line arguments.
  • Modify synthetic_data.py to generate integer values instead of floating-point values.
  • Create a time series with 500,000 elements with synthetic_data.py, and execute matrix_profile.py on the generated time series. Do not forget to compress...