Getting Started with Haskell Data Analysis

By: James Church

Overview of this book

Every business and organization that collects data can tap into that data to gain insights into how to improve. Haskell is a purely functional, lazy programming language that is well suited to large data analysis problems. This book takes you through the more difficult problems of data analysis in a hands-on manner and helps you get up to speed with the basics of data analysis and its approaches in Haskell. You'll learn about statistical computing, file formats (CSV and SQLite3), descriptive statistics, and charts, and progress to more advanced concepts such as the importance of the normal distribution. While mathematics is a big part of data analysis, we've tried to keep this book simple and approachable so that you can apply what you learn to the real world. By the end of this book, you will have a thorough understanding of data analysis and the different ways of analyzing data, along with a command of the Haskell tools and techniques for effective data analysis.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Descriptive Statistics

The CSV library – working with CSV files

Data range

Data mean and standard deviation

Data median

Data mode

Summary

SQLite3

SQLite3 command line

Working with SQLite3 and Haskell

Slices of data

Working with SQLite3 and descriptive statistics

Summary

Regular Expressions

Dots and pipes

Atom and Atom modifiers

Character classes

Regular expressions in CSV files

SQLite3 and regular expressions

Summary

Visualizations

Line plots of a single variable

Plotting a moving average

Creating publication-ready plots

Feature scaling

Scatter plots

Summary

Kernel Density Estimation

The central limit theorem

Normal distribution

Introducing kernel density estimation

Application of the KDE

Summary

Course Review

Converting CSV variation files into SQLite3

Using SQLite3 SELECT and the DescriptiveStats module for descriptive statistics

Creating compelling visualizations using EasyPlot

Reintroducing kernel density estimation

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Introducing kernel density estimation

Kernel density estimation is a process by which we can estimate the shape of a dataset. After we have computed the shape of a dataset, we can compute the probability that an event will occur.

In this section, we're going to introduce the kernel density estimator. The kernel density estimator requires a kernel function, so we are going to discuss the requirements of a kernel function and how the normal distribution meets those requirements. Finally, we're going to compute the KDE of a set of values. All data has a shape - we could also refer to this as the density - and that shape is not always clear. Once we have estimated the shape of a dataset, we can compute the probability of a particular observation.
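
To make the idea concrete, here is a minimal sketch of a kernel density estimator that uses the standard normal density as its kernel. It is not the book's implementation: the names gaussianKernel and kde, the bandwidth parameter h, and the sample values are all assumptions made for illustration. The sketch uses only the Prelude and assumes a non-empty list of samples and a positive bandwidth.

```haskell
-- A minimal KDE sketch using only the Prelude.
-- All names (gaussianKernel, kde, h) and the sample data are
-- illustrative assumptions, not taken from the book's code.

-- Standard normal density. It is non-negative, symmetric about 0,
-- and integrates to 1, which is what we ask of a kernel function.
gaussianKernel :: Double -> Double
gaussianKernel u = exp (negate (u * u) / 2) / sqrt (2 * pi)

-- Kernel density estimate at point x, for bandwidth h > 0 and a
-- non-empty list of samples xs:
--   f(x) = 1 / (n * h) * sum over xi in xs of K((x - xi) / h)
kde :: Double -> [Double] -> Double -> Double
kde h xs x = sum [gaussianKernel ((x - xi) / h) | xi <- xs] / (n * h)
  where
    n = fromIntegral (length xs)

main :: IO ()
main = do
  let samples = [1.0, 2.0, 2.5, 3.0, 7.0]  -- made-up values
      h       = 1.0                        -- arbitrary bandwidth
  -- Print the estimated density at each point of a small grid.
  mapM_ (\x -> putStrLn (show x ++ "\t" ++ show (kde h samples x)))
        [0, 1 .. 8]
```

Each sample contributes one small bell curve centred on itself, and the estimate at x is the average height of those curves. A larger h gives a smoother estimate, while a smaller h follows the individual samples more closely.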

We require a kernel function, and in this section...
