Book Image

Getting Started with Haskell Data Analysis

By : James Church
Book Image

Getting Started with Haskell Data Analysis

By: James Church

Overview of this book

Every business and organization that collects data is capable of tapping into its own data to gain insights how to improve. Haskell is a purely functional and lazy programming language, well-suited to handling large data analysis problems. This book will take you through the more difficult problems of data analysis in a hands-on manner. This book will help you get up-to-speed with the basics of data analysis and approaches in the Haskell language. You'll learn about statistical computing, file formats (CSV and SQLite3), descriptive statistics, charts, and progress to more advanced concepts such as understanding the importance of normal distribution. While mathematics is a big part of data analysis, we've tried to keep this course simple and approachable so that you can apply what you learn to the real world. By the end of this book, you will have a thorough understanding of data analysis, and the different ways of analyzing data. You will have a mastery of all the tools and techniques in Haskell for effective data analysis.
Table of Contents (8 chapters)

Using SQLite3 SELECT and the DescriptiveStats module for descriptive statistics

This section will serve as our review of the SQLite3 SELECT queries and our DescriptiveStats module. In this section, we're going to take a look at loading data from our MovieLens dataset into an IHaskell notebook. We're going to be demonstrating the built-in avg function in SQLite3 and a simple inner table join in SQLite3. We're also going to be using SQLite3 and descriptive stats to find the highest median across three genres of movies. We want to know which genre is considered the highest-rated: action, drama, or sci-fi movies. In the previous section, we had just created the movies.sqlite3 database. What we need to do now is to copy that file into our data folder, and, if we look at this folder, we should see our movies.sqlite3 file, as shown in the following example:

Now let&apos...