Clojure for Data Science

By: Henry Garner

Inspecting the data


The first task when confronted with a new dataset is to study it to ensure that we understand what it contains.

The all-london-2012-athletes.xlsx file is small enough that it's been provided with the sample code for this chapter. We can inspect the data with Incanter, as we did in Chapter 1, Statistics, using the incanter.excel/read-xls and incanter.core/view functions:

(ns cljds.ch3.examples
  (:require [clojure.java.io :as io]
            [incanter.charts :as c]
            [incanter.core :as i]
            [incanter.excel :as xls]
            [incanter.stats :as s]))

;; Locate the spreadsheet on the classpath and read it into an Incanter dataset.
(defn athlete-data []
  (-> (io/resource "all-london-2012-athletes.xlsx")
      (str)
      (xls/read-xls)))

(defn ex-3-1 []
  (i/view (athlete-data)))

If you run this code (either in the REPL or on the command line with lein run -e 3.1), Incanter should open a window displaying the dataset as a table.

We're fortunate that the data is clearly labeled in the columns and contains the following information:

  • Name of the athlete

  • Country for which they are competing

  • Age in years...
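
The column labels described above can also be checked from the REPL without opening a viewer window. The following is a minimal sketch, not code from the book: dataset-summary is a hypothetical helper, and the small sample dataset uses invented placeholder rows and assumed column names, so the real spreadsheet's labels may differ. It relies only on incanter.core/dataset, incanter.core/nrow, and incanter.core/col-names.

```clojure
(ns cljds.ch3.inspect-sketch
  (:require [incanter.core :as i]))

;; Hypothetical helper (not from the book): report the number of rows
;; and the column labels of any Incanter dataset.
(defn dataset-summary [ds]
  {:rows    (i/nrow ds)
   :columns (i/col-names ds)})

;; Placeholder data standing in for the spreadsheet. The column names
;; and values are invented for illustration only.
(def sample
  (i/dataset ["Name" "Country" "Age"]
             [["Athlete A" "Country X" 23]
              ["Athlete B" "Country Y" 30]]))

(dataset-summary sample)
```

With the chapter's namespace loaded, the same helper could be applied to the real data by calling (dataset-summary (athlete-data)), which is a quick way to confirm the labels before exploring the values further.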