Similar to functions in other languages, Julia can perform most of the rudimentary statistics on your data using the describe
function, as in the example script that follows:
using RDatasets describe(dataset("datasets", "iris"))
This script accesses the Iris dataset and displays summary statistics on the dataset.
If we were to build a notebook to show describe
in use against the iris
dataset (loaded in the previous example), we would end up with a display like this:
You can see the standard statistics generated for each of the variables in the dataset. I thought it was interesting that the count and percentage of NA values in the dataset are provided. I have found that I usually have to double-check to exclude this data using other languages. Here, it is a quick, built-in reminder.