Book Image

Interactive Visualization and Plotting with Julia

By : Diego Javier Zea
Book Image

Interactive Visualization and Plotting with Julia

By: Diego Javier Zea

Overview of this book

The Julia programming language offers a fresh perspective into the data visualization field. Interactive Visualization and Plotting with Julia begins by introducing the Julia language and the Plots package. The book then gives a quick overview of the Julia plotting ecosystem to help you choose the best library for your task. In particular, you will discover the many ways to create interactive visualizations with its packages. You’ll also leverage Pluto notebooks to gain interactivity and use them intensively through this book. You’ll find out how to create animations, a handy skill for communication and teaching. Then, the book shows how to solve data analysis problems using DataFrames and various plotting packages based on the grammar of graphics. Furthermore, you’ll discover how to create the most common statistical plots for data exploration. Also, you’ll learn to visualize geographically distributed data, graphs and networks, and biological data. Lastly, this book will go deeper into plot customizations with Plots, Makie, and Gadfly—focusing on the former—teaching you to create plot themes, arrange multiple plots into a single figure, and build new plot types. By the end of this Julia book, you’ll be able to create interactive and publication-quality static plots for data analysis and exploration tasks using Julia.
Table of Contents (19 chapters)
1
Section 1 – Getting Started
6
Section 2 – Advanced Plot Types
12
Section 3 – Mastering Plot Customization

Visualizing clustering results

The Clustering package offers multiple clustering algorithms for Julia. These algorithms aim to create groups where the elements in a group are more similar than those between groups. In particular, it provides the hclust function for performing a hierarchical clustering from a distance matrix. It creates a dendrogram, where the most similar data points, known as the leaves, are closer. This means you need to travel fewer and shorter branches from one leaf to another that belong to the same cluster than from visiting leaves outside it. The function returns an object of the Hclust type. The StatsPlots package exports a Plots recipe to draw the dendrogram when you call the plot function with a Hclust object. Let’s create a dendrogram that clusters the variables in the Iris dataset:

  1. Create a new Pluto notebook.
  2. Execute the following code in the first cell to load the necessary libraries and the Iris dataset:
    begin
       &...