In this second recipe on plotting using Bokeh, we'll see how to plot a time series graph with a dataset borrowed from https://archive.ics.uci.edu/ml/datasets/Dow+Jones+Index
. We will also see how to plot multiple charts in a single document.
To step through this recipe, you will need a running Spark Cluster in any one of the modes, that is, local, standalone, YARN, or Mesos. Also, include the Spark MLlib package in the build.sbt
file so that it downloads the related libraries and the API can be used. Install Hadoop (optionally), Scala, and Java.
Initially, specify the following libraries in the build.sbt
file as follows:
libraryDependencies ++= Seq( "io.continuum.bokeh" % "bokeh_2.10" % "0.5", "org.scalanlp" %% "breeze" % "0.5", "org.scalanlp" %% "breeze-viz" % "0.5" )
We'll be using only two fields from the dataset: the closing price of the stock at the end of the week, and...