Chapter 6
Other Scala Frameworks for Machine Learning
Section 3
Introduction to Apache Spark
In this video, we introduce Apache spark and give a broad overview of what it is and what it does. - First we explain how Spark works at a low level and mention its advantages over its predecessors such as Hadoop and Scalding. - Then we explain what RDDs are and how to work with them. We then provide an example using RDDs. - We then introduce Mllib—the spark library for machine learning—and discuss when to use Spark for machine learning implementations.