Chapter 10. Jupyter and Big Data
Big data is a topic on everyone's mind, so I thought it would be good to see what can be done with big data in Jupyter. An up-and-coming tool for dealing with large datasets is Spark, an open source big data processing framework. Spark can run on top of Hadoop, in the cloud, or standalone. We can write Spark code in Jupyter much as we did with the other languages we have seen.
In this chapter, we will cover the following topics:
Installing Spark for use in Jupyter
Using Spark's features