Book Image

Apache Spark in 7 Days [Video]

By : Karen Yang
Book Image

Apache Spark in 7 Days [Video]

By: Karen Yang

Overview of this book

If you’re looking to get up to speed with learning the fundamentals of Apache Spark in a short period of time, you can count on this course to help you learn the basics of this engine. Spark is becoming a popular big data processing engine with its unique ability to run in-memory with excellent speed. It is also easy to use and offers simple syntax. The course is designed to give you a fundamental understanding of and hands-on experience in writing basic code as well as running applications on a Spark cluster. Over 7 days, you will work on interesting examples and assignments that will demonstrate and help you understand basic operations, querying, machine learning, and streaming. By the end of this course, you’ll be able to put your learning to practice and build your own projects with ease and confidence. The code bundle for this video course is available at - https://github.com/PacktPublishing/Apache-Spark-in-7-Days
Table of Contents (7 chapters)
Chapter 1
Getting Started
Content Locked
Section 3
Launching a Spark Cluster on EC2
The aim of this video is to launch a Spark Cluster on EC2 using a Python script. - Download the Spark EC2 script and dependencies - Create a key-pair and generate AWS access credentials - Launch Spark cluster with EC2 script