Apache Spark in 7 Days [Video]

By: Karen Yang

Overview of this book

If you want to learn the fundamentals of Apache Spark in a short period of time, this course will help you learn the basics of the engine. Spark has become a popular big data processing engine, largely because it can process data in memory, which makes it fast. It is also easy to use and offers simple syntax. The course is designed to give you a fundamental understanding of Spark and hands-on experience in writing basic code and running applications on a Spark cluster. Over 7 days, you will work through examples and assignments that demonstrate basic operations, querying, machine learning, and streaming. By the end of this course, you’ll be able to put what you have learned into practice and build your own projects with confidence. The code bundle for this video course is available at https://github.com/PacktPublishing/Apache-Spark-in-7-Days
Table of Contents (7 chapters)
Chapter 3
DataFrames
Section 3
DataFrame Row Operations
The aim of this video is to explore DataFrame row operations such as changing values, filtering rows, and using a row function to create a DataFrame.
- Convert a DataFrame to an RDD and perform operations
- Alter row values and filter row values
- Use a row function to create a DataFrame