Big Data Analytics Projects with Apache Spark [Video]

By: Tomasz Lelek

Overview of this course

<p>Ready to use statistical and machine-learning techniques across large data sets? This course shows you how Apache Spark and the Hadoop MapReduce ecosystem are well suited to the job.</p> <p>This course contains several projects built on real-world examples. The first project finds the top-selling products for an e-commerce business by efficiently joining data sets in the Map/Reduce paradigm. Next, a Market Basket Analysis will help you identify items likely to be purchased together and find correlations between items in a set of transactions.</p> <p>Moving on, you’ll learn about probabilistic logistic regression by finding the author of a post. Next, you’ll build a content-based recommendation system for movies and train a model to predict whether an action will happen. Finally, we’ll use a Map/Reduce Spark program to calculate mutual friends on a social network.</p> <p>By the end of this course, you’ll have been exposed to a wide variety of mathematical techniques that can be used to train models with Spark and Hadoop, and you’ll know how to solve common problems.</p> <h1>Style and Approach</h1> <p>This course helps you perform data analysis by introducing each subject through examples and practice, so that you become more productive after each video.</p>
Table of Contents (5 chapters)
Chapter 4
Content-Based Recommendation System: Movies
Section 1
Content-Based Recommendation Systems Explanation
This video shows how to build a recommendation system.
- Define collaborative filtering
- Learn how to implement it