Book Image

Machine Learning with Apache Spark Quick Start Guide

By : Jillur Quddus
Book Image

Machine Learning with Apache Spark Quick Start Guide

By: Jillur Quddus

Overview of this book

Every person and every organization in the world manages data, whether they realize it or not. Data is used to describe the world around us and can be used for almost any purpose, from analyzing consumer habits to fighting disease and serious organized crime. Ultimately, we manage data in order to derive value from it, and many organizations around the world have traditionally invested in technology to help process their data faster and more efficiently. But we now live in an interconnected world driven by mass data creation and consumption where data is no longer rows and columns restricted to a spreadsheet, but an organic and evolving asset in its own right. With this realization comes major challenges for organizations: how do we manage the sheer size of data being created every second (think not only spreadsheets and databases, but also social media posts, images, videos, music, blogs and so on)? And once we can manage all of this data, how do we derive real value from it? The focus of Machine Learning with Apache Spark is to help us answer these questions in a hands-on manner. We introduce the latest scalable technologies to help us manage and process big data. We then introduce advanced analytical algorithms applied to real-world use cases in order to uncover patterns, derive actionable insights, and learn from this big data.
Table of Contents (10 chapters)

Other Books You May Enjoy

If you enjoyed this book, you may be interested in these other books by Packt:

Apache Spark Deep Learning Cookbook
Ahmed Sherif

ISBN: 9781788474221

  • Set up a fully functional Spark environment
  • Understand practical machine learning and deep learning concepts
  • Apply built-in machine learning libraries within Spark
  • Explore libraries that are compatible with TensorFlow and Keras
  • Explore NLP models such as word2vec and TF-IDF on Spark
  • Organize dataframes for deep learning evaluation
  • Apply testing and training modeling to ensure accuracy
  • Access readily available code that may be reusable

Apache Spark 2.x Machine Learning Cookbook
Siamak Amirghodsi

ISBN: 9781783551606

  • Get to know how Scala and Spark go hand-in-hand for developers when developing ML systems with Spark
  • Build a recommendation engine that scales with Spark
  • Find out how to build unsupervised clustering...