Book Image

Scala Machine Learning Projects

Book Image

Scala Machine Learning Projects

Overview of this book

Machine learning has had a huge impact on academia and industry by turning data into actionable information. Scala has seen a steady rise in adoption over the past few years, especially in the fields of data science and analytics. This book is for data scientists, data engineers, and deep learning enthusiasts who have a background in complex numerical computing and want to know more hands-on machine learning application development. If you're well versed in machine learning concepts and want to expand your knowledge by delving into the practical implementation of these concepts using the power of Scala, then this book is what you need! Through 11 end-to-end projects, you will be acquainted with popular machine learning libraries such as Spark ML, H2O, DeepLearning4j, and MXNet. At the end, you will be able to use numerical computing and functional programming to carry out complex numerical tasks to develop, build, and deploy research or commercial projects in a production-ready environment.
Table of Contents (17 chapters)
Title Page
Packt Upsell
Contributors
Preface
Index

Selecting and deploying the best model 


It is worth mentioning that the first model developed in the first project cannot be persisted since it is just a few lines of code for computing movie similarity. It also has another limitation that we did not cover earlier. It can compute the similarity between two movies, but what about more than two movies? Frankly speaking, a model like the first one would rarely be deployed for a real-life movie. So let's focus on the model-based recommendation engine instead.

Although ratings from users will keep coming, still it might be worth it to store the current one. Therefore, we also want to persist our current base model for later use in order to save time when starting up the server. The idea is to use the current model for real-time movie recommendations.

Nevertheless, we might also save time if we persist some of the RDDs we have generated, especially those that took longer to process. The following line saves our trained ALS model (see the MovieRecommendation...