Book Image

Machine Learning with Scala Quick Start Guide

By : Md. Rezaul Karim, Ajay Kumar N
Book Image

Machine Learning with Scala Quick Start Guide

By: Md. Rezaul Karim, Ajay Kumar N

Overview of this book

Scala is a highly scalable integration of object-oriented nature and functional programming concepts that make it easy to build scalable and complex big data applications. This book is a handy guide for machine learning developers and data scientists who want to develop and train effective machine learning models in Scala. The book starts with an introduction to machine learning, while covering deep learning and machine learning basics. It then explains how to use Scala-based ML libraries to solve classification and regression problems using linear regression, generalized linear regression, logistic regression, support vector machine, and Naïve Bayes algorithms. It also covers tree-based ensemble techniques for solving both classification and regression problems. Moving ahead, it covers unsupervised learning techniques, such as dimensionality reduction, clustering, and recommender systems. Finally, it provides a brief overview of deep learning using a real-life example in Scala.
Table of Contents (9 chapters)

Summary

In this chapter, we saw how to classify cancer patients on the basis of tumor types from a very high-dimensional gene expression dataset curated from TCGA. Our LSTM architecture managed to achieve 99% accuracy, which is outstanding. Nevertheless, we discussed many aspects of DL4J, which will be helpful in upcoming chapters. Finally, we saw answers to some frequent questions related to this project, LSTM networks, and DL4J hyperparameters/network tuning.

This is, more or less, the end of our little journey in developing ML projects using Scala and different open source frameworks. Throughout these chapters, I have tried to provide you with several examples of how to use these wonderful technologies efficiently for developing ML projects. While writing this book, I had to keep many constraints in my mind; for example, the page count, API availability, and of course, my expertise...