Book Image

Apache Mahout Essentials

By : Jayani Withanawasam
Book Image

Apache Mahout Essentials

By: Jayani Withanawasam

Overview of this book

Table of Contents (13 chapters)
Apache Mahout Essentials
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Summary


In this chapter we learned that classification and regression are supervised learning problems, which require labeled data. Predictor variables and output variables should be defined to come up with a model during the training phase.

We also saw that supervised learning can be achieved using different techniques, namely model-based, regression-based and tree-based techniques.

Regression can be divided into two categories based on the outcome of the algorithm, that is linear regression and logistic regression.

We saw that text classification is an important application and this is explained using the Naïve Bayes algorithm.

In the next chapter, we will discuss the recommendation techniques using Apache Mahout.