Book Image

Machine Learning Fundamentals

By : Hyatt Saleh
Book Image

Machine Learning Fundamentals

By: Hyatt Saleh

Overview of this book

As machine learning algorithms become popular, new tools that optimize these algorithms are also developed. Machine Learning Fundamentals explains you how to use the syntax of scikit-learn. You'll study the difference between supervised and unsupervised models, as well as the importance of choosing the appropriate algorithm for each dataset. You'll apply unsupervised clustering algorithms over real-world datasets, to discover patterns and profiles, and explore the process to solve an unsupervised machine learning problem. The focus of the book then shifts to supervised learning algorithms. You'll learn to implement different supervised algorithms and develop neural network structures using the scikit-learn package. You'll also learn how to perform coherent result analysis to improve the performance of the algorithm by tuning hyperparameters. By the end of this book, you will have gain all the skills required to start programming machine learning algorithms.
Table of Contents (9 chapters)
Machine Learning Fundamentals
Preface

Error Analysis


In the previous chapter, we explained the importance of error analysis. In this section, the different evaluation metrics will be calculated for all three models that were created in the previous activities, so that we can compare them.

Keep in mind that the selection of an evaluation metric is done according to the purpose of the case study. Nonetheless, next, we will compare the models using the accuracy, precision, and recall metrics, for learning purposes. This way, it will be possible to see that even though a model may be better in terms of one metric, it can be worse when measuring a different metric, which helps to emphasize the importance of choosing the right metric.

Accuracy, Precision, and Recall

As a quick reminder, in order to measure performance and perform error analysis, it is required that you use the predict method on the different sets of data (training, validation, and testing). The following code snippets present a clean way of measuring all three metrics...