Hands-On Machine Learning with Microsoft Excel 2019

By: Julio Cesar Rodriguez Martino

Overview of this book

We have made huge progress in teaching computers to perform difficult tasks, especially those that are repetitive and time-consuming for humans. Excel users of all levels can feel left behind by this wave of innovation. The truth is that a large amount of the work needed to develop and use a machine learning model can be done in Excel. The book starts with a general introduction to machine learning that makes every concept clear and understandable. It then shows every step of a machine learning project, from collecting data and reading it from different data sources to developing models and visualizing the results using Excel features and offerings. Every chapter contains several examples and hands-on exercises that show the reader how to combine Excel functions, add-ins, and connections to databases and cloud services to reach the desired goal: building a full data analysis flow. Different machine learning models are shown, each tailored to the type of data to be analyzed. At the end of the book, the reader is presented with some advanced use cases using Automated Machine Learning and artificial neural networks, which simplify the analysis task and represent the future of machine learning.

Understanding unsupervised learning with clustering

Clustering is a statistical method that attempts to group the points in a dataset according to a distance measure, usually the Euclidean distance, which is the square root of the sum of the squared differences between the coordinates of a pair of points. To put this simply, points that are assigned to the same cluster are closer (in terms of the distance defined) to each other than they are to the points belonging to other clusters. At the same time, the larger the distance between two clusters, the better we can distinguish them. In other words, we try to build groups whose members are similar to each other and different from the members of other groups.
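To make the distance measure concrete, here is a minimal Python sketch of the Euclidean distance and of the idea that points in the same cluster are closer to each other than to points in another cluster. The function names, the helper `mean`, and the two small sets of 2-D points are hypothetical and chosen only for illustration; they do not come from the book's Excel workbooks.

```python
import math
from itertools import combinations, product


def euclidean_distance(p, q):
    """Square root of the sum of squared coordinate differences."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of coordinates")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def mean(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


# Hypothetical 2-D points forming two well-separated groups.
cluster_a = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0)]
cluster_b = [(8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]

# Average distance between distinct points inside cluster A.
within_a = mean([euclidean_distance(p, q) for p, q in combinations(cluster_a, 2)])

# Average distance between a point in A and a point in B.
between = mean([euclidean_distance(p, q) for p, q in product(cluster_a, cluster_b)])

# For a sensible grouping, the within-cluster distance is the smaller one.
print(within_a < between)
```

In this toy data the average within-cluster distance is much smaller than the average distance between the two groups, which is the property a clustering algorithm tries to achieve when it assigns points to clusters.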

It is clear that the most important part of a clustering algorithm is to define and calculate the distance between two given points and to iteratively assign the points...