Test Driven Machine Learning

Book Image

Test Driven Machine Learning

Book Image

Test Driven Machine Learning

Overview of this book

Test-Driven Machine Learning

Test-Driven Machine Learning

Credits

About the Author

About the Author

About the Reviewers

About the Reviewers

www.PacktPub.com

www.PacktPub.com

Preface

Free Chapter

Introducing Test-Driven Machine Learning

Introducing Test-Driven Machine Learning

Test-driven development

Behavior-driven development

TDD applied to machine learning

Dealing with randomness

Different approaches to validating the improved models

Quantifying the classification models

Perceptively Testing a Perceptron

Perceptively Testing a Perceptron

Getting started

Exploring the Unknown with Multi-armed Bandits

Exploring the Unknown with Multi-armed Bandits

Understanding a bandit

Testing with simulation

Starting from scratch

Simulating real world situations

A randomized probability matching algorithm

A bootstrapping bandit

The problem with straight bootstrapping

Multi-armed armed bandit throw down

Predicting Values with Regression

Predicting Values with Regression

Refresher on advanced regression

Generating our own data

Building the foundations of our model

Cross-validating our model

Generating data

Making Decisions Black and White with Logistic Regression

Making Decisions Black and White with Logistic Regression

Generating logistic data

Measuring model accuracy

Generating a more complex example

Test driving our model

You're So Naïve, Bayes

You're So Naïve, Bayes

Gaussian classification by hand

Beginning the development

Optimizing by Choosing a New Algorithm

Optimizing by Choosing a New Algorithm

Upgrading the classifier

Applying our classifier

Upgrading to Random Forest

Exploring scikit-learn Test First

Exploring scikit-learn Test First

Test-driven design

Planning our journey

Getting choosey

Developing testable documentation

Bringing It All Together

Bringing It All Together

Starting at the highest level

What we've accomplished

Index

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

A randomized probability matching algorithm

The randomized probability matching bandit algorithm is a Bayesian statistical approach to the problem of figuring out when to explore our options and when to exploit them for a nice payoff. It works by sampling a probability distribution that describes the probable mean of the payoff. As we gain more data, the variance of the possible means narrows significantly. As we go through the rest of this chapter, we'll delve deeper into how this algorithm works.

As a concrete example, we can run some simulations. The following is a histogram of repeatedly sampling means with 100 samples from a normal distribution:

plt.title('Distribution of means for N(35,5) distribution (sampling 100 vs 500 data points)')
plt.xlabel('')
plt.ylabel('Counts')

plt.hist([np.random.normal(loc=35, scale=5, size=100).mean() for i in range(2500)], label='100 sample mean')
plt.hist([np.random.normal(loc=35, scale=5, size=500).mean() for i in range(2500)], label='500 sample mean...