Chapter 7: Choosing the Best AI Algorithm | Building Data Science Solutions with Anaconda

Book Overview & Buying
Table Of Contents

Building Data Science Solutions with Anaconda

By : Dan Meador

5 (11)

Buy this Book

Building Data Science Solutions with Anaconda

5 (11)

By: Dan Meador

Buy this Book

Overview of this book

You might already know that there's a wealth of data science and machine learning resources available on the market, but what you might not know is how much is left out by most of these AI resources. This book not only covers everything you need to know about algorithm families but also ensures that you become an expert in everything, from the critical aspects of avoiding bias in data to model interpretability, which have now become must-have skills. In this book, you'll learn how using Anaconda as the easy button, can give you a complete view of the capabilities of tools such as conda, which includes how to specify new channels to pull in any package you want as well as discovering new open source tools at your disposal. You’ll also get a clear picture of how to evaluate which model to train and identify when they have become unusable due to drift. Finally, you’ll learn about the powerful yet simple techniques that you can use to explain how your model works. By the end of this book, you’ll feel confident using conda and Anaconda Navigator to manage dependencies and gain a thorough understanding of the end-to-end data science workflow.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Share Your Thoughts

Part 1: The Data Science Landscape – Open Source to the Rescue

Free Chapter

Chapter 1: Understanding the AI/ML landscape

Introducing Artificial Intelligence (AI)

Understanding the current state of AI and ML

Understanding the massive generation of new data

Evaluating how AI delivers business value

Understanding the main types of ML models

Dealing with out-of-date models

Installing packages with Anaconda

Summary

Chapter 2: Analyzing Open Source Software

Technical requirements

Understanding open source

Understanding the top four OSS licenses

Evaluating a new tool or library

Importing packages with Anaconda and conda-forge

Evaluating and using scikit-learn

Summary

Chapter 3: Using the Anaconda Distribution to Manage Packages

Technical requirements

Learning how dependency resolution works

Discovering what conda environments are and how to use them

Managing channels with Anaconda Navigator and conda

Using advanced conda info and settings

Conda cheat sheet

Summary

Chapter 4: Working with Jupyter Notebooks and NumPy

Technical requirements

Working with Jupyter notebooks

Using NumPy to perform calculations quickly

Summary

Part 2: Data Is the New Oil, Models Are the New Refineries

Chapter 5: Cleaning and Visualizing Data

Technical requirements

Cleaning data with pandas

Visualization with Matplotlib

Summary

Chapter 6: Overcoming Bias in AI/ML

Technical requirements

Defining bias versus discrimination

Overcoming proxy bias

Overcoming sample bias

Overcoming exclusion bias

Overcoming measurement bias

Overcoming societal AI bias

Finding bias in an example

Summary

Chapter 7: Choosing the Best AI Algorithm

Technical requirements

Defining your problem

Understanding regression problems with examples

Classification

Anomaly detection

Clustering problems

Summary

Chapter 8: Dealing with Common Data Problems

Technical requirements

Dealing with too much data

Finding and correcting data entries

Working with categorical values with one-hot encoding

Feature scaling

Working with date formats

Summary

Part 3: Practical Examples and Applications

Chapter 9: Building a Regression Model with scikit-learn

Technical requirements

Walking through the data science workflow

Setting up and understanding the problem space

Exploring and cleaning the data

Creating and evaluating regression algorithms

Evaluating potential models using MSE and R2 scores

Summary

Chapter 10: Explainable AI - Using LIME and SHAP

Technical requirements

Understanding the value of interpretation

Understanding models that are interpretable by design

Explaining a model's outcome with LIME

Explaining a model's outcome with SHAP

Summary

Chapter 11: Tuning Hyperparameters and Versioning Your Model

Technical requirements

Creating a scikit-learn pipeline

Finding optimal hyperparameters with GridSearchCV

Versioning and storing your model

Summary

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Building Data Science Solutions with Anaconda

By : Dan Meador

Building Data Science Solutions with Anaconda

By: Dan Meador

Overview of this book

Clustering problems

DBScan

Confirmation

Buy this book with your credits?

Submit Your Feedback

Create a Free Account To Continue Reading

Sign in to activate your 7-day free access