Interpretable Machine Learning with Python - Second Edition

By: Serg Masís
Overview of this book

Interpretable Machine Learning with Python, Second Edition, brings to light the key concepts of interpreting machine learning models by analyzing real-world data, providing you with a wide range of skills and tools to decipher the results of even the most complex models. Build your interpretability toolkit with several use cases, from flight delay prediction to waste classification to COMPAS risk assessment scores. This book is full of useful techniques, matching each to the right use case. Learn methods ranging from traditional ones, such as feature importance and partial dependence plots, to integrated gradients for NLP interpretations and gradient-based attribution methods, such as saliency maps. In addition to the step-by-step code, you’ll get hands-on with tuning models and training data for interpretability by reducing complexity, mitigating bias, placing guardrails, and enhancing reliability. By the end of the book, you’ll be confident in tackling interpretability challenges with black-box models using tabular, language, image, and time series data.

Learning about evasion attacks

There are six broad categories of adversarial attacks:

  • Evasion: designing an input that causes a model to make an incorrect prediction, especially one that wouldn’t fool a human observer. It can be either targeted or untargeted, depending on whether the attacker intends to fool the model into misclassifying an input as a specific class (targeted) or as any incorrect class (untargeted). Attack methods can be white-box, if the attacker has full access to the model and its training dataset; black-box, with only inference access; or gray-box, which sits in between. Black-box methods are always model-agnostic, whereas white-box and gray-box methods may or may not be. A minimal evasion sketch follows this list.
  • Poisoning: injecting faulty training data or parameters into a model. This can come in many forms, depending on the attacker’s capabilities and access. For instance, for systems with user-generated data, the attacker may be capable of adding faulty data or labels (see the label-flipping sketch after this list). If they have more access...
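
To make the targeted/untargeted and white-box/black-box distinctions concrete, here is a minimal sketch of a one-step gradient evasion attack in the style of the Fast Gradient Sign Method (FGSM), written in PyTorch. This is an illustrative sketch, not the book’s implementation: the `fgsm_attack` function name, the `epsilon` budget, and the [0, 1] input range are assumptions.

```python
import torch
import torch.nn.functional as F

def fgsm_attack(model, x, y_true, epsilon=0.03, y_target=None):
    """Craft a one-step gradient-sign evasion example (white-box).

    Untargeted (y_target is None): step *up* the loss gradient for the
    true label, nudging the model toward any incorrect class.
    Targeted: step *down* the loss gradient for y_target, nudging the
    model toward that specific class.
    """
    x_adv = x.clone().detach().requires_grad_(True)
    loss_label = y_true if y_target is None else y_target
    loss = F.cross_entropy(model(x_adv), loss_label)
    loss.backward()
    sign = 1.0 if y_target is None else -1.0  # raise vs. lower the loss
    # Take one small step in the direction of the gradient's sign,
    # then clip back to a valid input range.
    x_adv = x_adv + sign * epsilon * x_adv.grad.sign()
    return x_adv.clamp(0.0, 1.0).detach()
```

Because the perturbation is computed from the model’s own gradients, this is a white-box attack; a black-box attacker with only inference access would instead estimate the gradient from queries, or craft the example on a substitute model and rely on transferability, which is why black-box methods are inherently model-agnostic.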
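
For the poisoning category, here is a minimal label-flipping sketch, assuming an attacker who can contribute labeled examples to a user-generated training pool. The `flip_labels` helper and its parameters are hypothetical, for illustration only.

```python
import numpy as np

def flip_labels(y_train, flip_fraction=0.05, target_class=1,
                new_class=0, seed=0):
    """Simulate data poisoning by flipping a fraction of labels.

    Relabels flip_fraction of the examples whose label is target_class
    as new_class before training, corrupting the decision boundary the
    model learns for that class.
    """
    rng = np.random.default_rng(seed)
    y_poisoned = y_train.copy()
    candidates = np.flatnonzero(y_train == target_class)
    n_flips = int(flip_fraction * len(candidates))
    flipped = rng.choice(candidates, size=n_flips, replace=False)
    y_poisoned[flipped] = new_class
    return y_poisoned
```

Even a small flip_fraction can measurably degrade accuracy on the targeted class, which is what makes this kind of attack attractive when the attacker controls only a sliver of the training data.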