Book Image

Machine Learning Security Principles

By : John Paul Mueller

Book Image

Machine Learning Security Principles

By: John Paul Mueller

Overview of this book

Businesses are leveraging the power of AI to make undertakings that used to be complicated and pricy much easier, faster, and cheaper. The first part of this book will explore these processes in more depth, which will help you in understanding the role security plays in machine learning. As you progress to the second part, you’ll learn more about the environments where ML is commonly used and dive into the security threats that plague them using code, graphics, and real-world references. The next part of the book will guide you through the process of detecting hacker behaviors in the modern computing environment, where fraud takes many forms in ML, from gaining sales through fake reviews to destroying an adversary’s reputation. Once you’ve understood hacker goals and detection techniques, you’ll learn about the ramifications of deep fakes, followed by mitigation strategies. This book also takes you through best practices for embracing ethical data sourcing, which reduces the security risk associated with data. You’ll see how the simple act of removing personally identifiable information (PII) from a dataset lowers the risk of social engineering attacks. By the end of this machine learning book, you'll have an increased awareness of the various attacks and the techniques to secure your ML systems effectively.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Conventions used

Share Your Thoughts

Download a free PDF copy of this book

Part 1 – Securing a Machine Learning System

Part 1 – Securing a Machine Learning System

Free Chapter

Chapter 1: Defining Machine Learning Security

Chapter 1: Defining Machine Learning Security

Building a picture of ML

Adding security to ML

Setting up for the book

Chapter 2: Mitigating Risk at Training by Validating and Maintaining Datasets

Chapter 2: Mitigating Risk at Training by Validating and Maintaining Datasets

Technical requirements

Defining dataset threats

Detecting dataset modification

Mitigating dataset corruption

Chapter 3: Mitigating Inference Risk by Avoiding Adversarial Machine Learning Attacks

Chapter 3: Mitigating Inference Risk by Avoiding Adversarial Machine Learning Attacks

Defining adversarial ML

Considering security issues in ML algorithms

Describing the most common attack techniques

Mitigating threats to the algorithm

Further reading

Part 2 – Creating a Secure System Using ML

Part 2 – Creating a Secure System Using ML

Chapter 4: Considering the Threat Environment

Chapter 4: Considering the Threat Environment

Technical requirements

Defining an environment

Understanding business threats

Considering social threats

Employing ML in security in the real world

Further reading

Chapter 5: Keeping Your Network Clean

Chapter 5: Keeping Your Network Clean

Technical requirements

Defining current network threats

Considering traditional protections

Adding ML to the mix

Creating real-time defenses

Developing predictive defenses

Chapter 6: Detecting and Analyzing Anomalies

Chapter 6: Detecting and Analyzing Anomalies

Technical requirements

Defining anomalies

Detecting data anomalies

Using anomaly detection effectively in ML

Considering other mitigation techniques

Further reading

Chapter 7: Dealing with Malware

Chapter 7: Dealing with Malware

Technical requirements

Defining malware

Generating malware detection features

Classifying malware

Further reading

Chapter 8: Locating Potential Fraud

Chapter 8: Locating Potential Fraud

Technical requirements

Understanding the types of fraud

Defining fraud sources

Considering fraud that occurs in the background

Considering fraud that occurs in real time

Building a fraud detection example

Further reading

Chapter 9: Defending against Hackers

Chapter 9: Defending against Hackers

Technical requirements

Considering hacker targets

Defining hacker goals

Monitoring and alerting

Improving security and reliability

Further reading

Part 3 – Protecting against ML-Driven Attacks

Part 3 – Protecting against ML-Driven Attacks

Chapter 10: Considering the Ramifications of Deepfakes

Chapter 10: Considering the Ramifications of Deepfakes

Technical requirements

Defining a deepfake

Creating a deepfake computer setup

Understanding autoencoders

Understanding CNNs and implementing GANs

Further reading

Chapter 11: Leveraging Machine Learning for Hacking

Chapter 11: Leveraging Machine Learning for Hacking

Making attacks automatic and personalized

Enhancing existing capabilities

Further reading

Part 4 – Performing ML Tasks in an Ethical Manner

Part 4 – Performing ML Tasks in an Ethical Manner

Chapter 12: Embracing and Incorporating Ethical Behavior

Chapter 12: Embracing and Incorporating Ethical Behavior

Technical requirements

Sanitizing data correctly

Defining data source awareness

Understanding ML fairness

Addressing fairness concerns

Mitigating privacy risks using federated learning and differential privacy

Further reading

Index

Other Books You May Enjoy

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Download a free PDF copy of this book

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Detecting data anomalies

Anomaly (and its novelty counterpart) detection is a never-ending, constant requirement because anomalies happen all the time. However, with all this talk of detecting and removing anomalies, you need to consider something else. If you remove the novelties from the dataset (thinking that they are anomalies), then you may not see an important trend. Consequently, detection and research into possible novelties go hand in hand. Of course, the most important place to start is with the data itself, looking for values that don’t obviously belong. Figure 6.2 provides a list of common techniques to detect outliers (the table is definitely incomplete because there are many others):

Method	Type	Description
Cook’s distance	Model-specific	This estimates the variations in regression coefficients...