Book Image

Python Machine Learning Workbook for Beginners

By : AI Sciences
Book Image

Python Machine Learning Workbook for Beginners

By: AI Sciences

Overview of this book

<p>Machine Learning (ML) is the lifeblood of businesses worldwide. ML tools empower organizations to identify profitable opportunities fast and help them to better understand potential risks. The ever-expanding data, cost-effective data storage, and competitively priced powerful processing continue to drive the growth of ML. </p><p> </p><p>This is the best time you could enter the exciting machine learning universe. Industries are reinventing themselves constantly by developing more advanced data analysis models. These models analyze larger and more complex data than ever while delivering instantaneous and more accurate results on enormous scales. </p><p>In this backdrop, it is evident that hands-on practice is everything in machine learning. Tons of theory will amount to nothing if you don’t have enough hands-on practice. Textbooks and online classes mislead you into a false sense of mastery. The easy availability of learning resources tricks you and you become overconfident. But when you try to apply the theoretical concepts you have learned, you realize it’s not that simple. </p><p> </p><p>This is where projects play a crucial role in your learning journey. Projects are doubtless the best investment of your time. You’ll not only enjoy learning but you’ll also make quick progress. And unlike studying boring theoretical concepts, you’ll find that working on projects is easier to stay motivated. </p><p> </p><p>The projects in this book cover ten different interesting topics. Each project will help you refine your ML skills and apply them in the real world. These projects also present you with an opportunity to enrich your portfolio, making it simpler to find a great job, explore interesting career paths, and even negotiate a higher pay package. Overall, this learning-by-doing book will help you accomplish your machine learning career goals faster. </p><p> </p><p>The code bundle for this course is available at https://www.aispublishing.net/ai-sciences-book</p>
Table of Contents (15 chapters)
1
About the Author

2.5. Cleaning the Data

Before training our machine learning model on the training data, we need to remove the special characters and numbers from our text. Removing special characters and numbers create empty spaces in the text, which also need to be removed.

Before cleaning the data, let’s first divide the data into the email text, which forms the feature set (X), and the email labels (y), which contains information about whether or not an email is a spam email.

Script 8:

1. X = message_dataset[“text”]

2.

3. y = message_dataset[“spam”]

The following script defines a clean_text() method, which accepts a text string and returns a string that is cleaned of digits, special characters, and multiple empty spaces.

Script 9:

1. def clean_text(doc):

2.

3.

4.imgae document = re.sub(‘[^a-zA-Z]’, ‘ ‘, doc)

5.

6.imgae document = re.sub(r»\s+[a-zA-Z]\s+», ‘ ‘, document)

7.

8. imgaedocument = re.sub(r’\s+’, ‘...