Book Image

Python Machine Learning Blueprints - Second Edition

By : Alexander Combs, Michael Roman

Book Image

Python Machine Learning Blueprints - Second Edition

By: Alexander Combs, Michael Roman

Overview of this book

Machine learning is transforming the way we understand and interact with the world around us. This book is the perfect guide for you to put your knowledge and skills into practice and use the Python ecosystem to cover key domains in machine learning. This second edition covers a range of libraries from the Python ecosystem, including TensorFlow and Keras, to help you implement real-world machine learning projects. The book begins by giving you an overview of machine learning with Python. With the help of complex datasets and optimized techniques, you’ll go on to understand how to apply advanced concepts and popular machine learning algorithms to real-world projects. Next, you’ll cover projects from domains such as predictive analytics to analyze the stock market and recommendation systems for GitHub repositories. In addition to this, you’ll also work on projects from the NLP domain to create a custom news feed using frameworks such as scikit-learn, TensorFlow, and Keras. Following this, you’ll learn how to build an advanced chatbot, and scale things up using PySpark. In the concluding chapters, you can look forward to exciting insights into deep learning and you'll even create an application using computer vision and neural networks. By the end of this book, you’ll be able to analyze data seamlessly and make a powerful impact through your projects.

Preface

Who this book is for

What this book covers

To get the most out of this book

Free Chapter

The Python Machine Learning Ecosystem

The Python Machine Learning Ecosystem

Data science/machine learning workflow

Python libraries and functions for each stage of the data science workflow

Setting up your machine learning environment

Build an App to Find Underpriced Apartments

Build an App to Find Underpriced Apartments

Sourcing apartment listing data

Inspecting and preparing the data

Visualizing our data

Visualizing the data

Modeling the data

Extending the model

Build an App to Find Cheap Airfares

Build an App to Find Cheap Airfares

Sourcing airfare pricing data

Retrieving fare data with advanced web scraping

Parsing the DOM to extract pricing data

Identifying outlier fares with anomaly detection techniques

Sending real-time alerts using IFTTT

Putting it all together

Forecast the IPO Market Using Logistic Regression

Forecast the IPO Market Using Logistic Regression

Data cleansing and feature engineering

Binary classification with logistic regression

Generating the importance of a feature from our model

Create a Custom Newsfeed

Create a Custom Newsfeed

Creating a supervised training set with Pocket

Using the Embedly API to download story bodies

Basics of Natural Language Processing

Support Vector Machines

IFTTT integration with feeds, Google Sheets, and email

Setting up your daily personal newsletter

Predict whether Your Content Will Go Viral

Predict whether Your Content Will Go Viral

What does research tell us about virality?

Sourcing shared counts and content

Exploring the features of shareability

Building a predictive content scoring model

Use Machine Learning to Forecast the Stock Market

Use Machine Learning to Forecast the Stock Market

Types of market analysis

What does research tell us about the stock market?

How to develop a trading strategy

Building the regression model

Classifying Images with Convolutional Neural Networks

Classifying Images with Convolutional Neural Networks

Image-feature extraction

Convolutional neural networks

Building a convolutional neural network to classify images in the Zalando Research dataset, using Keras

Building a Chatbot

Building a Chatbot

The Turing Test

The history of chatbots

The design of chatbots

Building a chatbot

Sequence-to-sequence modeling for chatbots

Build a Recommendation Engine

Build a Recommendation Engine

Collaborative filtering

Content-based filtering

Building a recommendation engine

What's Next?

Summary of the projects

Other Books You May Enjoy

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Building a predictive content scoring model

Let's use what we have learned to create a model that can estimate the share counts for a given piece of content. We'll use the features we have already created, along with a number of additional ones.

Ideally, we would have a much larger sample of content—especially content that had more typical share counts—but we'll have to make do with what we have here.

We're going to be using an algorithm called random forest regression. In previous chapters, we looked at a more typical implementation of random forests that is based on classification, but here we're going to attempt to predict the share counts. We could consolidate our share classes into ranges, but it is preferable to use regression when dealing with continuous variables, which is what we're working with here.

To begin, we'll create...