Book Image

Hands-On Predictive Analytics with Python

By : Alvaro Fuentes
Book Image

Hands-On Predictive Analytics with Python

By: Alvaro Fuentes

Overview of this book

Predictive analytics is an applied field that employs a variety of quantitative methods using data to make predictions. It involves much more than just throwing data onto a computer to build a model. This book provides practical coverage to help you understand the most important concepts of predictive analytics. Using practical, step-by-step examples, we build predictive analytics solutions while using cutting-edge Python tools and packages. The book's step-by-step approach starts by defining the problem and moves on to identifying relevant data. We will also be performing data preparation, exploring and visualizing relationships, building models, tuning, evaluating, and deploying model. Each stage has relevant practical examples and efficient Python code. You will work with models such as KNN, Random Forests, and neural networks using the most important libraries in Python's data science stack: NumPy, Pandas, Matplotlib, Seaborn, Keras, Dash, and so on. In addition to hands-on code examples, you will find intuitive explanations of the inner workings of the main techniques and algorithms used in predictive analytics. By the end of this book, you will be all set to build high-performance predictive analytics solutions using Python programming.
Table of Contents (11 chapters)

Preface

Predictive analytics is one of the most important technologies of our time. Every day, companies in all industries and all types of institutions are using predictive techniques to solve a wide range of problems. Although many of the main ideas and techniques have been around for many decades, the use of predictive analytics has exploded recently due to the increased ability to capture and store data, which is the raw material from which we build predictive models. There are two other big factors that explain the increasing adoption of this technology: the first is the astonishing increase in computing power, and the second is the availability of many open source software projects that have given access to professionals outside academia to many of the most powerful predictive analytics techniques. The Python programming language and its ecosystem of analytical libraries, also known as Python's data science stack, is such a project and has democratized the use of advanced analytical techniques.

This is a book about predictive analytics, but rather than focusing exclusively on explaining in detail the algorithms and techniques, this book is more about the process of doing predictive analytics in the real world. The main goal of this book is to make you familiar with all the stages in the process of solving a business problem using predictive modeling and to show, with hands-on examples, how to use Python and its data analytics ecosystem to implement many of the main techniques and approaches used in real-world predictive analytics projects. We use two main projects in this book and walk you through the entire predictive analytics process: from business and problem understanding to model deployment, all through hands-on examples.

There are many techniques that can be used for predictive analytics: statistical models, time series analysis, and spatial statistics to mention a few; however, in this book, we focus on the most widely applicable and successful set of techniques: machine learning, specifically the branch of supervised learning.

In my view, a predictive model is only a means to an end. The goal of using predictive analytics is to solve problems; therefore, a good predictive model is not one that uses the latest and most fashionable techniques, nor is it the most complicated or the simplest one. A good predictive model one can be used to solve a real-world problem in a satisfactory way. My goal is that by the end of this book you will have the foundations that you need to start solving real-world problems using predictive analytics.