So far, we've been performing supervised learning. There have been labels we wished to predict correctly, and values we wished to approximate closely with a function, and were unable to. Now, we'll look at an entirely different topic, which will be the focus of both this chapter and the next: unsupervised learning, starting with clustering. This chapter starts with a brief discussion on the difference between supervised and unsupervised learning, and specifically, what clustering is. After that, we'll look at our first clustering algorithm: the k-means algorithm, a popular and simple algorithm. Before exploring some other algorithms, we'll discuss approaches to evaluating a clustering scheme. Then, we'll move on to the next two approaches for clustering; the first being hierarchical clustering. The final clustering approach we&apos...
-
Book Overview & Buying
-
Table Of Contents
Training Systems Using Python Statistical Modeling
By :
Training Systems Using Python Statistical Modeling
By:
Overview of this book
Python's ease-of-use and multi-purpose nature has made it one of the most popular tools for data scientists and machine learning developers. Its rich libraries are widely used for data analysis, and more importantly, for building state-of-the-art predictive models. This book is designed to guide you through using these libraries to implement effective statistical models for predictive analytics.
You’ll start by delving into classical statistical analysis, where you will learn to compute descriptive statistics using pandas. You will focus on supervised learning, which will help you explore the principles of machine learning and train different machine learning models from scratch. Next, you will work with binary prediction models, such as data classification using k-nearest neighbors, decision trees, and random forests. The book will also cover algorithms for regression analysis, such as ridge and lasso regression, and their implementation in Python. In later chapters, you will learn how neural networks can be trained and deployed for more accurate predictions, and understand which Python libraries can be used to implement them.
By the end of this book, you will have the knowledge you need to design, build, and deploy enterprise-grade statistical models for machine learning using Python and its rich ecosystem of libraries for predictive analytics.
Table of Contents (9 chapters)
Preface
Classical Statistical Analysis
Introduction to Supervised Learning
Binary Prediction Models
Regression Analysis and How to Use It
Neural Networks
Clustering Techniques
Dimensionality Reduction
Other Books You May Enjoy