Machine Learning with Python

By : Oliver Theobald

Machine Learning with Python

By: Oliver Theobald

Overview of this book

The course starts by setting the foundation with an introduction to machine learning, Python, and essential libraries, ensuring you grasp the basics before diving deeper. It then progresses through exploratory data analysis, data scrubbing, and pre-model algorithms, equipping you with the skills to understand and prepare your data for modeling. The journey continues with detailed walkthroughs on creating, evaluating, and optimizing machine learning models, covering key algorithms such as linear and logistic regression, support vector machines, k-nearest neighbors, and tree-based methods. Each section is designed to build upon the previous, reinforcing learning and application of concepts. Wrapping up, the course introduces the next steps, including an introduction to Python for newcomers, ensuring a comprehensive understanding of machine learning applications.

Free Chapter

FOREWORD

DATASETS USED IN THIS BOOK

INTRODUCTION

DEVELOPMENT ENVIRONMENT

MACHINE LEARNING LIBRARIES

EXPLORATORY DATA ANALYSIS

DATA SCRUBBING

PRE-MODEL ALGORITHMS

SPLIT VALIDATION

MODEL DESIGN

LINEAR REGRESSION

LOGISTIC REGRESSION

SUPPORT VECTOR MACHINES

k-NEAREST NEIGHBORS

TREE-BASED METHODS

NEXT STEPS

APPENDIX 1: INTRODUCTION TO PYTHON

APPENDIX 2: PRINT COLUMNS

Customer Reviews

5 star

4 star

3 star

2 star

1 star

PRE-MODEL ALGORITHMS

As an extension of the data scrubbing process, unsupervised learning algorithms are sometimes used in advance of a supervised learning algorithm to prepare the data for prediction modeling. In this way, unsupervised algorithms are used to clean or reshape the data rather than to derive actionable insight.

Examples of pre-model algorithms include dimension reduction techniques, as introduced in the previous chapter, as well as k-means clustering. Both of these algorithms are examined in this chapter.

Principal Component Analysis

One of the most popular dimension reduction techniques is principal component analysis (PCA). Known also as general factor analysis, PCA is useful for dramatically reducing data complexity and visualizing data in fewer dimensions. The practical goal of PCA is to find a low-dimensional representation of the dataset that preserves as much of the original variation as possible. Rather than removing individual features from...

Machine Learning with Python

By : Oliver Theobald

Machine Learning with Python

By: Oliver Theobald

Overview of this book

Related Content you might be interested in

Current Title:

Machine Learning with Python

Machine Learning for Healthcare Analytics Projects

Machine Learning with scikit-learn Quick Start Guide

AI for Absolute Beginners: A Clear Guide to Tomorrow

PRE-MODEL ALGORITHMS