Beginning Data Science with Python and Jupyter

Beginning Data Science with Python and Jupyter

By : Chris DallaVilla

Buy this Video

Beginning Data Science with Python and Jupyter

By: Chris DallaVilla

Buy this Video

Overview of this book

Getting started with data science doesn’t have to be an uphill battle. This step-by-step video course is ideal for beginners who know a little Python and are looking for a quick, fast-paced introduction. Get to grips with the skills you need for entry-level data science in this hands-on Python and Jupyter course. You’ll learn about some of the most commonly used libraries that are part of the Anaconda distribution, and then explore machine learning models with real datasets to give you the skills and exposure you need for the real world.We'll start with understanding the basics of Jupyter and its standard features. You'll be analyzing an example of a data analytics report. After analyzing a data analytics report, next step is to implement multiple classification algorithms. We’ll then show you how easy it can be to scrape and gather your own data from the open web, so that you can apply your new skills in an actionable context. Finish up by learning to visualize these data interactively. The code bundle for this course is available at https://github.com/TrainingByPackt/Beginning-Data-Science-with-Python-and-Jupyter-eLearning

Free Chapter

Jupyter Fundamentals

Title Overview

Lesson Overview

Basic Functionality

Useful Features of Jupyter

Python Libraries

Our First Analysis - The Boston Housing Dataset

Introduction to Predictive Analytics

Lesson Summary

Data Cleaning and Advanced Machine Learning

Lesson Overview

Preparing to Train a Predictive Model

Training Classification Models

K-Fold Cross-Validation

Lesson Summary

Web Scraping and Interactive Visualizations

Lesson Overview

Scraping Web Page Data

Interactive Visualizations

Lesson Summary

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Chapter 2

Data Cleaning and Advanced Machine Learning

Section 5

K-Fold Cross-Validation

Thus far, we have trained models on a subset of the data and then assessed performance on the unseen portion, called the test set. This is good practice because the model performance on training data is not a good indicator of its e?ectiveness as a predictor. It's very easy to increase accuracy on a training dataset by overfitting a model, which can result in poorer performance on unseen data. This video covers: - Assessing Models with K-Fold Cross-Validation and Validation Curves - K-Fold Cross Validation - K-Fold Cross Validation Algorithm - Stratified –fold - Validation Curves - Demo on Using K-fold Cross Validation and Validation Curves in Python with Scikit-learn - Dimensionality Reduction Techniques - Principal Component Analysis (PCA) - Key Insights of PCA - Demo on Training a Predictive Model For The Employee Retention Problem

Beginning Data Science with Python and Jupyter

By : Chris DallaVilla

Beginning Data Science with Python and Jupyter

By: Chris DallaVilla

Overview of this book

Related Content you might be interested in

Current Title:

Beginning Data Science with Python and Jupyter