Beginning Data Science with Python and Jupyter

Beginning Data Science with Python and Jupyter

By : Alex Galea

Buy this Book

Beginning Data Science with Python and Jupyter

By: Alex Galea

Buy this Book

Overview of this book

Get to grips with the skills you need for entry-level data science in this hands-on Python and Jupyter course. You'll learn about some of the most commonly used libraries that are part of the Anaconda distribution, and then explore machine learning models with real datasets to give you the skills and exposure you need for the real world. We'll finish up by showing you how easy it can be to scrape and gather your own data from the open web, so that you can apply your new skills in an actionable context.

Beginning Data Science with Python and Jupyter

Contributors

Preface

Free Chapter

Jupyter Fundamentals

Lesson Objectives

Basic Functionality and Features

Our First Analysis - The Boston Housing Dataset

Summary

Data Cleaning and Advanced Machine Learning

Preparing to Train a Predictive Model

Training Classification Models

Summary

Web Scraping and Interactive Visualizations

Lesson Objectives

Scraping Web Page Data

Interactive Visualizations

Summary

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Scraping Web Page Data

In the spirit of leveraging the internet as a database, we can think about acquiring data from web pages either by scraping content or by interfacing with web APIs. Generally, scraping content means getting the computer to read data that was intended to be displayed in a human-readable format. This is in contradistinction to web APIs, where data is delivered in machine-readable formats – the most common being JSON.

In this topic, we will focus on web scraping. The exact process for doing this will depend on the page and desired content. However, as we will see, it's quite easy to scrape anything we need from an HTML page so long as we have an understanding of the underlying concepts and tools. In this topic, we'll use Wikipedia as an example and scrape tabular content from an article. Then, we'll apply the same techniques to scrape data from a page on an entirely separate domain. But first, we'll take some time to introduce HTTP requests.

Beginning Data Science with Python and Jupyter

By : Alex Galea

Beginning Data Science with Python and Jupyter

By: Alex Galea

Overview of this book

Related Content you might be interested in

Current Title:

Beginning Data Science with Python and Jupyter

Become a Python Data Analyst

Applied Supervised Learning with Python

The Data Science Workshop

Scraping Web Page Data

Subtopic A: Introduction to...