Book Image

Jupyter Notebook for Data Science [Video]

By : Drazen Lucanin
5 (1)
Book Image

Jupyter Notebook for Data Science [Video]

5 (1)
By: Drazen Lucanin

Overview of this book

This video course will help you get familiar with Jupyter Notebook and all of its features to perform various data science tasks in Python. Jupyter Notebook is a powerful tool for interactive data exploration and visualization and has become the standard tool among data scientists. In the course, we will start from basic data analysis tasks in Jupyter Notebook and work our way up to learn some common scientific Python tools such as pandas, matplotlib, and plotly. We will work with real datasets, such as crime and traffic accidents in New York City, to explore common issues such as data scraping and cleaning. We will create insightful visualizations, showing time-stamped and spatial data. By the end of the course, you will feel confident about approaching a new dataset, cleaning it up, exploring it, and analyzing it in Jupyter Notebook to extract useful information in the form of interactive reports and information-dense data visualizations. All the code and supporting files for this course are available on GitHub at https://github.com/PacktPublishing/Jupyter-Notebook-for-Data-Science This course uses Jupyter 5.4.1, while not the latest version available, it provides relevant and informative content for data science enthusiasts.
Table of Contents (5 chapters)
Chapter 3
Scraping Data
Content Locked
Section 4
Scraping Websites Using BeautifulSoup
In this video, we will show a more difficult example of scraping data from an unstructured website. - Show the website we will be using to fetch the Chicago weather data - Show how to use BeautifulSoup to download the website and parse the HTML - Show how to convert the parsed HTML object into a pandas DataFrame