Web Scraping Tutorial with Scrapy and Python for Beginners
Overview of this book
Web scraping is the process of extracting desired data from websites. In this course, you will learn web scraping with Python and Scrapy through a step-by-step, in-depth guide.
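To make "extracting desired data" concrete, here is a minimal sketch that uses only Python's standard library rather than Scrapy. The sample HTML, the `title` class name, and the `extract_titles` helper are assumptions made up for illustration; a Scrapy spider would typically express the same idea with selectors such as `response.css(...)`.

```python
from html.parser import HTMLParser


class TitleExtractor(HTMLParser):
    """Collect the text of every <h2 class="title"> element."""

    def __init__(self):
        super().__init__()
        self._in_title = False  # True while inside a matching <h2>
        self.titles = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) tuples
        if tag == "h2" and ("class", "title") in attrs:
            self._in_title = True
            self.titles.append("")

    def handle_data(self, data):
        if self._in_title:
            self.titles[-1] += data

    def handle_endtag(self, tag):
        if tag == "h2":
            self._in_title = False


def extract_titles(html):
    """Return the stripped text of each title heading in the given HTML."""
    parser = TitleExtractor()
    parser.feed(html)
    return [title.strip() for title in parser.titles]


# Hypothetical page content, standing in for a downloaded web page.
SAMPLE_HTML = """
<html><body>
  <h2 class="title">First product</h2>
  <p>Not a title</p>
  <h2 class="title">Second product</h2>
  <h2 class="other">Ignored</h2>
</body></html>
"""

print(extract_titles(SAMPLE_HTML))  # ['First product', 'Second product']
```

The parser keeps only the headings that match the chosen tag and class, which is the core of any scraper: locate the elements of interest and discard the rest.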
The course starts by introducing the web scraping process with infographics and no code, showing how data is scraped from websites and where Scrapy fits in. With the basics clear, you will build an actual web scraper using Python and the Scrapy framework and see first-hand how web scraping works. You will then look at the essential concepts of web scraping and Scrapy. Knowing how to scrape websites and understanding these essentials already gives you a complete foundation, and the course takes this further with advanced web scraping techniques.
Advanced topics are explained thoroughly: crawling multiple pages and extracting data (pagination), scraping data using regular expressions (RegEx), and scraping dynamic or JavaScript-rendered websites using Scrapy Playwright. Finally, you will complete three projects at the end: Champions League Table [ESPN], Product Tracker [Amazon], and Scraper Application [GUI].
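As a rough illustration of two of these topics, pagination and regular-expression extraction, the sketch below follows "next" links across pages and pulls prices out with RegEx. It uses only the standard library and an in-memory dictionary in place of real HTTP requests; the `PAGES` data, the `next` link class, and the `crawl_prices` helper are all hypothetical. In Scrapy, following the next page is usually done from inside a spider with `response.follow(...)`.

```python
import re

# Hypothetical in-memory "site": URL path -> HTML body (no network access).
PAGES = {
    "/page/1": (
        "<li>Widget $19.99</li><li>Gadget $5.00</li>"
        '<a class="next" href="/page/2">Next</a>'
    ),
    "/page/2": "<li>Gizmo $120.50</li>",
}

# Prices written as a dollar sign followed by digits and two decimals.
PRICE_RE = re.compile(r"\$(\d+\.\d{2})")
# The link to the following page, if the current page has one.
NEXT_RE = re.compile(r'<a class="next" href="([^"]+)"')


def crawl_prices(start, pages):
    """Follow 'next' links from start and collect every price found."""
    prices = []
    seen = set()  # guards against pagination loops
    url = start
    while url is not None and url not in seen:
        seen.add(url)
        html = pages.get(url)
        if html is None:  # dead link: stop crawling
            break
        prices.extend(float(p) for p in PRICE_RE.findall(html))
        match = NEXT_RE.search(html)
        url = match.group(1) if match else None
    return prices


print(crawl_prices("/page/1", PAGES))  # [19.99, 5.0, 120.5]
```

The loop stops when a page has no "next" link, when a link points to a missing page, or when a page has already been visited, which keeps the crawl bounded even on sites with circular pagination.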
By the end of this course, you will have learned how to do web scraping using Python and Scrapy.
All the resource files are added to the GitHub repository at:
https://github.com/PacktPublishing/Web-Scraping-Tutorial-with-Scrapy-and-Python-for-Beginners-
Table of Contents (22 chapters)
Introduction to the Course
Scrapy Installation
Scrapy Spider
CSS Selectors
Scrapy Items
Exporting Data
Scrapy Item Pipeline
Pagination
Following Links
Scraping Tables
Logging into Websites
Scraping JavaScript Rendered Websites
Scrapy Playwright
User Agents and Proxies
Tips and Tricks
Project 1: Champions League Table from ESPN.com
Project 2: Amazon Product Rank
Project 3: Extending Scraper with GUI