Index
A
- .append() method
- append() method
B
- BeautifulSoup
- TreeBuilders / Understanding the features argument
- Beautiful Soup
- installing / Installing Beautiful Soup
- installing, in Linux / Installing Beautiful Soup in Linux
- installing, in Windows / Installing Beautiful Soup in Windows
- using, without installation / Using Beautiful Soup without installation
- installation, verifying / Verifying the installation
- searching / Searching in Beautiful Soup
- search methods / Searching in Beautiful Soup
- navigation / Navigation using Beautiful Soup
- encoding / Encoding in Beautiful Soup
- output encoding / Output encoding
- formatted printing / Formatted printing
- unformatted printing / Unformatted printing
- output formatters / Output formatters in Beautiful Soup
- Beautiful Soup installation, in Linux
- performing / Installing Beautiful Soup in Linux
- package manager, used / Installing Beautiful Soup using package manager
- pip, used / Installing Beautiful Soup using pip
- easy_install, used / Installing Beautiful Soup using easy_install
- Beautiful Soup installation, in Windows
- performing / Installing Beautiful Soup in Windows
- Python path, verifying / Verifying Python path in Windows
- setup.py, used / Installing Beautiful Soup using setup.py
- BeautifulSoup object
- creating / Creating a BeautifulSoup object
- creating, from string / Creating a BeautifulSoup object from a string
- creating, from file-like object / Creating a BeautifulSoup object from a file-like object
- creating, for XML parsing / Creating a BeautifulSoup object for XML parsing
- features argument / Understanding the features argument
C
- .children attribute
- about / The .children attribute
- .contents attribute
- about / The .contents attribute
- clear() method
D
- .descendants attribute
- about / The .descendants attribute
- decode() function
- about / Unformatted printing
- decompose() method
- descendants
- about / Using predefined attributes
- direct children
- about / Using predefined attributes
E
- easy_install
- encode() method
- about / Output encoding, Unformatted printing
- extract() method
F
- features argument
- file-like object
- about / Creating a BeautifulSoup object from a file-like object
- BeautifulSoup object, creating from / Creating a BeautifulSoup object from a file-like object
- find() method
- about / Explaining find()
- find() method, searching with
- tag, searching / Searching for tags
- text, searching / Searching for text
- regular expressions based search / Searching based on regular expressions
- tag attribute values based search / Searching based on attribute values of a tag
- searching, functions used / Searching using functions defined
- searching methods, applying in combination / Applying searching methods in combination
- find_all() method, searching with
- about / Searching with find_all()
- tertiary consumers, searching / Finding all tertiary consumers
- parameters, using / Understanding parameters used with find_all()
- find_all_previous() method / Searching for previous
- find_next_sibling() method / Searching for siblings
- find_next_siblings() method
- about / Searching for siblings
- find_parents() method
- about / Searching with find_all()
- find_previous_sibling() method / Searching for siblings
- find_previous_siblings() method / Searching for siblings
- find_siblings() method
- about / Searching with find_all()
- formatted printing
- about / Formatted printing
- function formatter
- about / The function formatter
- functions, for content modifications
- insert_after() / Special functions to modify content
- insert_before() / Special functions to modify content
- replace_with() method / Special functions to modify content
- wrap() method / Special functions to modify content
- unwrap() method / Special functions to modify content
G
- get_bookurls() function / Finding pages with a list of books
- get_text() method
- about / Using get_text()
- using / Using get_text()
H
- html.parser
- html5lib parser
- HTML document
- encoding, specifying / Specifying the encoding of the HTML document
- html formatter
- about / The html formatter
I
- insert() method
- insert_after() method
- installation
- Beautiful Soup / Installing Beautiful Soup
L
- limit parameter, find_all() method / Understanding parameters used with find_all()
- Linux
- Beautiful Soup, installing / Installing Beautiful Soup in Linux
- lxml parser
M
- minimal formatter
- about / The minimal formatter
N
- .next_sibling attribute
- about / The .next_sibling attribute
- NavigableString object
- navigating down
- about / Navigating down
- child tag name, using / Using the name of the child tag
- predefined attributes, using / Using predefined attributes
- special attributes / Special attributes for navigating down
- navigating sideways, to siblings
- .next_sibling attribute / The .next_sibling attribute
- .previous_sibling attribute / The .previous_sibling attribute
- navigating up
- about / Navigating up
- .parent attribute / The .parent attribute
- .parents attribute / The .parents attribute
- navigation, Beautiful Soup
- about / Navigation using Beautiful Soup
- navigating down / Navigating down
- navigating up / Navigating up
- navigating sideways to siblings / Navigating sideways to the siblings
- navigating to next and previous objects parsed / Navigating to the previous and next objects parsed
- new tag
- adding, new_tag() method used / Adding a new tag
- new producer, adding using new_tag() and append() / Adding a new producer using new_tag() and append()
- creating, new_tag() method used / Creating a new tag using new_tag()
- adding, append() method used / Adding a new tag using append()
- new div tag, adding to li tag / Adding a new div tag to the li tag using insert()
- new_tag() method
- next sibling
- searching for / Searching for next
- None formatter
- about / The None formatter
O
- output encoding
- about / Output encoding
- output formatters
- about / Output formatters in Beautiful Soup
- minimal formatter / The minimal formatter
- html formatter / The html formatter
- None formatter / The None formatter
- function formatter / The function formatter
P
- .parent attribute
- about / The .parent attribute
- .parents attribute
- about / The .parents attribute
- .previous_sibling attribute
- about / The .previous_sibling attribute
- parameters, find_all() method
- limit parameter / Understanding parameters used with find_all()
- pip
- predefined attributes, for navigating down
- about / Using predefined attributes
- direct children / Using predefined attributes
- descendants / Using predefined attributes
- .contents attribute / The .contents attribute
- .children attribute / The .children attribute
- .descendants attribute / The .descendants attribute
- prettify() method
- about / Output encoding, Formatted printing
- previous sibling
- searching for / Searching for previous
- Python Package Index (PyPI) / Installing Beautiful Soup using easy_install
R
- replace_with() method
S
- .string attribute
- about / The .string attribute
- .strings attribute
- about / The .strings attribute
- searching, in Beautiful Soup
- about / Searching in Beautiful Soup
- find() method used / Searching with find()
- find_all() method used / Searching with find_all()
- tags, searching / Searching for Tags in relation
- parent tags, searching / Searching for the parent tags
- siblings, searching / Searching for siblings
- next sibling, searching / Searching for next
- previous sibling, searching / Searching for previous
- searching, with find()
- performing / Searching with find()
- first producer, finding / Finding the first producer
- searching methods
- about / Searching in Beautiful Soup
- used, for scraping information from web page / Using search methods to scrape information from a web page
- setup.py script
- siblings
- searching for / Searching for siblings
- soup.original_encoding
- str() method
- about / Unformatted printing
- string contents, modifying
- about / Modifying string contents
- .string attribute, used / Using .string to modify the string content
- strings, adding using .append() method / Adding strings using .append(), insert(), and new_string()
- strings, adding using new_string() method / Adding strings using .append(), insert(), and new_string()
- strings, adding using insert() method / Adding strings using .append(), insert(), and new_string()
T
- tag attribute values based search, find() method used
- about / Searching based on attribute values of a tag
- first primary consumer, finding / Finding the first primary consumer
- custom attributes based search / Searching based on custom attributes
- CSS class based search / Searching based on the CSS class
- Tag modifying, Beautiful Soup used
- name property, modifying / Modifying the name property of Tag
- attribute values, modifying / Modifying the attribute values of Tag
- attribute value, updating / Updating the existing attribute value of Tag
- attribute value, adding / Adding new attribute values to Tag
- tag attributes, deleting / Deleting the tag attributes
- new tag, adding / Adding a new tag
- Tag object
- about / Tag
- accessing, from BeautifulSoup / Accessing the Tag object from BeautifulSoup
- name / Name of the Tag object
- attributes / Attributes of a Tag object
- tags
- searching for / Searching for Tags in relation
- parent tags, searching for / Searching for the parent tags
- deleting, from HTML document / Deleting tags from the HTML document
- producer, deleting using decompose() / Deleting the producer using decompose()
- producer, deleting using extract() / Deleting the producer using extract()
- contents, deleting using Beautiful Soup / Deleting the contents of a tag using Beautiful Soup
- TreeBuilders
U
- unformatted printing
- about / Unformatted printing
- unicode() method
- about / Unformatted printing
- UnicodeDammit library / Specifying the encoding of the HTML document
- unwrap() method / Special functions to modify content
- UTF-8 encoding
W
- website scraper
- book details, getting / Getting book details from PacktPub.com
- pages containing list of books, finding / Finding pages with a list of books
- creating / Finding pages with a list of books
- book details, finding / Finding book details
- selling price, searching on Amazon / Getting selling prices from Amazon
- selling price, getting from Barnes and Noble / Getting the selling price from Barnes and Noble
- wrap() method