Automate it! - Recipes to upskill your business

Automate it! - Recipes to upskill your business

By : Chetan Giridhar

Buy this Book

Automate it! - Recipes to upskill your business

By: Chetan Giridhar

Buy this Book

Overview of this book

This book gives you a great selection of recipes to automate your business processes with Python, and provides a platform for you to understand how Python is useful to make time consuming and repetitive business tasks more efficient. Python is a mature high level language, has object-oriented programming features, powers various apps, has a huge set of modules, and great community support. Python is extremely easy to use, can help you get complex tasks done efficiently and is an apt choice for our needs. With a classic problem-solution based approach and real-world examples, you will delve into things that automate your business processes. You will begin by learning about the Python modules to work with Web, Worksheets, Presentations and PDFs. You’ll leverage Python recipes to automate processes in HR, Finance and making them efficient and reliable. For instance, company payroll — an integral process in HR will be automated with Python recipes. A few chapters of this book will also help you gain knowledge on working with bots and computer vision. You will learn how to build bots for automating business use cases by integrating artificial intelligence. You’ll also understand how Python is helpful in face detection and building a scanner of your own. You will see how to effectively and easily use Python code to manage SMS and voice notifications, opening a world of possibilities using cloud telephony to solve your business needs. Moving forward, you will learn to work with APIs, Webhooks and Emails to automate Marketing and Customer Support processes. Finally, using the various Python libraries, this book will arm you with knowledge to customize data solutions and generate reports to meet your business needs. This book will help you up-skill and make your business processes efficient with the various Python recipes covered in this book.

Automate it!

Credits

About the Author

About the Reviewer

www.PacktPub.com

Customer Feedback

Preface

Free Chapter

Working with the Web

Introduction

Making HTTP requests

A brief look at web scraping

Parsing and extracting web content

Downloading content from the Web

Working with third-party REST APIs

Asynchronous HTTP server in Python

Web automation with selenium bindings

Automating lead generation with web scraping

Working with CSV and Excel Worksheets

Introduction

Reading CSV files with reader objects

Writing data into CSV files

Developing your own CSV dialects

Managing employee information in an automated way

Reading Excel sheets

Writing data into worksheets

Formatting Excel cells

Playing with Excel formulae

Building charts within Excel sheets

Automating the comparison of company financials

Getting Creative with PDF Files and Documents

Introduction

Extracting data from PDF files

Creating and copying PDF documents

Manipulating PDFs (adding header/footer, merge, split, delete)

Automating generation of payslips for finance department

Reading Word documents

Writing data into Word documents (adding headings, images, tables)

Generating personalized new hire orientation for HR team in an automated way

Playing with SMS and Voice Notifications

Introduction

Registering with a cloud telephony provider

Sending text messages

Receiving SMS messages

SMS workflows for Domino's

Sending voice messages

Receiving voice calls

Building your own customer service software

Fun with E-mails

Introduction

Sending e-mail messages

E-mail encryption

Beautifying e-mail messages with MIME

E-mail messages with attachments

Connecting to your inbox

Fetching and reading e-mail messages

Marking e-mail messages

Clearing up e-mail messages from your inbox

Automating customer support flows with with e-mail responses

Presentations and More

Introduction

Reading PowerPoint presentations

Creating and updating presentations, and adding slides

Playing with layouts, placeholders, and textboxes

Working with different shapes and adding tables

Visual treat with pictures and charts

Automating weekly sales reports

Power of APIs

Introduction

Designing your own REST APIs

Automating social media marketing with Twitter APIs

An introduction to Webhooks

Implementing Webhooks

Automating lead management with Webhooks

Talking to Bots

Introduction

Building a moody Telegram bot

Different types of bots

A smart bot with artificial intelligence

Automating business processes with bots

Imagining Images

Introduction

Converting images

Resizing, cropping, and generating thumbnails

Copy-pasting and watermarking images

Image differences and comparison

Face detection

Imaging as a business process

Data Analysis and Visualizations

Introduction

Reading, selecting, and interpreting data with visualizations

Generating insights using data filtering and aggregation

Automating social media analysis for businesses

Time in the Zone

Introduction

Working with time, date, and calendar

Comparing and combining the date and time objects, and date arithmetic

Formatting and parsing dates

Dealing with time zone calculations

Automating invoicing based on user time zone

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Parsing and extracting web content

Well, now we're confident about making HTTP requests to multiple URLs. We also looked at a simple example of web scraping.

But WWW is made up of pages with multiple data formats. If we want to scrape the Web and make sense of the data, we should also know how to parse different formats in which data is available on the Web.

In this recipe, we'll discuss how to s.

Getting ready

Data on the Web is mostly in the HTML or XML format. To understand how to parse web content, we'll take an example of an HTML file. We'll learn how to select certain HTML elements and extract the desired data. For this recipe, you need to install the BeautifulSoup module of Python. The BeautifulSoup module is one of the most comprehensive Python modules that will do a good job of parsing HTML content. So, let's get started.

How to do it...

We start by installing BeautifulSoup on our Python instance. The following command will help us install the module. We install the latest version, which is beautifulsoup4:
```
        pip install beautifulsoup4
```

Now, let's take a look at the following HTML file, which will help us learn how to parse the HTML content:

        <html xmlns="http://www.w3.org/1999/html">
        <head>
            <title>Enjoy Facebook!</title> 
        </head>
        <body>
            <p>
              <span>You know it's easy to get intouch with
              your <strong>Friends</strong> on web!<br></span>
              Click here <a href="https://facebook.com">here</a>
              to sign up and enjoy<br>
            </p>
            <p class="wow"> Your gateway to social web! </p>
            <div id="inventor">Mark Zuckerberg</div>
            Facebook, a webapp used by millions
        </body>
        </html>

Let's name this file as python.html. Our HTML file is hand-crafted so that we can learn the multiple ways of parsing it to get the required data from it. Python.html has typical HTML tags given as follows:
- <head> - It is the container of all head elements like <title>.
- <body> - It defines the body of the HTML document.
-  - This element defines a paragraph in HTML.
-  - It is used to group inline elements in a document.
-  - It is used to apply a bold style to the text present under this tag.
- <a> - It represents a hyperlink or anchor and contains <href> that points to the hyperlink.
- <class> - It is an attribute that points to a class in a style sheet.
- <div id> - It is a container that encapsulates other page elements and divides the content into sections. Every section can be identified by attribute id.
If we open this HTML in a browser, this is how it'll look:
Let's now write some Python code to parse this HTML file. We start by creating a BeautifulSoup object.
Tip
We always need to define the parser. In this case we used lxml as the parser. The parser helps us read files in a designated format so that querying data becomes easy.
```
        import bs4
        myfile = open('python.html')
        soup = bs4.BeautifulSoup(myfile, "lxml")
        #Making the soup
        print "BeautifulSoup Object:", type(soup)
```
The output of the preceding code is seen in the following screenshot:
OK, that's neat, but how do we retrieve data? Before we try to retrieve data, we need to select the HTML elements that contain the data we need.

We can select or find HTML elements in different ways. We could select elements with ID, CSS, or tags. The following code uses python.html to demonstrate this concept:

        #Find Elements By tags
        print soup.find_all('a')
        print soup.find_all('strong')
        #Find Elements By id
        print soup.find('div', {"id":"inventor"})
        print soup.select('#inventor')
        #Find Elements by css print
        soup.select('.wow')

The output of the preceding code can be viewed in the following screenshot:

Now let's move on and get the actual content from the HTML file. The following are a few ways in which we can extract the data of interest:

        print "Facebook URL:", soup.find_all('a')[0]['href']
        print "Inventor:", soup.find('div', {"id":"inventor"}).text 
        print "Span content:", soup.select('span')[0].getText()

The output of the preceding code snippet is as follows:

Whoopie! See how we got all the text we wanted from the HTML elements.

How it works...

In this recipe, you learnt the skill of finding or selecting different HTML elements based on ID, CSS, or tags.

In the second code example of this recipe, we used find_all('a') to get all the anchor elements from the HTML file. When we used the find_all() method, we got multiple instances of the match as an array. The select() method helps you reach the element directly.

We also used find('div', <divId>) or select(<divId>) to select HTML elements by div Id. Note how we selected the inventor element with div ID #inventor in two ways using the find() and select() methods. Actually, the select method can also be used as select(<class-name>) to select HTML elements with a CSS class name. We used this method to select element wow in our example.

In the third code example, we searched for all the anchor elements in the HTML page and looked at the first index with soup.find_all('a')[0]. Note that since we have only one anchor tag, we used the index 0 to select that element, but if we had multiple anchor tags, it could be accessed with index 1. Methods like getText() and attributes like text (as seen in the preceding examples) help in extracting the actual content from the elements.

There's more...

Cool, so we understood how to parse a web page (or an HTML page) with Python. You also learnt how to select or find HTML elements by ID, CSS, or tags. We also looked at examples of how to extract the required content from HTML. What if we want to download the contents of a page or file from the Web? Let's see if we can achieve that in our next recipe.

Automate it! - Recipes to upskill your business

By : Chetan Giridhar

Automate it! - Recipes to upskill your business

By: Chetan Giridhar

Overview of this book

Related Content you might be interested in

Current Title:

Automate it! - Recipes to upskill your business

Python Automation Cookbook

Mastering Python Scripting for System Administrators

Python Automation Cookbook.

Parsing and extracting web content

Getting ready

How to do it...

Tip

How it works...

There's more...