Parallel Programming with Python

Book Image

Parallel Programming with Python

Book Image

Parallel Programming with Python

Overview of this book

Parallel Programming with Python

Parallel Programming with Python

Credits

About the Author

About the Author

Acknowledgments

Acknowledgments

About the Reviewers

About the Reviewers

www.PacktPub.com

www.PacktPub.com

Preface

Free Chapter

Contextualizing Parallel, Concurrent, and Distributed Programming

Contextualizing Parallel, Concurrent, and Distributed Programming

Why use parallel programming?

Exploring common forms of parallelization

Communicating in parallel programming

Identifying parallel programming problems

Discovering Python's parallel programming tools

Taking care of Python GIL

Designing Parallel Algorithms

Designing Parallel Algorithms

The divide and conquer technique

Using data decomposition

Decomposing tasks with pipeline

Processing and mapping

Identifying a Parallelizable Problem

Identifying a Parallelizable Problem

Obtaining the highest Fibonacci value for multiple inputs

Crawling the Web

Using the threading and concurrent.futures Modules

Using the threading and concurrent.futures Modules

Defining threads

Using threading to obtain the Fibonacci series term with multiple inputs

Crawling the Web using the concurrent.futures module

Using Multiprocessing and ProcessPoolExecutor

Using Multiprocessing and ProcessPoolExecutor

Understanding the concept of a process

Implementing multiprocessing communication

Using multiprocessing to compute Fibonacci series terms with multiple inputs

Crawling the Web using ProcessPoolExecutor

Utilizing Parallel Python

Utilizing Parallel Python

Understanding interprocess communication

Using PP to calculate the Fibonacci series term on SMP architecture

Using PP to make a distributed Web crawler

Distributing Tasks with Celery

Distributing Tasks with Celery

Understanding Celery

Understanding Celery's architecture

Setting up the environment

Dispatching a simple task

Using Celery to obtain a Fibonacci series term

Defining queues by task types

Using Celery to make a distributed Web crawler

Doing Things Asynchronously

Doing Things Asynchronously

Understanding blocking, nonblocking, and asynchronous operations

Understanding event loop

Index

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Crawling the Web using the concurrent.futures module

The following section will make use of our code by implementing the parallel Web crawler. In this scheme, we will use a very interesting Python resource, ThreadPoolExecutor, which is featured in the concurrent.futures module. In the previous example, in which we analyzed parallel_fibonacci.py, quite primitive forms of threads were used. Also, at a specific moment, we had to create and initialize more than one thread manually. In larger programs, it is very difficult to manage this kind of situation. In such case, there are mechanisms that allow a thread pool. A thread pool is nothing but a structure that keeps several threads, which are previously created, to be used in a certain process. It aims to reuse threads, thus avoiding unnecessary creation of threads—which is costly.

Basically, as mentioned in the previous chapter, we will have an algorithm that will execute some tasks in stages, and these tasks depend on each other. Here, we will...