Advanced Python Programming

By: Dr. Gabriele Lanaro, Quan Nguyen, Sakis Kasampalis

Overview of this book

This Learning Path shows you how to leverage the power of both native and third-party Python libraries for building robust and responsive applications. You will learn about profilers and reactive programming, concurrency and parallelism, as well as tools for making your apps quick and efficient. You will discover how to write code for parallel architectures using TensorFlow and Theano, and use a cluster of computers for large-scale computations using technologies such as Dask and PySpark. With the knowledge of how Python design patterns work, you will be able to clone objects, secure interfaces, dynamically choose algorithms, and accomplish much more in high performance computing. By the end of this Learning Path, you will have the skills and confidence to build engaging models that quickly offer efficient solutions to your problems.

This Learning Path includes content from the following Packt products:

• Python High Performance - Second Edition by Gabriele Lanaro
• Mastering Concurrency in Python by Quan Nguyen
• Mastering Python Design Patterns by Sakis Kasampalis

Profiling memory usage with memory_profiler

In some cases, high memory usage constitutes an issue. For example, if we want to handle a huge number of particles, we will incur a memory overhead due to the creation of many Particle instances.

The memory_profiler module summarizes, in a way similar to line_profiler, the memory usage of the process.

Note

The memory_profiler package is also available on the Python Package Index. You should also install the psutil module (https://github.com/giampaolo/psutil) as an optional dependency that will make memory_profiler considerably faster.

Just like line_profiler, memory_profiler also requires the instrumentation of the source code by placing a @profile decorator on the function we intend to monitor. In our case, we want to analyze the benchmark function.

We can slightly change benchmark to instantiate a considerable number (100,000) of Particle instances and decrease the simulation time. We call the modified function benchmark_memory:

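    from random import uniform

    # Particle and ParticleSimulator are the classes defined earlier in this chapter.
    def benchmark_memory():
        # Create 100,000 particles with random arguments in [-1.0, 1.0]
        particles = [Particle(uniform(-1.0, 1.0),
                              uniform(-1.0, 1.0),
                              uniform(-1.0, 1.0))
                     for i in range(100000)]

        simulator = ParticleSimulator(particles)
        # Use a short simulation time to keep the run brief
        simulator.evolve(0.001)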

We can use memory_profiler from an IPython shell through the %mprun magic command as shown in the following screenshot:

Note

It is possible to run memory_profiler from the shell using the mprof run command after adding the @profile decorator.

From the Increment column, we can see that 100,000 Particle objects take 23.7 MiB of memory.

Note

1 MiB (mebibyte) is equivalent to 1,048,576 bytes. It is different from 1 MB (megabyte), which is equivalent to 1,000,000 bytes.
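As a quick check of the arithmetic, the conversion can be sketched in a few lines of Python. The helper names below are hypothetical and introduced only for illustration; the 23.7 MiB figure and the count of 100,000 particles are the ones quoted above.

```python
# Illustrative unit-conversion helpers; the names are hypothetical.
BYTES_PER_MIB = 1024 ** 2   # 1,048,576 bytes in one mebibyte
BYTES_PER_MB = 1000 ** 2    # 1,000,000 bytes in one megabyte


def mib_to_bytes(mib):
    """Convert mebibytes to bytes."""
    return mib * BYTES_PER_MIB


def bytes_to_mb(n_bytes):
    """Convert bytes to decimal megabytes."""
    return n_bytes / BYTES_PER_MB


total_bytes = mib_to_bytes(23.7)       # increment reported in the text
per_particle = total_bytes / 100_000   # average bytes per Particle

print(f"{total_bytes:,.0f} bytes")
print(f"{bytes_to_mb(total_bytes):.2f} MB")
print(f"{per_particle:.1f} bytes per particle")
```

The same increment therefore reads as a slightly larger number when expressed in MB, and it works out to roughly 250 bytes per particle on average.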

We can use __slots__ on the Particle class to reduce its memory footprint. This feature saves memory by not storing instance attributes in a per-instance dictionary. The strategy has a drawback, however: it prevents the addition of attributes other than the ones specified in __slots__:

    class Particle:
        # Only these attributes can exist on an instance; no per-instance __dict__ is created
        __slots__ = ('x', 'y', 'ang_vel')

        def __init__(self, x, y, ang_vel):
            self.x = x
            self.y = y
            self.ang_vel = ang_vel
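The trade-off can be illustrated with a small, self-contained sketch. PlainParticle is a hypothetical counterpart without __slots__, added here only for comparison; SlottedParticle mirrors the class above under a different name so that both can coexist in one script.

```python
class PlainParticle:
    """Hypothetical comparison class: attributes live in a per-instance __dict__."""

    def __init__(self, x, y, ang_vel):
        self.x = x
        self.y = y
        self.ang_vel = ang_vel


class SlottedParticle:
    """Same fields as the Particle class above, declared through __slots__."""
    __slots__ = ('x', 'y', 'ang_vel')

    def __init__(self, x, y, ang_vel):
        self.x = x
        self.y = y
        self.ang_vel = ang_vel


plain = PlainParticle(0.3, 0.5, 1.0)
slotted = SlottedParticle(0.3, 0.5, 1.0)

# A regular instance accepts new attributes at any time.
plain.mass = 2.0

# A slotted instance has no __dict__, so undeclared attributes are rejected.
try:
    slotted.mass = 2.0
    rejected = False
except AttributeError:
    rejected = True

print(hasattr(plain, '__dict__'))    # True
print(hasattr(slotted, '__dict__'))  # False
print(rejected)                      # True
```

Code that attaches extra attributes to particles at runtime would need those names added to __slots__ first.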

We can now rerun our benchmark to assess the change in memory consumption; the result is displayed in the following screenshot:

By rewriting the Particle class using __slots__, we can save about 10 MiB of memory.
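If memory_profiler is not at hand, a rough version of the same comparison can be made with the standard library's tracemalloc module. This is a different tool from the one used above, so its figures should not be expected to match the memory_profiler output. The class and function names are hypothetical, and the instance count is reduced (an assumption made here to keep the run short).

```python
import tracemalloc
from random import uniform


class PlainParticle:
    """Hypothetical comparison class without __slots__."""

    def __init__(self, x, y, ang_vel):
        self.x = x
        self.y = y
        self.ang_vel = ang_vel


class SlottedParticle:
    """Same fields, declared through __slots__."""
    __slots__ = ('x', 'y', 'ang_vel')

    def __init__(self, x, y, ang_vel):
        self.x = x
        self.y = y
        self.ang_vel = ang_vel


def peak_memory(cls, n=20_000):
    """Return the peak traced memory, in bytes, while building n instances of cls."""
    tracemalloc.start()
    particles = [cls(uniform(-1.0, 1.0),
                     uniform(-1.0, 1.0),
                     uniform(-1.0, 1.0))
                 for _ in range(n)]
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return peak


plain_peak = peak_memory(PlainParticle)
slotted_peak = peak_memory(SlottedParticle)

print(f"plain:   {plain_peak / 1024 ** 2:.2f} MiB")
print(f"slotted: {slotted_peak / 1024 ** 2:.2f} MiB")
```

The absolute numbers depend on the Python version and platform; the point of the sketch is only the relative difference between the two classes.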