Book Image

Practical Data Science Cookbook

By : Tony Ojeda, Sean Patrick Murphy, Benjamin Bengfort, Abhijit Dasgupta
Book Image

Practical Data Science Cookbook

By: Tony Ojeda, Sean Patrick Murphy, Benjamin Bengfort, Abhijit Dasgupta

Overview of this book

<p>As increasing amounts of data is generated each year, the need to analyze and operationalize it is more important than ever. Companies that know what to do with their data will have a competitive advantage over companies that don't, and this will drive a higher demand for knowledgeable and competent data professionals.</p> <p>Starting with the basics, this book will cover how to set up your numerical programming environment, introduce you to the data science pipeline (an iterative process by which data science projects are completed), and guide you through several data projects in a step-by-step format. By sequentially working through the steps in each chapter, you will quickly familiarize yourself with the process and learn how to apply it to a variety of situations with examples in the two most popular programming languages for data analysis—R and Python.</p>
Table of Contents (18 chapters)
Practical Data Science Cookbook
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Preface
Index

Testing the performance benefits of NumPy


Minor changes to the original code did not get the job done, so we must consider more drastic approaches. Luckily, with Python, we have the NumPy library. NumPy provides a very fast n-dimensional array (called ndarray) data structure for Python and offers a number of operations on this data type that have been implemented in C and are highly optimized.

Before we make any major changes to asa.py, we use this recipe to try a few toy examples to see how much faster the NumPy library's faster yet less flexible ndarray arrays are versus naïve Python lists.

Getting ready

For this recipe, we only need to fire up the Python REPL or, even better, start up IPython. Also, this recipe assumes that you have NumPy installed.

How to do it…

Perform the following steps to test out the performance differential between Python and NumPy:

  1. We must import both the NumPy, random, and math libraries:

    import numpy as np
    import math, random
    

    Note that it is standard practice to import...