Book Image

NumPy Cookbook - Second Edition

By : Ivan Idris
Book Image

NumPy Cookbook - Second Edition

By: Ivan Idris

Overview of this book

<p>NumPy has the ability to give you speed and high productivity. High performance calculations can be done easily with clean and efficient code, and it allows you to execute complex algebraic and mathematical computations in no time.</p> <p>This book will give you a solid foundation in NumPy arrays and universal functions. Starting with the installation and configuration of IPython, you'll learn about advanced indexing and array concepts along with commonly used yet effective functions. You will then cover practical concepts such as image processing, special arrays, and universal functions. You will also learn about plotting with Matplotlib and the related SciPy project with the help of examples. At the end of the book, you will study how to explore atmospheric pressure and its related techniques. By the time you finish this book, you'll be able to write clean and fast code with NumPy.</p>
Table of Contents (19 chapters)
NumPy Cookbook Second Edition
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Loading an example dataset


The scikit-learn project comes with a number of datasets and sample images that we can experiment with. In this recipe, we will load an example dataset included in the scikit-learn distribution. The datasets hold data as a NumPy two-dimensional array and metadata linked to the data.

How to do it...

We will load a sample dataset of house prices in Boston. It is a tiny dataset, so if you are looking for a house in Boston, don't get too excited! Other datasets are described at http://scikit-learn.org/dev/modules/classes.html#module-sklearn.datasets.

We will look at the shape of the raw data and its maximum and minimum values. The shape is a tuple, representing the dimensions of the NumPy array. We will do the same for the target array, which contains values that are the learning objectives (determining house price). The following code from sample_data.py accomplishes our goals:

from __future__ import print_function
from sklearn import datasets

boston_prices = datasets...