Mastering Numerical Computing with NumPy

By : Umit Mert Cakmak, Tiago Antao, Mert Cuhadaroglu

Mastering Numerical Computing with NumPy

By: Umit Mert Cakmak, Tiago Antao, Mert Cuhadaroglu

Overview of this book

NumPy is one of the most important scientific computing libraries available for Python. Mastering Numerical Computing with NumPy teaches you how to achieve expert level competency to perform complex operations, with in-depth coverage of advanced concepts. Beginning with NumPy's arrays and functions, you will familiarize yourself with linear algebra concepts to perform vector and matrix math operations. You will thoroughly understand and practice data processing, exploratory data analysis (EDA), and predictive modeling. You will then move on to working on practical examples which will teach you how to use NumPy statistics in order to explore US housing data and develop a predictive model using simple and multiple linear regression techniques. Once you have got to grips with the basics, you will explore unsupervised learning and clustering algorithms, followed by understanding how to write better NumPy code while keeping advanced considerations in mind. The book also demonstrates the use of different high-performance numerical computing libraries and their relationship with NumPy. You will study how to benchmark the performance of different configurations and choose the best for your system. By the end of this book, you will have become an expert in handling and performing complex data manipulations.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Free Chapter

Working with NumPy Arrays

Technical requirements

Why do we need NumPy?

Who uses NumPy?

Introduction to vectors and matrices

Basics of NumPy array objects

NumPy array operations

Working with multidimensional arrays

Indexing, slicing, reshaping, resizing, and broadcasting

Summary

Linear Algebra with NumPy

Vector and matrix mathematics

What's an eigenvalue and how do we compute it?

Computing the norm and determinant

Solving linear equations

Computing gradient

Summary

Exploratory Data Analysis of Boston Housing Data with NumPy Statistics

Loading and saving files

Exploring our dataset

Looking at basic statistics

Computing histograms

Explaining skewness and kurtosis

Trimmed statistics

Box plots

Computing correlations

Summary

Predicting Housing Prices Using Linear Regression

Supervised learning and linear regression

Independent and dependent variables

Hyperparameters

Loss and error functions

Univariate linear regression with gradient descent

Using linear regression to model housing prices

Summary

Clustering Clients of a Wholesale Distributor Using NumPy

Unsupervised learning and clustering

Hyperparameters

The loss function

Implementing our algorithm for a single variable

Modifying our algorithm

Summary

NumPy, SciPy, Pandas, and Scikit-Learn

NumPy and SciPy

NumPy and pandas

SciPy and scikit-learn

Summary

Advanced Numpy

NumPy internals

Summary

Overview of High-Performance Numerical Computing Libraries

BLAS and LAPACK

ATLAS

Intel Math Kernel Library

OpenBLAS

Configuring NumPy with low-level libraries using AWS EC2

Compute-intensive tasks for benchmarking

Summary

Performance Benchmarks

Why do we need a benchmark?

Preparing for a performance benchmark

Results

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

NumPy and pandas

When you think about it, NumPy is a fairly low-level array-manipulation library, and the majority of other Python libraries are written on top of it.

One of these libraries is pandas, which is a high-level data-manipulation library. When you are exploring a dataset, you usually perform operations such as calculating descriptive statistics, grouping by a certain characteristic, and merging. The pandas library has many friendly functions to perform these various useful operations.

Let's use a diabetes dataset in this example. The diabetes dataset in sklearn.datasets is standardized with a zero mean and unit L2 norm.

The dataset contains 442 records with 10 features: age, sex, body mass index, average blood pressure, and six blood serum measurements.

The target represents the disease progression after these baseline measures are taken. You can look at the data...

Mastering Numerical Computing with NumPy

By : Umit Mert Cakmak, Tiago Antao, Mert Cuhadaroglu

Mastering Numerical Computing with NumPy

By: Umit Mert Cakmak, Tiago Antao, Mert Cuhadaroglu

Overview of this book

Related Content you might be interested in

Current Title:

Mastering Numerical Computing with NumPy

Python Data Analysis

Big Data Analysis with Python

Hands-On Automated Machine Learning