Mastering Numerical Computing with NumPy

By: Umit Mert Cakmak, Tiago Antao, Mert Cuhadaroglu

Overview of this book

NumPy is one of the most important scientific computing libraries available for Python. Mastering Numerical Computing with NumPy teaches you how to achieve expert-level competency in performing complex operations, with in-depth coverage of advanced concepts. Beginning with NumPy's arrays and functions, you will familiarize yourself with linear algebra concepts in order to perform vector and matrix math operations. You will thoroughly understand and practice data processing, exploratory data analysis (EDA), and predictive modeling. You will then move on to practical examples that teach you how to use NumPy statistics to explore US housing data and develop a predictive model using simple and multiple linear regression techniques. Once you have got to grips with the basics, you will explore unsupervised learning and clustering algorithms, followed by understanding how to write better NumPy code while keeping advanced considerations in mind. The book also demonstrates the use of different high-performance numerical computing libraries and their relationship with NumPy. You will study how to benchmark the performance of different configurations and choose the best one for your system. By the end of this book, you will have become an expert in handling and performing complex data manipulations.

Trimmed statistics

As you will have noticed in the previous section, the distributions of our features are very dispersed. Handling outliers is an important part of your analysis, and it is especially crucial when you look at descriptive statistics, because extreme values can easily distort a distribution and lead you to misinterpret it. SciPy provides an extensive set of statistical functions for computing descriptive statistics on trimmed data. The main idea of trimmed statistics is to remove the outliers (the tails) in order to reduce their effect on the statistical calculations. Let's see how we can use these functions and how they affect our feature distribution:

In [58]: np.set_printoptions(suppress=True, linewidth=125)
samples = dataset.data
# Extract the first feature column (CRIM) from the samples
CRIM = samples[:, 0:1]
minimum = np.round(np.amin(CRIM), decimals...
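
Before going further, here is a minimal, self-contained sketch (not taken from the book's dataset) of how trimming changes summary statistics. It uses synthetic, right-skewed data and the scipy.stats helpers trim_mean and trimboth:

import numpy as np
from scipy import stats

# Synthetic, right-skewed data: mostly small values plus a cluster of large outliers
rng = np.random.default_rng(0)
data = np.concatenate([rng.normal(3, 1, 950), rng.normal(40, 5, 50)])

print(np.mean(data))               # plain mean, pulled upwards by the outliers
print(stats.trim_mean(data, 0.1))  # mean after cutting 10% off each tail

# trimboth returns the data with the most extreme 10% removed from both ends,
# which you can then feed into any descriptive statistic
trimmed = stats.trimboth(data, 0.1)
print(trimmed.mean(), trimmed.min(), trimmed.max())

With 10% cut from each tail, the trimmed mean sits close to the bulk of the distribution (around 3), whereas the plain mean is inflated by the outlier cluster.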