2. Loading and Processing Data | The TensorFlow Workshop

Book Overview & Buying
Table Of Contents

The TensorFlow Workshop

By : Matthew Moocarme, Abhranshu Bagchi, Anthony So , Anthony Maddalone

4.6 (25)

Buy this Book

The TensorFlow Workshop

4.6 (25)

By: Matthew Moocarme, Abhranshu Bagchi, Anthony So , Anthony Maddalone

Buy this Book

Overview of this book

Getting to grips with tensors, deep learning, and neural networks can be intimidating and confusing for anyone, no matter their experience level. The breadth of information out there, often written at a very high level and aimed at advanced practitioners, can make getting started even more challenging. If this sounds familiar to you, The TensorFlow Workshop is here to help. Combining clear explanations, realistic examples, and plenty of hands-on practice, it’ll quickly get you up and running. You’ll start off with the basics – learning how to load data into TensorFlow, perform tensor operations, and utilize common optimizers and activation functions. As you progress, you’ll experiment with different TensorFlow development tools, including TensorBoard, TensorFlow Hub, and Google Colab, before moving on to solve regression and classification problems with sequential models. Building on this solid foundation, you’ll learn how to tune models and work with different types of neural network, getting hands-on with real-world deep learning applications such as text encoding, temperature forecasting, image augmentation, and audio processing. By the end of this deep learning book, you’ll have the skills, knowledge, and confidence to tackle your own ambitious deep learning projects with TensorFlow.

Preface

About the Book

1. Introduction to Machine Learning with TensorFlow

Introduction

Implementing Artificial Neural Networks in TensorFlow

The TensorFlow Library in Python

Introduction to Tensors

Tensor Addition

Reshaping

Tensor Multiplication

Optimization

Activation functions

Summary

Free Chapter

2. Loading and Processing Data

Introduction

Exploring Data Types

Data Preprocessing

Processing Tabular Data

Processing Image Data

Image Augmentation

Text Processing

Audio Processing

Summary

3. TensorFlow Development

Introduction

TensorBoard

TensorFlow Hub

Google Colab

Summary

4. Regression and Classification Models

Introduction

Sequential Models

Model Fitting

Classification Models

Summary

5. Classification Models

Introduction

Binary Classification

Metrics for Classifiers

Multi-Class Classification

Multi-Label Classification

Summary

6. Regularization and Hyperparameter Tuning

Introduction

Regularization Techniques

Hyperparameter Tuning

Summary

7. Convolutional Neural Networks

Introduction

CNNs

Image Representation

The Convolutional Layer

Pooling Layer

Image Augmentation

Binary Image Classification

Object Classification

Summary

8. Pre-Trained Networks

Introduction

ImageNet

Transfer Learning

Fine-Tuning

TensorFlow Hub

Feature Extraction

Summary

9. Recurrent Neural Networks

Introduction

Sequential Data

Recurrent Neural Networks

Natural Language Processing

Back Propagation Through Time (BPTT)

Summary

10. Custom TensorFlow Components

Introduction

TensorFlow APIs

Implementing Custom Loss Functions

Implementing Custom Layers

Summary

11. Generative Models

Introduction

Text Generation

Generative Adversarial Networks

Deep Convolutional Generative Adversarial Networks (DCGANs)

Summary

Appendix

1. Introduction to Machine Learning with TensorFlow

2. Loading and Processing Data

3. TensorFlow Development

4. Regression and Classification Models

5. Classification Models

6. Regularization and Hyperparameter Tuning

7. Convolutional Neural Networks

8. Pre-Trained Networks

9. Recurrent Neural Networks

10. Custom TensorFlow Components

11. Generative Models

Exploring Data Types

Depending on the source, raw data can be of different forms. Common forms of data include tabular data, images, video, audio, and text. For example, the output from a temperature logger (used to record the temperature at a given location over time) is tabular. Tabular data is structured with rows and columns, and, in the example of a temperature logger, each column may represent a characteristic for each record, such as the time, location, and temperature, while each row may represent the values of each record. The following table shows an example of numerical tabular data:

Figure 2.1: An example of 10 rows of tabular data that consists of numerical values

Image data represents another common form of raw data that is popular for building machine learning models. These models are popular due to the large volume of data that's available. With smartphones and security cameras recording all of life's moments, they have generated an enormous amount of data that can be used to train models.

The dimensions of image data for training are different than they are for tabular data. Each image has a height and width dimension, as well as a color channel adding a third dimension, and the quantity of images adding a fourth. As such, the input tensors for image data models are four-dimensional tensors, whereas the input tensors for tabular data are two-dimensional. The following figure shows an example of labeled training examples of boats and airplanes taken from the Open Images dataset (https://storage.googleapis.com/openimages/web/index.html); the images have been preprocessed so that they all have the same height and width. This data could be used, for example, to train a binary classification model to classify images as boats or airplanes:

Figure 2.2: A sample of image data that can be used for training machine learning models

Other types of raw data that can be used to build machine learning models include text and audio. Like images, their popularity in the machine learning community is derived from the large amount of data that's available. Both audio and text have the challenge of having indeterminate sizes. You will explore how this challenge can be overcome later in this chapter. The following figure shows an audio sample with a sample rate of 44.1 kHz, which means the audio data is sampled 44,100 times per second. This is an example of the type of raw data that is input into virtual assistants, from which they decipher the request and act accordingly:

Figure 2.3: A visual representation of audio data

Now that you know about some of the types of data you may encounter when building machine learning models, in the next section, you will uncover ways to preprocess different types of data.

The TensorFlow Workshop

By : Matthew Moocarme, Abhranshu Bagchi, Anthony So , Anthony Maddalone

The TensorFlow Workshop

By: Matthew Moocarme, Abhranshu Bagchi, Anthony So , Anthony Maddalone

Overview of this book

Exploring Data Types

Confirmation

Buy this book with your credits?

Submit Your Feedback

Create a Free Account To Continue Reading

Sign in to activate your 7-day free access