Hands-On Image Generation with TensorFlow

By : Soon Yau Cheong

Hands-On Image Generation with TensorFlow

By: Soon Yau Cheong

Overview of this book

The emerging field of Generative Adversarial Networks (GANs) has made it possible to generate indistinguishable images from existing datasets. With this hands-on book, you’ll not only develop image generation skills but also gain a solid understanding of the underlying principles. Starting with an introduction to the fundamentals of image generation using TensorFlow, this book covers Variational Autoencoders (VAEs) and GANs. You’ll discover how to build models for different applications as you get to grips with performing face swaps using deepfakes, neural style transfer, image-to-image translation, turning simple images into photorealistic images, and much more. You’ll also understand how and why to construct state-of-the-art deep neural networks using advanced techniques such as spectral normalization and self-attention layer before working with advanced models for face generation and editing. You'll also be introduced to photo restoration, text-to-image synthesis, video retargeting, and neural rendering. Throughout the book, you’ll learn to implement models from scratch in TensorFlow 2.x, including PixelCNN, VAE, DCGAN, WGAN, pix2pix, CycleGAN, StyleGAN, GauGAN, and BigGAN. By the end of this book, you'll be well versed in TensorFlow and be able to implement image generative technologies confidently.

Preface

Who this book is for

How to use this book

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Reviews

Section 1: Fundamentals of Image Generation with TensorFlow

Free Chapter

Chapter 1: Getting Started with Image Generation Using TensorFlow

Technical requirements

Understanding probabilities

Generating faces with a probabilistic model

Building a PixelCNN model from scratch

Summary

Chapter 2: Variational Autoencoder

Technical requirements

Learning latent variables with autoencoders

Variational autoencoders

Generating faces with VAEs

Controlling face attributes

Summary

Chapter 3: Generative Adversarial Network

Technical requirements

Understanding the fundamentals of GANs

Building a Deep Convolutional GAN (DCGAN)

Challenges in training GANs

Building a Wasserstein GAN

Summary

Section 2: Applications of Deep Generative Models

Chapter 4: Image-to-Image Translation

Technical requirements

Conditional GANs

Image translation with pix2pix

Unpaired image translation with CycleGAN

Diversifying translation with BicyleGAN

Summary

Chapter 5: Style Transfer

Technical requirements

Neural style transfer

Improving style transfer

Arbitrary style transfer in real time

Introduction to style-based GANs

Summary

Chapter 6: AI Painter

Technical requirements

Introduction to iGAN

Segmentation map-to-image translation with GauGAN

Summary

Section 3: Advanced Deep Generative Techniques

Chapter 7: High Fidelity Face Generation

Technical requirements

ProGAN overview

Building a ProGAN

Implementing StyleGAN

Summary

Chapter 8: Self-Attention for Image Generation

Technical requirements

Spectral normalization

Self-attention modules

Building a SAGAN

Implementing BigGAN

Summary

Chapter 9: Video Synthesis

Technical requirements

Video synthesis overview

Implementing face image processing

Building a DeepFake model

Swapping faces

Improving DeepFakes with GANs

Summary

Chapter 10: Road Ahead

Reviewing GANs

Putting your skills into practice

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Text to image

Text-to-image GANs are conditional GANs. However, instead of using class labels as conditions, they use words as the condition to generate images. In earlier practice, GANs used word embeddings as the conditions into the generator and discriminator. Their architectures are similar to conditional GANs, which we learned about in Chapter 4, Image-to-Image Translation. The difference is merely that the embedding of text is generated using a natural language processing (NLP) preprocessing pipeline. The following diagram shows the architecture of a text-conditional GAN:

Figure 10.5 – Text-conditional convolutional GAN architecture where text encoding is used by both the generator and discriminator (Redrawn from: S. Reed et al., 2016, "Generative Adversarial Text to Image Synthesis," https://arxiv.org/abs/1605.05396)

Like normal GANs, generated high-resolution images tend to be blurry. StackGAN resolves this by stacking two networks...

Hands-On Image Generation with TensorFlow

By : Soon Yau Cheong

Hands-On Image Generation with TensorFlow

By: Soon Yau Cheong

Overview of this book

Related Content you might be interested in

Current Title:

Hands-On Image Generation with TensorFlow

Generative AI with Python and TensorFlow 2

Hands-On Generative Adversarial Networks with Keras

Generative Adversarial Networks Projects

Text to image