Hands-On Image Generation with TensorFlow

By: Soon Yau Cheong

Overview of this book

The emerging field of Generative Adversarial Networks (GANs) has made it possible to generate images that are indistinguishable from those in existing datasets. With this hands-on book, you'll not only develop image generation skills but also gain a solid understanding of the underlying principles. Starting with an introduction to the fundamentals of image generation using TensorFlow, this book covers Variational Autoencoders (VAEs) and GANs. You'll discover how to build models for different applications as you get to grips with performing face swaps using deepfakes, neural style transfer, image-to-image translation, turning simple images into photorealistic images, and much more. You'll also understand how and why to construct state-of-the-art deep neural networks using advanced techniques such as spectral normalization and self-attention layers before working with advanced models for face generation and editing. You'll also be introduced to photo restoration, text-to-image synthesis, video retargeting, and neural rendering. Throughout the book, you'll learn to implement models from scratch in TensorFlow 2.x, including PixelCNN, VAE, DCGAN, WGAN, pix2pix, CycleGAN, StyleGAN, GauGAN, and BigGAN. By the end of this book, you'll be well versed in TensorFlow and be able to implement image generative technologies confidently.

Preface

Who this book is for

How to use this book

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Reviews

Section 1: Fundamentals of Image Generation with TensorFlow

Chapter 1: Getting Started with Image Generation Using TensorFlow

Technical requirements

Understanding probabilities

Generating faces with a probabilistic model

Building a PixelCNN model from scratch

Summary

Chapter 2: Variational Autoencoder

Technical requirements

Learning latent variables with autoencoders

Variational autoencoders

Generating faces with VAEs

Controlling face attributes

Summary

Chapter 3: Generative Adversarial Network

Technical requirements

Understanding the fundamentals of GANs

Building a Deep Convolutional GAN (DCGAN)

Challenges in training GANs

Building a Wasserstein GAN

Summary

Section 2: Applications of Deep Generative Models

Chapter 4: Image-to-Image Translation

Technical requirements

Conditional GANs

Image translation with pix2pix

Unpaired image translation with CycleGAN

Diversifying translation with BicycleGAN

Summary

Chapter 5: Style Transfer

Technical requirements

Neural style transfer

Improving style transfer

Arbitrary style transfer in real time

Introduction to style-based GANs

Summary

Chapter 6: AI Painter

Technical requirements

Introduction to iGAN

Segmentation map-to-image translation with GauGAN

Summary

Section 3: Advanced Deep Generative Techniques

Chapter 7: High Fidelity Face Generation

Technical requirements

ProGAN overview

Building a ProGAN

Implementing StyleGAN

Summary

Chapter 8: Self-Attention for Image Generation

Technical requirements

Spectral normalization

Self-attention modules

Building a SAGAN

Implementing BigGAN

Summary

Chapter 9: Video Synthesis

Technical requirements

Video synthesis overview

Implementing face image processing

Building a DeepFake model

Swapping faces

Improving DeepFakes with GANs

Summary

Chapter 10: Road Ahead

Reviewing GANs

Putting your skills into practice

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Segmentation map-to-image translation with GauGAN

GauGAN (named after the 19th-century painter Paul Gauguin) is a GAN from Nvidia. Nvidia is one of a handful of companies that have invested heavily in GANs. It has achieved several breakthroughs in this space, including ProgressiveGAN (we'll cover that in Chapter 7, High Fidelity Face Generation) for generating high-resolution images, and StyleGAN for high-fidelity faces.

Nvidia's main business is making graphics chips rather than AI software. Therefore, unlike some other companies, which keep their code and trained models as closely guarded secrets, Nvidia tends to open source its code. It has built a web page (http://nvidia-research-mingyuliu.com/gaugan/) to showcase GauGAN, which can generate photorealistic landscape photos from segmentation maps. The following screenshot is taken from that web page.

Feel free to pause this chapter for a bit and have a play with the application...
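For readers new to the term, a segmentation map assigns a class label to every pixel of an image. The following is a minimal sketch, not the book's own code: it uses plain Python lists and hypothetical class IDs (0 for sky, 1 for mountain, 2 for water) to show how such a map can be one-hot encoded into one channel per class, which is a common way of feeding label maps to an image generator.

```python
# Hypothetical class IDs, chosen for illustration only.
SKY, MOUNTAIN, WATER = 0, 1, 2
NUM_CLASSES = 3

# A tiny 3x4 segmentation map: each entry is the class label of one pixel.
segmentation_map = [
    [SKY,   SKY,      SKY,      SKY],
    [SKY,   MOUNTAIN, MOUNTAIN, SKY],
    [WATER, WATER,    WATER,    WATER],
]


def one_hot_encode(label_map, num_classes):
    """Convert an H x W label map into an H x W x num_classes nested list."""
    return [
        [[1 if label == c else 0 for c in range(num_classes)] for label in row]
        for row in label_map
    ]


encoded = one_hot_encode(segmentation_map, NUM_CLASSES)

# The pixel at row 1, column 1 is labelled MOUNTAIN, so only channel 1 is set.
print(encoded[1][1])  # [0, 1, 0]
```

In TensorFlow, `tf.one_hot` performs the same conversion on a tensor of integer labels.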
