Sign In Start Free Trial

Book Overview & Buying
Table Of Contents

Hands-On Computer Vision with Detectron2

By : Van Vung Pham

4.9 (14)

Hands-On Computer Vision with Detectron2

4.9 (14)

By: Van Vung Pham

Overview of this book

Computer vision is a crucial component of many modern businesses, including automobiles, robotics, and manufacturing, and its market is growing rapidly. This book helps you explore Detectron2, Facebook's next-gen library providing cutting-edge detection and segmentation algorithms. It’s used in research and practical projects at Facebook to support computer vision tasks, and its models can be exported to TorchScript or ONNX for deployment. The book provides you with step-by-step guidance on using existing models in Detectron2 for computer vision tasks (object detection, instance segmentation, key-point detection, semantic detection, and panoptic segmentation). You’ll get to grips with the theories and visualizations of Detectron2’s architecture and learn how each module in Detectron2 works. As you advance, you’ll build your practical skills by working on two real-life projects (preparing data, training models, fine-tuning models, and deployments) for object detection and instance segmentation tasks using Detectron2. Finally, you’ll deploy Detectron2 models into production and develop Detectron2 applications for mobile devices. By the end of this deep learning book, you’ll have gained sound theoretical knowledge and useful hands-on skills to help you solve advanced computer vision tasks using Detectron2.

Preface

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Code in Action

Conventions used

Get in touch

Share Your Thoughts

Download a free PDF copy of this book

Part 1: Introduction to Detectron2

Part 1: Introduction to Detectron2

Free Chapter

Chapter 1: An Introduction to Detectron2 and Computer Vision Tasks

Chapter 1: An Introduction to Detectron2 and Computer Vision Tasks

Technical requirements

Computer vision tasks

An introduction to Detectron2 and its architecture

Detectron2 development environments

Summary

Chapter 2: Developing Computer Vision Applications Using Existing Detectron2 Models

Chapter 2: Developing Computer Vision Applications Using Existing Detectron2 Models

Technical requirements

Introduction to Detectron2’s Model Zoo

Developing an object detection application

Developing an instance segmentation application

Developing a keypoint detection application

Developing a panoptic segmentation application

Developing a semantic segmentation application

Putting it all together

Summary

Part 2: Developing Custom Object Detection Models

Part 2: Developing Custom Object Detection Models

Chapter 3: Data Preparation for Object Detection Applications

Chapter 3: Data Preparation for Object Detection Applications

Technical requirements

Common data sources

Getting images

Selecting an image labeling tool

Annotation formats

Labeling the images

Annotation format conversions

Summary

Chapter 4: The Architecture of the Object Detection Model in Detectron2

Chapter 4: The Architecture of the Object Detection Model in Detectron2

Technical requirements

Introduction to the application architecture

The backbone network

Region Proposal Network

Region of Interest Heads

Summary

Chapter 5: Training Custom Object Detection Models

Chapter 5: Training Custom Object Detection Models

Technical requirements

Processing data

Using the default trainer

Selecting the best model

Developing a custom trainer

Utilizing the hook system

Summary

Chapter 6: Inspecting Training Results and Fine-Tuning Detectron2’s Solvers

Chapter 6: Inspecting Training Results and Fine-Tuning Detectron2’s Solvers

Technical requirements

Inspecting training histories with TensorBoard

Understanding Detectron2’s solvers

Fine-tuning the learning rate and batch size

Summary

Chapter 7: Fine-Tuning Object Detection Models

Chapter 7: Fine-Tuning Object Detection Models

Technical requirements

Setting anchor sizes and anchor ratios

Setting pixel means and standard deviations

Putting it all together

Summary

Chapter 8: Image Data Augmentation Techniques

Chapter 8: Image Data Augmentation Techniques

Technical requirements

Image augmentation techniques

Detectron2’s image augmentation system

Summary

Chapter 9: Applying Train-Time and Test-Time Image Augmentations

Chapter 9: Applying Train-Time and Test-Time Image Augmentations

Technical requirements

The Detectron2 data loader

Applying existing image augmentation techniques

Developing custom image augmentation techniques

Applying test-time image augmentation techniques

Summary

Part 3: Developing a Custom Detectron2 Model for Instance Segmentation Tasks

Part 3: Developing a Custom Detectron2 Model for Instance Segmentation Tasks

Chapter 10: Training Instance Segmentation Models

Chapter 10: Training Instance Segmentation Models

Technical requirements

Preparing data for training segmentation models

The architecture of the segmentation models

Training custom segmentation models

Summary

Chapter 11: Fine-Tuning Instance Segmentation Models

Chapter 11: Fine-Tuning Instance Segmentation Models

Technical requirements

Introduction to PointRend

Using existing PointRend models

Training custom PointRend models

Summary

Part 4: Deploying Detectron2 Models into Production

Part 4: Deploying Detectron2 Models into Production

Chapter 12: Deploying Detectron2 Models into Server Environments

Chapter 12: Deploying Detectron2 Models into Server Environments

Technical requirements

Supported file formats and runtimes

Deploying custom Detectron2 models

Summary

Chapter 13: Deploying Detectron2 Models into Browsers and Mobile Environments

Chapter 13: Deploying Detectron2 Models into Browsers and Mobile Environments

Technical requirements

Deploying Detectron2 models using ONNX

Developing mobile computer vision apps with D2Go

Summary

Index

Index

Why subscribe?

Other Books You May Enjoy

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Download a free PDF copy of this book

Index

As this ebook edition doesn't have fixed pagination, the page numbers below are hyperlinked for reference only, based on the printed edition of this book.

A

anchor ratios

computing 145, 146

hyperparameters 144, 145

setting 140

anchor sizes

generating 144, 145

setting 140

annotation formats 48-55

conversions 58

AugInput class 186, 187

augmentation classes 176

Augmentation class 177

AugmentationList class 186

FixedSizeCrop class 177

MinIoURandomCrop class 184

RandomApply class 178

RandomBrightness class 185

RandomContrast class 185

RandomCrop and CategoryAreaConstraint classes 183

RandomCrop class 178

RandomExtent class 179

RandomFlip class 180

RandomLighting class 185

RandomResize class 181

RandomRotation class 180, 181

RandomSaturation class 185

Resize class 181

ResizeScale class 182

ResizeShortestEdge class 182

average precision (AP) 21, 104, 281

B

...

CONTINUE READING

83

Tech Concepts

36

Programming languages

73

Tech Tools

Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

50+ new titles added per month and exclusive early access to books as they are being written.

Hands-On Computer Vision with Detectron2

Search

Your notes and bookmarks