Sign In Start Free Trial
Account

Add to playlist

Create a Playlist

Modal Close icon
You need to login to use this feature.
  • Book Overview & Buying Modern Computer Vision with PyTorch
  • Table Of Contents Toc
Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch - Second Edition

By : V Kishore Ayyadevara, Yeshwanth Reddy
4 (21)
close
close
Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

4 (21)
By: V Kishore Ayyadevara, Yeshwanth Reddy

Overview of this book

Whether you are a beginner or are looking to progress in your computer vision career, this book guides you through the fundamentals of neural networks (NNs) and PyTorch and how to implement state-of-the-art architectures for real-world tasks. The second edition of Modern Computer Vision with PyTorch is fully updated to explain and provide practical examples of the latest multimodal models, CLIP, and Stable Diffusion. You’ll discover best practices for working with images, tweaking hyperparameters, and moving models into production. As you progress, you'll implement various use cases for facial keypoint recognition, multi-object detection, segmentation, and human pose detection. This book provides a solid foundation in image generation as you explore different GAN architectures. You’ll leverage transformer-based architectures like ViT, TrOCR, BLIP2, and LayoutLM to perform various real-world tasks and build a diffusion model from scratch. Additionally, you’ll utilize foundation models' capabilities to perform zero-shot object detection and image segmentation. Finally, you’ll learn best practices for deploying a model to production. By the end of this deep learning book, you'll confidently leverage modern NN architectures to solve real-world computer vision problems.
Table of Contents (27 chapters)
close
close
1
Section 1: Fundamentals of Deep Learning for Computer Vision
5
Section 2: Object Classification and Detection
13
Section 3: Image Manipulation
17
Section 4: Combining Computer Vision with Other Techniques
24
Other Books You May Enjoy
25
Index

Questions

  1. What are VGG and ResNet pre-trained architectures trained on?
  2. Why does VGG11 have an inferior accuracy to VGG16?
  3. What does the number 11 in VGG11 represent?
  4. What does the term residual mean in “residual network” refer to?
  5. What is the advantage of a residual network?
  6. What are the various popular pretrained models discussed in the book and what is the speciality of each network?
  7. During transfer learning, why should images be normalized with the same mean and standard deviation as those that were used during the training of the pre-trained model?
  8. When and why do we freeze certain parameters in a model?
  9. How do we know the various modules that are present in a pre-trained model?
  10. How do we train a model that predicts categorical and numerical values together?
  11. Why might age and gender prediction code not always work for an image of your own if we were to execute the same code as that which we wrote...
CONTINUE READING
83
Tech Concepts
36
Programming languages
73
Tech Tools
Icon Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.
Icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Icon 50+ new titles added per month and exclusive early access to books as they are being written.
Modern Computer Vision with PyTorch
notes
bookmark Notes and Bookmarks search Search in title playlist Add to playlist download Download options font-size Font size

Change the font size

margin-width Margin width

Change margin width

day-mode Day/Sepia/Night Modes

Change background colour

Close icon Search
Country selected

Close icon Your notes and bookmarks

Confirmation

Modal Close icon
claim successful

Buy this book with your credits?

Modal Close icon
Are you sure you want to buy this book with one of your credits?
Close
YES, BUY

Submit Your Feedback

Modal Close icon
Modal Close icon
Modal Close icon