Book Image

Artificial Vision and Language Processing for Robotics

By : Álvaro Morena Alberola, Gonzalo Molina Gallego, Unai Garay Maestre
Book Image

Artificial Vision and Language Processing for Robotics

By: Álvaro Morena Alberola, Gonzalo Molina Gallego, Unai Garay Maestre

Overview of this book

Artificial Vision and Language Processing for Robotics begins by discussing the theory behind robots. You'll compare different methods used to work with robots and explore computer vision, its algorithms, and limits. You'll then learn how to control the robot with natural language processing commands. You'll study Word2Vec and GloVe embedding techniques, non-numeric data, recurrent neural network (RNNs), and their advanced models. You'll create a simple Word2Vec model with Keras, as well as build a convolutional neural network (CNN) and improve it with data augmentation and transfer learning. You'll study the ROS and build a conversational agent to manage your robot. You'll also integrate your agent with the ROS and convert an image to text and text to speech. You'll learn to build an object recognition system using a video. By the end of this book, you'll have the skills you need to build a functional application that can integrate with a ROS to extract useful information about your environment.
Table of Contents (12 chapters)
Artificial Vision and Language Processing for Robotics
Preface

Summary


Object recognition and detection is capable of identifying several objects within an image, to draw bounding boxes around those objects and predict the types of object they are.

The process of labeling the bounding boxes and their labels has been explained, but not in depth, due to the huge process required. Instead, we used state-of-the-art models to recognize and detect those objects.

YOLOV3 was the main model used in this chapter. OpenCV was used to explain how to run an object detection pipeline using its DNN module. ImageAI, an alternative library for object detection and recognition, has shown its potential for writing an object detection pipeline with a few lines and easy object customization.

Finally, the ImageAI object detection pipeline was put into practice by using a video, where every frame obtained from the video was passed through that pipeline to detect and identify objects from those frames and show them using Matplotlib.