Book Image

Artificial Vision and Language Processing for Robotics

By : Álvaro Morena Alberola, Gonzalo Molina Gallego, Unai Garay Maestre
Book Image

Artificial Vision and Language Processing for Robotics

By: Álvaro Morena Alberola, Gonzalo Molina Gallego, Unai Garay Maestre

Overview of this book

Artificial Vision and Language Processing for Robotics begins by discussing the theory behind robots. You'll compare different methods used to work with robots and explore computer vision, its algorithms, and limits. You'll then learn how to control the robot with natural language processing commands. You'll study Word2Vec and GloVe embedding techniques, non-numeric data, recurrent neural network (RNNs), and their advanced models. You'll create a simple Word2Vec model with Keras, as well as build a convolutional neural network (CNN) and improve it with data augmentation and transfer learning. You'll study the ROS and build a conversational agent to manage your robot. You'll also integrate your agent with the ROS and convert an image to text and text to speech. You'll learn to build an object recognition system using a video. By the end of this book, you'll have the skills you need to build a functional application that can integrate with a ROS to extract useful information about your environment.
Table of Contents (12 chapters)
Artificial Vision and Language Processing for Robotics
Preface

Introduction


Object recognition is an area of computer vision where a robot is capable of detecting objects in an environment using a camera or sensor that is capable of extracting images of the robot's surroundings. From these images, software detects an object within every image and then recognizes the type of object. Machines are capable of recognizing objects from an image or a video captured by the robot's sensors. This allows the robot to be aware of their environment.

If a robot can recognize its environment and obtain this information using object recognition, it will be able to perform more complex tasks, such as grabbing objects or moving around in an environment. In Chapter 9, Computer Vision for Robotics, we will look at a robot performing these tasks in a virtual environment.

The task to be performed here is to detect specific objects within an image and recognize those objects. This type of computer vision problem is a bit different from the ones that we have looked at earlier...