Book Image

Artificial Vision and Language Processing for Robotics

By : Álvaro Morena Alberola, Gonzalo Molina Gallego, Unai Garay Maestre
Book Image

Artificial Vision and Language Processing for Robotics

By: Álvaro Morena Alberola, Gonzalo Molina Gallego, Unai Garay Maestre

Overview of this book

Artificial Vision and Language Processing for Robotics begins by discussing the theory behind robots. You'll compare different methods used to work with robots and explore computer vision, its algorithms, and limits. You'll then learn how to control the robot with natural language processing commands. You'll study Word2Vec and GloVe embedding techniques, non-numeric data, recurrent neural network (RNNs), and their advanced models. You'll create a simple Word2Vec model with Keras, as well as build a convolutional neural network (CNN) and improve it with data augmentation and transfer learning. You'll study the ROS and build a conversational agent to manage your robot. You'll also integrate your agent with the ROS and convert an image to text and text to speech. You'll learn to build an object recognition system using a video. By the end of this book, you'll have the skills you need to build a functional application that can integrate with a ROS to extract useful information about your environment.
Table of Contents (12 chapters)
Artificial Vision and Language Processing for Robotics
Preface

Summary


Computer vision is a big field within AI. By understanding this field, you can achieve results such as extracting information from an image or generating images that look just like they do in real life, for example. This chapter has covered image preprocessing for feature extraction using the OpenCV library, which allows easy training and prediction for machine learning models. Some basic machine learning models have also been covered, such as decision trees and boosting algorithms. These served as an introduction to machine learning and were mostly used to play around. Finally, neural networks were introduced and coded using Keras and TensorFlow as a backend. Normalization was explained and put into practice, along with dense layers, though convolutional layers are known to work better with images than dense layers do, and they will be explained later in the book.

Concepts for avoiding overfitting were also covered, and toward the end, we used the model to make predictions and put...