Kinect in Motion - Audio and Visual Tracking by Example

Overview of this book

Kinect is a motion-sensing input device by Microsoft for the Xbox 360 video game console and Windows PCs. It provides capabilities for enhancing human-machine interaction and takes you on a zero-to-hero journey towards engaging the user in a multimodal dialog with your software solution. Kinect in Motion - Audio and Visual Tracking by Example guides you in developing more than five models you can use to capture gestures, movements, and spoken voice commands. The examples and the theory discussed provide you with the knowledge to let the user become a part of your application. Kinect in Motion - Audio and Visual Tracking by Example is a compact reference on how to master the color, depth, skeleton, and audio data streams handled by Kinect for Windows.

Starting with an introduction to Kinect and its characteristics, you will first be shown how to master the color data stream with no more than one page of code. You will then learn how to manage the depth information and map it against the color data, and how to define and manage gestures that enable the user to instruct the application simply by moving their arms or performing any other natural action. Finally, you will complete the journey through a multimodal interface that combines gestures with audio. The book leads you through many detailed, real-world examples and even guides you on how to test your application.

Detecting simple actions


Let's now see how we can enhance our application and leverage the Kinect sensor's Natural User Interface (NUI) capabilities.

We will implement a manager that, using the skeleton data, is able to interpret a body motion or a posture and translate it into an action such as a "click". Similarly, we could define other actions such as "zoom in". Unfortunately, the Kinect for Windows SDK does not provide APIs for recognizing gestures, so we need to develop our own custom gesture recognition engine.
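As a starting point, a simple posture can be detected by comparing the relative positions of two skeleton joints on every frame. The following is a minimal sketch, assuming the Kinect for Windows SDK 1.x managed skeleton types (Skeleton, Joint, JointType); the ClickGestureDetector class name and the 0.45 m push threshold are illustrative assumptions, not values prescribed by the SDK:

    using System;
    using Microsoft.Kinect;

    // Illustrative posture detector: raises a "click" when the right hand
    // is pushed well in front of the right shoulder. The class name and
    // threshold are assumptions made for this sketch.
    public class ClickGestureDetector
    {
        // Minimum hand-to-shoulder distance along the Z axis, in meters.
        private const float PushThreshold = 0.45f;
        private bool isPushed;

        public event EventHandler Click;

        public void Update(Skeleton skeleton)
        {
            if (skeleton == null ||
                skeleton.TrackingState != SkeletonTrackingState.Tracked)
            {
                return;
            }

            Joint hand = skeleton.Joints[JointType.HandRight];
            Joint shoulder = skeleton.Joints[JointType.ShoulderRight];

            if (hand.TrackingState != JointTrackingState.Tracked ||
                shoulder.TrackingState != JointTrackingState.Tracked)
            {
                return;
            }

            // Z grows away from the sensor, so a push makes the hand's Z
            // noticeably smaller than the shoulder's Z.
            bool pushed = (shoulder.Position.Z - hand.Position.Z) > PushThreshold;

            // Raise the event only on the transition into the pushed state,
            // so the click does not repeat on every skeleton frame.
            if (pushed && !isPushed)
            {
                EventHandler handler = Click;
                if (handler != null)
                {
                    handler(this, EventArgs.Empty);
                }
            }

            isPushed = pushed;
        }
    }

The detector would typically be fed from the handler of the sensor's SkeletonFrameReady event, once the skeleton array has been copied out of the frame, and the same pattern extends naturally to other postures such as the one we could map to "zoom in".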

Gesture detection can be relatively simple or highly complex depending on the gesture and the environment (image noise, scenes with multiple users, and so on).

In the literature there are many approaches to implementing gesture recognition; the most common ones are as follows:

  • A neural network approach that utilizes weighted networks (Gestures and neural networks in human-computer interaction, Beale R and Alistair D N E)

  • A DTW approach that utilizes the Dynamic Time Warping algorithm, initially developed for the speech...