Book Image

Kinect for Windows SDK Programming Guide

By : Abhijit Jana
Book Image

Kinect for Windows SDK Programming Guide

By: Abhijit Jana

Overview of this book

Kinect has been a game-changer in the world of motion games and applications since its first release. It has been touted as a controller for Microsoft Xbox but is much more than that. The developer version of Kinect, Kinect for Windows SDK, provides developers with the tools to develop applications that run on Windows. You can use this to develop applications that make interaction with your computer hands-free. This book focuses on developing applications using the Kinect for Windows SDK. It is a complete end to end solution using different features of Kinect for Windows SDK with step by step guidance. The book will also help you develop motion sensitive and speech recognition enabled applications. You will also learn about building application using multiple Kinects.The book begins with explaining the different components of Kinect and then moves into to the setting up the device and getting thedevelopment environment ready. You will be surprised at how quickly the book takes you through the details of Kinect APIs. You will use NUI to use the Kinect for Natural Inputs like skeleton tracking, sensing, speech recognizing. You will capture different types of stream, and images, handle stream event, and capture frame. Kinect device contains a motorized tilt to control sensor angles, you will learn how to adjust it automatically. The last part of the book teaches you how to build application using multiple Kinects and discuss how Kinect can be used to integrate with other devices such as Windows Phone and microcontroller.
Table of Contents (19 chapters)
Kinect for Windows SDK Programming Guide
Credits
About the Author
Acknowledgement
About the Reviewers
www.PacktPub.com
Preface
Index

How speech recognition works


An application can have different types of user interface (UI), and controlling the UI using speech is one of the approaches of user interaction. Using the speech recognition system, users say what they want and the computer executes the command and the results are reflected on the UI.

We can categorize the patterns of speech recognition in the following two ways:

  • Command mode: This is the mode where you say a command and the speech recognition engine recognizes the speech. As an example, you may want to start and stop a game by just saying "start" and "stop".

  • Sentence mode or diction mode: This is the mode where you can say a sentence to perform an operation. As an example, to rotate a line you can say "rotate the line".

At the first glance, speech recognition looks like a simple matching logic, but indeed it is not. The speech recognition engine consists of the following two major modules:

  • Acoustic model

  • Language model

Each one of the modules has a sole responsibility...