Learning Microsoft Cognitive Services

Learning Microsoft Cognitive Services - Third Edition

By: Leif Larsen

Overview of this book

Microsoft Cognitive Services is a set of APIs for integrating artificial intelligence into your applications to solve logical business problems. If you're new to developing applications with AI, Learning Microsoft Cognitive Services will give you a comprehensive introduction to Microsoft's AI stack and get you up to speed in no time. The book introduces you to 24 APIs, including Emotion, Language, Vision, Speech, Knowledge, and Search. Using Visual Studio, you can develop applications with enhanced capabilities for image processing, speech recognition, text processing, and much more. As you progress, you will work with datasets that enable your applications to process data in the form of images, video, or text. By the end of the book, you'll be able to confidently explore Cognitive Services APIs for building intelligent applications that can be deployed for real-world business uses.

Knowing who is speaking

Using the Speaker Recognition API, we can identify who is speaking. By defining one or more speaker profiles, each with corresponding voice samples, we can check whether a given piece of audio was spoken by any of those speakers.

To be able to utilize this feature, we need to go through a few steps:

1. Add one or more speaker profiles to the service.
2. Enroll several spoken samples for each speaker profile.
3. Call the service to identify a speaker based on audio input.
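
The order of these steps can be sketched with a small in-memory stand-in. This is only an illustration of the workflow, not the Speaker Recognition client: the class name SpeakerWorkflowSketch, its methods, and the exact-match "identification" are all assumptions made for this example, and the real service compares voice characteristics in audio rather than strings.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical in-memory stand-in for the three-step workflow.
// It does NOT call the Speaker Recognition API; samples are plain strings
// and "identification" is a naive exact match, purely for illustration.
public class SpeakerWorkflowSketch
{
    // Maps each profile ID to the samples enrolled for it.
    private readonly Dictionary<Guid, List<string>> _profiles =
        new Dictionary<Guid, List<string>>();

    // Step 1: add a speaker profile and return its ID.
    public Guid CreateProfile()
    {
        var id = Guid.NewGuid();
        _profiles[id] = new List<string>();
        return id;
    }

    // Step 2: enroll a spoken sample for an existing profile.
    public void Enroll(Guid profileId, string sample)
    {
        if (!_profiles.TryGetValue(profileId, out var samples))
            throw new ArgumentException("Unknown profile", nameof(profileId));
        samples.Add(sample);
    }

    // Step 3: identify the speaker among the candidate profiles.
    // Returns null when none of the candidates has a matching sample.
    public Guid? Identify(string audio, IEnumerable<Guid> candidates)
    {
        foreach (var id in candidates)
        {
            if (_profiles.TryGetValue(id, out var samples) && samples.Contains(audio))
                return id;
        }
        return null;
    }
}
```

A profile has to exist before samples can be enrolled, and identification only considers the candidate profiles passed in, which is why the steps are taken in this order.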

If you have not already done so, sign up for an API key for the Speaker Recognition API at https://portal.azure.com.

Start by adding a new NuGet package to your smart-house application. Search for and add Microsoft.ProjectOxford.SpeakerRecognition.

Add a new class called SpeakerIdentification to the Model folder of your project. This class will hold all of the functionality related to speaker identification.

Beneath the class, we will add another class, containing EventArgs for status updates:

    public class SpeakerIdentificationStatusUpdateEventArgs...
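
The body of the book's class is not shown here, so the following is only a general sketch of the standard .NET pattern for carrying status updates in a custom EventArgs type. The names StatusUpdateEventArgsSketch, StatusPublisherSketch, Message, and OnStatusUpdated are hypothetical and are not taken from the book's code.

```csharp
using System;

// Hypothetical event payload; the single Message property is an
// illustrative assumption, not the book's actual class definition.
public class StatusUpdateEventArgsSketch : EventArgs
{
    public string Message { get; }

    public StatusUpdateEventArgsSketch(string message)
    {
        Message = message;
    }
}

// Hypothetical publisher that raises a status event for its subscribers.
public class StatusPublisherSketch
{
    public event EventHandler<StatusUpdateEventArgsSketch> OnStatusUpdated;

    // Raises the event only if at least one handler is attached.
    public void Report(string message)
    {
        OnStatusUpdated?.Invoke(this, new StatusUpdateEventArgsSketch(message));
    }
}
```

A view model could subscribe with `publisher.OnStatusUpdated += (sender, e) => Status = e.Message;` to surface progress in the UI, where Status is likewise a hypothetical property.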
