Book Image

Alexa Skills Projects

By : Madhur Bhargava
Book Image

Alexa Skills Projects

By: Madhur Bhargava

Overview of this book

Amazon Echo is a smart speaker developed by Amazon, which connects to Amazon’s Alexa Voice Service and is entirely controlled by voice commands. Amazon Echo is currently being used for a variety of purposes such as home automation, asking generic queries, and even ordering a cab or pizza. Alexa Skills Projects starts with a basic introduction to Amazon Alexa and Echo. You will then deep dive into Alexa Programming concepts such as Intents, Slots, Lambdas and maintaining your skill’s state using DynamoDB. You will get a clear understanding of how some of the most popular Alexa Skills work, and gain experience of working with real-world Amazon Echo applications. In the concluding chapters, you will explore the future of voice-enabled applications and their coverage with respect to the Internet of Things. By the end of the book, you will have learned to design Alexa Skills for specific purposes and interact with Amazon Echo to execute these skills.
Table of Contents (14 chapters)
Title Page
Dedication
Packt Upsell
Contributors
Preface
Index

Applications of Voice-Based Personal Assistants


We discussed the evolution of voice-based personal assistants in the previous section. In this section, we shall extend that discussion to some of the popular uses of each of the interactive voice-based personal assistants, irrespective of whether the assistant in question is desktop, smartphone, or smart home-based. We shall begin with one of the earliest and most well-known ones, Apple's Siri.

Siri

As indicated earlier, Siri started as a separate smartphone app in 2011 for iOS, which was later on acquired by Apple. Initially, the capabilities of Siri were limited to smartphones and simple functions such as:

  • Looking up contacts
  • Messaging (SMS)
  • Fetching weather updates on user demand, plus other simple queries as mentioned in the previous section

However, Apple's roadmap also extended the capabilities of Siri by closely integrating it with third-party apps and, true to their promise, with the coming of iOS 10, Apple also released SiriKit.

Note

To know more about SiriKit, please visit https://developer.apple.com/sirikit/.

If the user has the following third-party apps installed, he/she can request a ride using Siri:

  • Uber
  • Lyft

If the user has the following third-party apps installed, he/she can set those to send a message (and not just an SMS) using Siri:

  • WhatsApp
  • LinkedIn
  • WeChat
  • Slack

A user can also make VoIP calls using the following apps via Siri:

  • Skype
  • Viber

Note

Please note that the preceding lists are not exhaustive. However, third-party integrations were not the only thing on Apple's roadmap to extend the capabilities of Siri. The launch of macOS Sierra also brought the capabilities of Siri to the desktop. To know more about Siri's desktop capabilities, please visit https://support.apple.com/en-us/HT206993.

Siri can also help a user to:

  • Search files on his/her Mac
  • Notify the user about their storage space
  • Send requests to FaceTime with Contacts, and many others as shown here:

Figure 1.8: List of things Siri can help with (non-exhaustive) (source: www.osxdaily.com)

With a fair idea about Siri's desktop and smartphone capabilities, let's now move on to another popular voice assistant.

Google Now

We are going to discuss the Android and Google Now next, which at the time of writing is the biggest player in the smartphone market and also the home of Google Now, the voice assistant introduced by Google for Android smartphones in 2012.

In early 2010, the smartphone market was dominated by many players. Over the years, this has filtered down and only two major players remain in the market as depicted as follows:

Figure 1.9: Smartphone market share distribution comparison between the years 2010 and 2016 (Data sourced from Gartner)

Google Now can do pretty much all that Siri can accomplish; however, it has better integration with the web and web-based queries, since the web is Google's main forte. Some of the things that a user can ask Google Now are:

Figure 1.10: Some of the things that Google Now can do (Data source: www.cnet.com)

Apart from Google Now, Google also has introduced Google Assistant, which is a more evolved version of Google Now, given the fact that the user can hold full-length conversations with Google Assistant, which is not possible with Google Now.

It is very likely that Google Now will be phased out and Google Assistant will take its place; however, Google Assistant is currently only available on Google Home, which is Google's smart home speaker; the Android Pixel 2 smartphone; and for Android Wear:

Figure 1.11: Devices on which Google Assistant is available (Google and the Google logo are registered trademarks of Google Inc., used with permission.)

Now, moving on from the smartphone market to the desktop market:

Figure 1.12: Desktop market share as of January 2017 (Data source: www.windowscentral.com)

As shown in the preceding graph, as of January 2017, the desktop market had Windows, Linux, and Mac OS X as major players, with Microsoft being the dominant force, which brings us to our next personal assistant.

Cortana

With Microsoft's clear dominance of the desktop market, we cannot ignore Cortana, which is Microsoft's answer to Siri and Google Assistant, but focused on desktop and Windows Mobile:

Figure 1.13: List of some things that Cortana can help with

Not just limited to Windows 10, Windows 10 Mobile, and Windows Phone 8.1, Cortana is also available for:

  • iOS (as a separate app)
  • Android (as a separate app)
  • Xbox One
  • Invoke smart Bluetooth speaker by Harman Kardon

Some of the many things that Cortana can accomplish are:

  • Web-based queries using Bing Search (for example, "Who is the President of the United States?")
  • Launch apps and turn on/off Wi-Fi/Bluetooth
  • Ask about weather
  • Manage appointments, reminders, and events

With that, we come to discuss the Star of this book.

Alexa

Alexa, the whole center point of this chapter and the book, is the interactive voice-based personal assistant by Amazon, originally introduced with its family of Echo devices. Alexa as an assistant is oriented towards a smart home concept, hence most of its use comes from Amazon Echo, a smart speaker designed and developed to be kept in the living room of the user's home so that the user can ask it day-to-day queries about weather, food recipes, and jokes, or play interactive trivia games, set alarms, shop for day-to-day items, and much more. The following diagram shows some of the things that a user can ask Alexa:

Figure 1.14: List of some things that Alexa can help with 

The capabilities of Alexa can also be extended by installing third-party skills (similar to Google Home's third-party apps). Each third-party skill is meant to serve a specific purpose. For example, the Uber skill allows you to order a ride, the Domino's skill allows you to order a pizza—all from the comfort of your home and through the magic of your voice working together with Alexa.

As of the time of writing this, there are more than 15,000 skills available for Alexa with Uber and Lyft being the most used ones in the travel category, Pandora and Spotify for music streaming, and multiple other skills being utilized in home automation.