Book Image

Hands-On Machine Learning with Azure

By : Thomas K Abraham, Parashar Shah, Jen Stirrup, Lauri Lehman, Anindita Basak
Book Image

Hands-On Machine Learning with Azure

By: Thomas K Abraham, Parashar Shah, Jen Stirrup, Lauri Lehman, Anindita Basak

Overview of this book

Implementing Machine learning (ML) and Artificial Intelligence (AI) in the cloud had not been possible earlier due to the lack of processing power and storage. However, Azure has created ML and AI services that are easy to implement in the cloud. Hands-On Machine Learning with Azure teaches you how to perform advanced ML projects in the cloud in a cost-effective way. The book begins by covering the benefits of ML and AI in the cloud. You will then explore Microsoft’s Team Data Science Process to establish a repeatable process for successful AI development and implementation. You will also gain an understanding of AI technologies available in Azure and the Cognitive Services APIs to integrate them into bot applications. This book lets you explore prebuilt templates with Azure Machine Learning Studio and build a model using canned algorithms that can be deployed as web services. The book then takes you through a preconfigured series of virtual machines in Azure targeted at AI development scenarios. You will get to grips with the ML Server and its capabilities in SQL and HDInsight. In the concluding chapters, you’ll integrate patterns with other non-AI services in Azure. By the end of this book, you will be fully equipped to implement smart cognitive actions in your models.
Table of Contents (14 chapters)

Azure Data Lake Analytics

Azure Data Lake (ADL) is Microsoft's storage and analytics service for big data. It is capable of storing data on a petabyte scale and making efficient queries on the stored data. The storage and the analytics services are separate in Azure and the ADL service actually consists of two different products: Azure Data Lake Storage (ADLS) and Azure Data Lake Analytics (ADLA). In this section, we will focus on ADLA, but we will also touch on ADLS where appropriate.

Data Lake Storage is a file-based storage, with files organized into directories. This type of storage is called schemaless, since there are no constraints on what type of data can be stored in the Data Lake. Directories can contain text files and images, and the data type is specified only when the data is read out from the Data Lake. This is particularly useful in big data scenarios where...