Book Image

Effective Amazon Machine Learning

By : Alexis Perrier
Book Image

Effective Amazon Machine Learning

By: Alexis Perrier

Overview of this book

Predictive analytics is a complex domain requiring coding skills, an understanding of the mathematical concepts underpinning machine learning algorithms, and the ability to create compelling data visualizations. Following AWS simplifying Machine learning, this book will help you bring predictive analytics projects to fruition in three easy steps: data preparation, model tuning, and model selection. This book will introduce you to the Amazon Machine Learning platform and will implement core data science concepts such as classification, regression, regularization, overfitting, model selection, and evaluation. Furthermore, you will learn to leverage the Amazon Web Service (AWS) ecosystem for extended access to data sources, implement realtime predictions, and run Amazon Machine Learning projects via the command line and the Python SDK. Towards the end of the book, you will also learn how to apply these services to other problems, such as text mining, and to more complex datasets.
Table of Contents (17 chapters)
Title Page
Credits
About the Author
About the Reviewer
www.PacktPub.com
Customer Feedback
Dedication
Preface

Creating the datasource


When working with Amazon ML, the data always resides in S3 and it is not duplicated in Amazon ML. A datasource is the metadata that indicates the location of the input data allowing Amazon ML to access it. Creating a datasource also generates descriptive statistics related to the data and a schema with information on the nature of the variables. Basically, the datasource gives Amazon ML all the information it requires to be able to train a model. The following are the steps you need to follow to create a datasource:

  1. Go to Amazon Machine Learning: https://console.aws.amazon.com/machinelearning/home.
  2. Click on getting started, you will be given a choice between accessing the Dashboard and Standard setup. This time choose the standard setup:

Perform the following steps, as shown in the following screenshot:

  1. Choose an S3 location.
  2. Start typing the name of the bucket in the s3 location field, and the list folders and files should show up.
  3. Select the titanic_train.csv file.

 

  1. Give...