Microsoft Azure Machine Learning

By: Sumit Mund, Christina Storm

The dataset


We will use a sample dataset that ships with ML Studio by default. It is a subset of the passenger flight on-time performance data taken from the TranStats data collection of the U.S. Department of Transportation (DOT) (http://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=236&DB_Short_Name=On-Time).

The dataset has been preprocessed and filtered to include only the 70 busiest airports in the continental United States for the period from April 2013 to October 2013. It contains the following columns:

  • Carrier: This is the code assigned by IATA that is commonly used to identify a carrier.

  • OriginAirportID (Origin Airport's Airport ID): This is an identification number assigned by DOT to uniquely identify the origin airport.

  • DestAirportID (Destination Airport's Airport ID): This is an identification number assigned by DOT to uniquely identify the destination airport.

  • CRSDepTime: This is the scheduled departure time (local time in hh:mm).

  • DepDelay: This is the difference in minutes between...