Book Image

Microsoft Azure Machine Learning

By : Sumit Mund, Christina Storm
Book Image

Microsoft Azure Machine Learning

By: Sumit Mund, Christina Storm

Overview of this book

Table of Contents (21 chapters)
Microsoft Azure Machine Learning
Credits
About the Author
Acknowledgments
About the Reviewers
www.PacktPub.com
Preface
Index

Chapter 4. Getting Data in and out of ML Studio

For any data analysis, you need data as input. Data analysis generates results as a dataset, which needs to be stored for future use. ML Studio allows you to import and export data in a variety of different formats. You can use the Reader module to import a dataset in ML Studio from external sources and you can use the Writer module to export a dataset. You can also download and upload datasets to and from your PC respectively for different data formats.

ML Studio supports a number of data formats. The internal data format, data table (DotNetTable), is primarily used to move data between modules inside an experiment. When you import data from external sources to ML Studio, the formats supported as of now are ARFF, CSV, Hive Table, SVMLight, Text, and TSV. Let's take a look at the following term list:

  • ARFF: This is the machine learning data format defined by Weka. An Attribute-Relation File Format (ARFF) file is an ASCII text file that describes...