-
Book Overview & Buying
-
Table Of Contents
Principles of Data Science
By :
The distinction between structured and unstructured data is usually the first question you want to ask yourself about the entire dataset. The answer to this question can mean the difference between needing three days or three weeks of time to perform a proper analysis.
The basic breakdown is as follows (this is a rehashed definition of organized and unorganized data in the first chapter):
Structured (organized) data: This is data that can be thought of as observations and characteristics. It is usually organized using a table method (rows and columns).
Unstructured (unorganized) data: This data exists as a free entity and does not follow any standard organization hierarchy.
Here are a few examples that could help you differentiate between the two:
Most data that exists in text form, including server logs and Facebook posts, is unstructured
Scientific observations, as recorded by careful scientists, are kept in a very neat and organized (structured) format
A genetic...