Book Image

Turning Spreadsheets into Corporate Data

By : Bill Inmon
Book Image

Turning Spreadsheets into Corporate Data

By: Bill Inmon

Overview of this book

Spreadsheets are a popular way to store and communicate business data, but, although they are easy to create and update, they are not reliable enough to be used for making important corporate decisions. With this book, you can gain insight into how to maintain spreadsheets, how to format them, and then convert them into a database of reliable and useful information. Turning Spreadsheets into Corporate Data starts with a quick history of spreadsheet usage. You’ll learn the basics of formatting spreadsheets, including how to handle special characters and column headings, and how to convert the spreadsheet first into an intermediate database and then into corporate data. You will also learn how to utilize the mnemonic dictionary that is created along with the intermediate database. The later chapters discuss the immutability of data and the importance of organizational and political considerations during the data transformation. By the end of this book, you’ll have the skills and knowledge needed to convert your spreadsheets into reliable corporate data.
Table of Contents (16 chapters)
Free Chapter
1
Introduction
14
13: Case Study
15
Glossary
16
Index

In Summary

At a high level, spreadsheet disambiguation is the process of ingesting a spreadsheet and turning that spreadsheet into corporate data.

The process starts with the selection of spreadsheets as candidates for corporate data. Most spreadsheets are not good candidates for inclusion into corporate data.

The next step is to log the spreadsheet in for processing.

The next step is the inclusion of the spreadsheet into the path queue.

Next comes the definition of the headings into the ssdef table.

The next step is the pairing of the ssdef specification with the spreadsheet. This step is a useful “self-check” for determining whether changes have been made to the spreadsheet since the last iteration was processed.

The next step is the running of the spreadsheet disambiguation technology. In this step, the values from the spreadsheet are paired up with the context of the values. The context of the values is determined by finding the column name and the...