Book Image

Big Data Forensics: Learning Hadoop Investigations

Book Image

Big Data Forensics: Learning Hadoop Investigations

Overview of this book

Table of Contents (15 chapters)
Big Data Forensics – Learning Hadoop Investigations
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Locating sources of data


Finding the sources of data to collect is an iterative process. This process includes gathering data requirements, identifying which systems are available, determining which systems contain data that match the requirements, and assessing whether the data in those relevant systems is viable for collection. This top-down approach is represented in detail in the following figure:

Figure 1: The identification process

Compiling data requirements

The first step is to compile the list of data requirements. For a litigation-related collection, these data requirements are determined based on the merits of the case. In the meet and confer process, the requirements are often quite broad. For a business crisis, all stakeholders should agree on the date range of events, all possible business units involved, and the facts in question. From this, a list of high-level information that could answer the questions should be derived. For example, in an investigation for a financial restatement...