Book Image

NoSQL Data Models

By : Olivier Pivert
Book Image

NoSQL Data Models

By: Olivier Pivert

Overview of this book

Big Data environments are now to be handled in most current applications, this book addresses the latest issues and hurdles that are encountered in such environments. The book begins by presenting an overview of NoSQL languages and systems. Then, you’ll evaluate SPARQL queries over large RDF datasets and devise a solution that will use the MapReduce framework to process SPARQL graph patterns. Next, you’ll handle the production of web data, generate a set of links between two different datasets and overcome different heterogeneity problems. Moving ahead, you’ll take the multi-graph based approach to overcome challenges faced by the RDF data management community. Finally, you’ll deal with the flexible querying of graph databases and textual data management. By the end of this book, you’ll have gathered essential information on big data challenges faced by NoSQL databases.
Table of Contents (11 chapters)
Preface
8
List of Authors
9
Index
10
End User License Agreement

3.2. Datasets recommendation for data linking

Coreference resolution is a common thread across many communities, which is referred to as entity matching, entity disambiguation, cross-document coreference, duplicate record detection or record linkage. These terms all describe the process of determining the presence of different and heterogeneous descriptions of the same real-world objects and also the process of determining links and relations among these descriptions in order to make their correspondence explicit. Coreference resolution can build on a large body of related work across different communities. For example within database communities, we refer the interested reader to the works of Winkler et al. [WIN 06] on record linkage and Elmagarmid et al. [ELM 07] for duplicate detection. In the natural language processing field, we cite the survey of Soon et al. [SOO 01] where coreference resolution can be seen as the task of finding all expressions that refer to the same entity in...