Book Image

NoSQL Data Models

By : Olivier Pivert
Book Image

NoSQL Data Models

By: Olivier Pivert

Overview of this book

Big Data environments are now to be handled in most current applications, this book addresses the latest issues and hurdles that are encountered in such environments. The book begins by presenting an overview of NoSQL languages and systems. Then, you’ll evaluate SPARQL queries over large RDF datasets and devise a solution that will use the MapReduce framework to process SPARQL graph patterns. Next, you’ll handle the production of web data, generate a set of links between two different datasets and overcome different heterogeneity problems. Moving ahead, you’ll take the multi-graph based approach to overcome challenges faced by the RDF data management community. Finally, you’ll deal with the flexible querying of graph databases and textual data management. By the end of this book, you’ll have gathered essential information on big data challenges faced by NoSQL databases.
Table of Contents (11 chapters)
Preface
8
List of Authors
9
Index
10
End User License Agreement

7.1. Introduction

Sources of information have been multiplying on the Web for several years, especially due to the success of news portals and social networks that produce information in real time. These flows of information can be kept and processed, often in RSS [RSS 03] and Atom [GRE 07] formats. However, it turns out that nowadays the amount of data which has to be analyzed daily is so large [HME 11] that a user may miss information of interest. Thus, a given user can be lost with so many sources and the frequency of updates [TRA 14]. Pub/Sub (Publish/Subscribe) systems (Redis [CAR 13], Scribe [ROW 01], Siena [CAR 01], Echo [EIS 00]) have been designed to face the problem of aggregating and delivering information of interest (bookmarks and topics) to end users.

For these reasons, we advocate a content-based Publish/Subscribe paradigm for Web 2.0 syndication in which information consumers are decoupled (in both space and time) from feeds (produced flows of items) and instead express...