BITES – Unstructured/Semistructured document store
Because all data isn't structured in a rational database, it often comes unstructured in the form of free text or semistructured in the form of Word, Excel, or PDF documents. These documents often have to be treated specially, such as dumping the content in ElasticSearch or MongoDB. Stardog's BITES (Blob Indexing and Text Enrichment with Semantics) capability allows this unstructured/semistructured content in through external processors while also serving as a document store. Through a customizable set of entity-extraction processors, the contents of this documentation can now be unlocked and projected into the graph while providing a means to keep the documentation intact. Managing the documents is available through a full-featured command line, a local Java API, or via an HTTP API.
The conceptual architecture of the BITES Pipeline is illustrated here:
BITES allows the storage and retrieval of documents in the form of files. Stardog treats...