Book Image

Intelligent Document Capture with Ephesoft, Second Edition - Second Edition

Book Image

Intelligent Document Capture with Ephesoft, Second Edition - Second Edition

Overview of this book

Table of Contents (14 chapters)

Fuzzy DB


When the fuzzy database is configured, Ephesoft can populate document fields with content from a row in an external database. Ephesoft automatically selects the row whose values match the most content in the current document. Ephesoft uses the Lucene full-text search engine to implement this feature.

Let's configure Ephesoft to populate fields on our invoice documents by using information from a database. Let's assume that we have a database that contains vendor information, including the vendor's name and ID. This vendor ID differs from the customer number that we extracted from the document. First, create the new fields of VendorID and VendorName.

Note

The database must contain a unique integer value that will be mapped to a RowId value in the Ephesoft configuration (discussed later in this section).

Here, is an example of a database table containing vendor information:

An example of the vendor database table

We need to configure the fuzzy database plugin to connect to this database...