Intelligent Document Capture with Ephesoft, Second Edition

Classification


In Chapter 2, Creating a Batch Class, we showed you how to configure search classification to enable Ephesoft to recognize an invoice document. There are several other classification types available; we will explain these alternatives now.

Classification types

To select the process that Ephesoft uses to classify documents, edit your batch class, open the Document Assembly module, edit the Document Assembler plugin within that module, and then select a value for DA Classification Type.

Search

Search classification (also sometimes called Lucene classification) is the default classification method and is recommended for most content. When configured to perform search classification, Ephesoft compares the text on each input page to the text on training documents to determine its confidence that a document is of a certain type.
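
To make the idea of text-based confidence concrete, here is a minimal Python sketch of comparing a page's text to training samples. It is an illustration only and not Ephesoft's implementation: Ephesoft relies on Lucene, whereas this sketch substitutes a plain bag-of-words cosine similarity. The function names, the document type names, and the sample texts are all hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two term-frequency Counters (0.0 to 1.0)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify_page(page_text, training_pages):
    """Return (best_type, score) for the training sample most similar to the page.

    training_pages maps a (hypothetical) document type name to a list of
    sample page texts. Returns (None, 0.0) when nothing overlaps.
    """
    page_vec = Counter(tokenize(page_text))
    best_type, best_score = None, 0.0
    for doc_type, samples in training_pages.items():
        for sample in samples:
            score = cosine_similarity(page_vec, Counter(tokenize(sample)))
            if score > best_score:
                best_type, best_score = doc_type, score
    return best_type, best_score


# Hypothetical training text; real training uses OCR output from sample documents.
training = {
    "Invoice": ["Invoice number 1001 bill to total amount due"],
    "Application": ["Application form applicant name date of birth signature"],
}
doc_type, score = classify_page(
    "Invoice number 2044 total amount due in 30 days", training
)
print(doc_type, round(score, 2))
```

The score plays the role of a confidence value: the more vocabulary an input page shares with the training text for a type, the higher the score for that type. A real search index weights terms differently, so treat the numbers here as illustrative rather than comparable to Ephesoft's confidence scores.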

Image

Image classification is the best option when classification cannot be made based on content. This occurs on forms that do...