The following terms are commonly used when implementing document capture:
Batch class: A definition of document types, associated fields, extraction rules, monitored folders, and e-mails for a specified workflow
Classification : Determining the type of document being processed
ECM: Enterprise Content Management, an enterprise application for managing a large number of documents
Fixed form: A type of form where the positions and dimensions of the fields are always the same
HA: High Availability, a term applied to online applications, services, or technologies that are designed to be resistant to failure, and therefore, always accessible
Indexing: The process of defining field values for a particular document instance
Machine print: Text that is printed by a machine (not hand-written)
Metadata: Information about a document that is associated with that document but not stored in the body of the document itself
OOTB : Out-of-the-box, refers to the default configuration of an application
Regex: A regular expression, syntax for defining a pattern of text
Separation: The process of determining the start and end of documents, given a set of page images