Ephesoft Technical Specifications

Ephesoft is a Document Capture engine which has a pluggable infrastructure.

Why would I need it?
You would need ephesoft if:

  • You have any forms that are hand written by your customers and need a fast and effective way to ingest these
  • You want to capture paper records or migrate to a digital by default workplace
  • You want to automatically classify documents and use them within workflows

How does it work?
Ephesoft is built using open source technologies. Ephesoft provides a process for capturing incoming documents such as paper, fax, email attachments.

Without using Ephesoft, manual workflow for mailroom document processing is very inefficient. Operators receive documents via mail, email, or fax. They organize documents based on the destination/business unit and scan them using high-speed scanners. The scanned documents are processed for validation, verification, and data entry. Ephesoft document capture and mailroom automation intends to automate the whole document processing workflow and requires operator interference only to handle exceptions.

The product is web based but On-Premise, built using open source technologies. The product is available in two editions. The community edition is available for free and has community-driven support. The enterprise edition has paid maintenance and has additional features via commercial plug-ins.

The product is designed for three different profiles of users:

  • Data entry operators review and validate the scanned/imported documents. These users mainly use keyboard, mouse or keyboard shortcuts to have high efficiency.
  • Supervisors do system level operations like reporting, setup, configuration, etc.
  • Administrators configure the batch class and How Ephesoft interacts with Document Management Systems, Business Process Management Systems or Databases.


Ephesoft is built on top of a workflow engine. Each step in the workflow is called Plugin and independently responsible for one specific operation. A Plugin might be responsible of OCRing the page and another plugin might be responsible for exporting documents to a repository. Plugins are grouped into sub workflow containers which are called Modules. For example all plugin that are used to extract meta data from documents such as Free Form extraction or Zonal OCR/ICR, or table/Line item extraction plugins can be found in a module called Extraction.

Overall System Architecture
The below diagram shows the Overall Workflow and how Plugins and Modules are implemented. Workflow can be followed from left to right. Documents are imported by the plugins in the Import Module, each page is analysed by various plugins in Page Processing Module, Document boundaries are identified by the plugins in the Document Assembler Module, Operators review the documents classification results (if necessary) in Document Review, Meta Data in extracted from Documents based on document type using the plugins in Extraction Module, Operators validate the extracted values in Document Validation and finally Documents are exported to their destinations.

Ephesoft System Diagram

Ephesoft Document Capture and Mailroom Automation workflow consists of automatic modules and manual modules.