Documents clearly printed or typed can be scanned or imported to Laserfiche as images, and a process called Optical Character Recognition (OCR) can generate searchable text from these images. You can perform OCR processing on single or multiple documents, as well as on single or multiple pages within documents. You can also index those documents when the image is brought into Laserfiche.
Note: OCR is a resource-intensive process; it can be slow if image quality is poor, the image was scanned improperly, or if memory is limited.
Important: OCR is a resource-intensive process that cannot be performed by Web Access directly; however Web Access can be configured to send documents to a Laserfiche Distributed Computing Cluster Scheduler that is setup for OCR. Your system must be licensed for Laserfiche Distributed Computing Cluster, and your Web Access must be configured to communicate with the Distributed Computing Cluster Scheduler. The Laserfiche Client can perform OCR directly, without using Distributed Computing Cluster.
The following procedure can also be used to create text pages for electronic documents. For more information, see Retrieving Text from an Electronic File.
To generate text using OCR from one or more imaged documents
To generate text using OCR from one or more imaged documents
Note: To configure OCR options or improve OCR results, see Options: Generate Text: General in the Laserfiche Client, or Settings: Generate Text: OCR Settings in Web Access.