How to use the Windows TIFF IFilter for OCR to index images
Background
Meridian Enterprise supports Windows IFilters for the detection and extraction of document text for full text indexing. Computers with the Windows Server 2008 R2 or Windows 7 operating system installed include the Windows TIFF IFilter developed by Microsoft that provides optical character recognition (OCR) capability. Following are implementation notes for using this IFilter for content indexing with BlueCielo products.
Description
The installation, deployment, and configuration of the Windows TIFF IFilter are described in the Windows TIFF IFilter Installation and Operations Guide that can be downloaded from the Microsoft Download Center.
The IFilter works with BlueCielo products with the following limitations:
- Text is recognized according to the languages selected in the local group policy as described in the document above.
- Text is extracted for multi-page TIFF documents if the Force TIFF IFilter to OCR every page in a TIFF document option is enabled as described in the document above.