OCR

OCR is an Optical Character Recognition (OCR) tool that businesses can use to accurately convert PDF and image files into PDF documents that can be searched by any Windows-based search process, tool or module.

Email [email protected] for a bespoke quote

Overview

Using full-text OCR or zonal data extraction tools, developers can rapidly and seamlessly capture network or cloud-based data.

Besides full-text OCR, zonal data capture can verify, capture and extract data from image-based or scanned PDF files. OCR supports 120+ languages, including Traditional Chinese, Simplified Chinese, Arabic, Thai, Japanese and Korean.
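To illustrate what zonal capture means in practice, here is a minimal Python sketch. It assumes the recognition step has already produced words with pixel bounding boxes; the `Word` and `Zone` classes, the `extract_zones` helper and the sample coordinates are all hypothetical and are not part of the product's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """One recognized word with its bounding box in page pixels (hypothetical shape)."""
    text: str
    left: int
    top: int
    right: int
    bottom: int


@dataclass(frozen=True)
class Zone:
    """A named rectangular region of the page to capture data from (hypothetical)."""
    name: str
    left: int
    top: int
    right: int
    bottom: int

    def contains(self, word: Word) -> bool:
        # Judge each word by its center point, so a word that overlaps
        # the zone edge is either fully in or fully out.
        cx = (word.left + word.right) / 2
        cy = (word.top + word.bottom) / 2
        return self.left <= cx <= self.right and self.top <= cy <= self.bottom


def extract_zones(words, zones):
    """Return {zone name: text} by joining the words that fall inside each zone."""
    result = {}
    for zone in zones:
        inside = [w for w in words if zone.contains(w)]
        # Approximate reading order: top to bottom, then left to right.
        inside.sort(key=lambda w: (w.top, w.left))
        result[zone.name] = " ".join(w.text for w in inside)
    return result


# Made-up sample data standing in for the output of a full-page OCR pass.
words = [
    Word("Invoice", 40, 30, 120, 50),
    Word("No.", 125, 30, 160, 50),
    Word("10042", 400, 30, 470, 50),
    Word("Total:", 40, 700, 100, 720),
    Word("129.50", 400, 700, 470, 720),
]
zones = [
    Zone("invoice_number", 380, 20, 500, 60),
    Zone("total", 380, 690, 500, 730),
]
fields = extract_zones(words, zones)
print(fields)
```

Each zone acts as a template field: only the text inside its rectangle is returned, which is what separates zonal capture from full-text OCR of the whole page.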

Features

Automate document conversion to searchable-text PDF
Access OCR capabilities from any platform
Multi-page input formats: TIFF/TIFF-FX and PDF
PDF output is optimized for searchability and/or editability
Outputs to PDF 1.3 - 1.7
Outputs to PDF/A-1a, 2a, 3a, 1b, 3b, 2u, 3u
Output options: file compression, PDF encryption (40-, 128-, or 256-bit), PDF info (author, subject, keywords, etc.)
ISO Standards: Generate PDF and PDF/A from source files
Metadata: Include document information in output files
Deskewing, despeckling, and autorotate
Input formats: TIFF, JPEG, JPEG 2000, PNG, BMP and PDF, with REST API
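The options in this list can be read as a job description that a client checks before submitting a file. The Python sketch below shows one possible way to do that. The `build_job` function, the dictionary keys and the mapping of formats to file extensions are assumptions made for illustration; the product's actual REST endpoints and parameter names are not described on this page and are not shown here.

```python
from pathlib import Path

# Values taken from the feature list; the file extensions are an assumed mapping.
INPUT_EXTENSIONS = {".tif", ".tiff", ".jpg", ".jpeg", ".jp2", ".png", ".bmp", ".pdf"}
PDF_VERSIONS = {"1.3", "1.4", "1.5", "1.6", "1.7"}
PDFA_LEVELS = {"1a", "2a", "3a", "1b", "3b", "2u", "3u"}
ENCRYPTION_BITS = {40, 128, 256}


def build_job(input_file, pdf_version="1.7", pdfa_level=None,
              encryption_bits=None, info=None):
    """Validate the options and return a plain dict describing a conversion job."""
    ext = Path(input_file).suffix.lower()
    if ext not in INPUT_EXTENSIONS:
        raise ValueError(f"unsupported input format: {input_file}")
    if pdf_version not in PDF_VERSIONS:
        raise ValueError(f"unsupported PDF version: {pdf_version}")
    if pdfa_level is not None and pdfa_level not in PDFA_LEVELS:
        raise ValueError(f"unsupported PDF/A level: {pdfa_level}")
    if encryption_bits is not None and encryption_bits not in ENCRYPTION_BITS:
        raise ValueError(f"unsupported encryption strength: {encryption_bits}")
    if pdfa_level is not None and encryption_bits is not None:
        # The PDF/A standard does not permit encrypted files.
        raise ValueError("PDF/A output cannot be encrypted")
    return {
        "input_file": input_file,
        "pdf_version": pdf_version,
        "pdfa_level": pdfa_level,
        "encryption_bits": encryption_bits,
        "info": dict(info or {}),  # author, subject, keywords, etc.
    }


job = build_job("scan.tiff", pdfa_level="2u", info={"author": "Records team"})
print(job)
```

Rejecting bad combinations on the client side, such as an unsupported input type or an encrypted PDF/A request, avoids a round trip to the service for a job that could not succeed.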