kandi background

Building Optical Character Recognition using reusable libraries

by sarvan

Optical character recognition (OCR) is a technology solution discovered to automate data extraction. The data is extracted from printed or written text from a scanned document or image file. Once after extraction, convert the text into a machine-readable format for data processing like editing or searching—the more accurate your OCR system in processing and identifying the characters in an image, the better. The processing steps for an OCR are: 1. Image Extraction 2. Image Preprocessing 3. Segmentation 4. Training a Neural Network 5. Post-Processing You can customize and create an OCR system using reusable libraries.

Image Preprocessing

Image Extraction

Image Segmentation

Post-Processing

Training a Neural Network