Building Optical Character Recognition using reusable libraries
by sarvan Updated: Sep 1, 2021
Solution Kit
Optical character recognition (OCR) is a technology for automating data extraction from printed or handwritten text in a scanned document or image file. After extraction, the text is converted into a machine-readable format for data processing such as editing or searching. The more accurately your OCR system identifies the characters in an image, the better.
The processing steps for OCR are:
1. Image Extraction
2. Image Preprocessing
3. Segmentation
4. Training a Neural Network
5. Post-Processing
You can customize and create an OCR system using reusable libraries.
Image Preprocessing
icr-character-image-preprocessor by this-is-ari
A Python application used to pre-process images of individual handwritten characters to increase OCR/ICR accuracy
Python
36
Version:Current
License: Permissive (Apache-2.0)
ocr-tesseract-wrapper by pjalusic
Tiny wrapper around pytesseract with image preprocessing and OCR configurations
Python
1
Version:0.0.2
License: Permissive (MIT)
torchio by fepegar
Medical imaging toolkit for deep learning
Python
1743
Version:v0.18.90
License: Permissive (Apache-2.0)
display_ocr by arturaugusto
Real-time image preprocess and OCR.
Python
200
Version:Current
License: Strong Copyleft (GPL-2.0)
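To give a feel for what a preprocessing step does before OCR, here is a minimal sketch of binarization with Otsu's method in plain Python. It is not taken from any of the libraries above. It assumes a grayscale image stored as a list of rows with integer pixel values from 0 to 255, and dark text on a light background. The names `otsu_threshold` and `binarize` are illustrative only.

```python
from typing import List

Image = List[List[int]]  # grayscale rows, pixel values 0..255 (assumed layout)


def otsu_threshold(image: Image) -> int:
    """Return the threshold that maximizes between-class variance (Otsu's method)."""
    histogram = [0] * 256
    for row in image:
        for pixel in row:
            histogram[pixel] += 1

    total = sum(histogram)
    total_sum = sum(value * count for value, count in enumerate(histogram))

    best_threshold, best_variance = 0, -1.0
    background_count = 0
    background_sum = 0
    for threshold in range(256):
        background_count += histogram[threshold]
        if background_count == 0:
            continue  # no pixels at or below this value yet
        foreground_count = total - background_count
        if foreground_count == 0:
            break  # every pixel is already in the background class
        background_sum += threshold * histogram[threshold]
        mean_background = background_sum / background_count
        mean_foreground = (total_sum - background_sum) / foreground_count
        # Between-class variance, up to a constant factor of 1 / total**2.
        variance = (background_count * foreground_count
                    * (mean_background - mean_foreground) ** 2)
        if variance > best_variance:
            best_variance, best_threshold = variance, threshold
    return best_threshold


def binarize(image: Image) -> Image:
    """Map pixels at or below the threshold to 1 (ink) and the rest to 0 (paper)."""
    threshold = otsu_threshold(image)
    return [[1 if pixel <= threshold else 0 for pixel in row] for row in image]
```

In practice you would more likely call an image library for this step. The sketch only shows the idea of separating ink from paper before recognition.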
Image Extraction
ocr-text-extraction by jasonlfunk
A simple program to extract the text from an image before performing OCR
Python
200
Version:Current
License: Permissive (MIT)
ocr by victorqribeiro
Simple app to extract text from pictures using Tesseract
HTML
95
Version:Current
License: Permissive (MIT)
php-apache-tika by vaites
Apache Tika bindings for PHP: extract text and metadata from documents, images and other formats
PHP
90
Version:v1.2.2
License: Permissive (MIT)
img2txt by mathigatti
Easy formatted text extraction from images using Google Vision API
Python
19
Version:Current
License: Permissive (MIT)
textract by dbashford
node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
HTML
1554
Version:Current
License: Permissive (MIT)
pdf-text-extract by nisaacson
Extract text from pdfs that contain searchable pdf text
JavaScript
102
Version:Current
License: Permissive (BSD-3-Clause)
pine by sdushantha
A simple image to text OCR scanner for macOS
Python
47
Version:Current
License: Permissive (MIT)
Image Segmentation
Unet-Segmentation-Pytorch-Nest-of-Unets by bigmb
Implementation of different kinds of Unet Models for Image Segmentation - Unet, RCNN-Unet, Attention Unet, RCNN-Attention Unet, Nested Unet
Python
1405
Version:Current
License: Permissive (MIT)
dilation by fyu
Dilated Convolution for Semantic Image Segmentation
Python
741
Version:Current
License: Permissive (MIT)
LightNet by ansleliu
LightNet: Light-weight Networks for Semantic Image Segmentation (Cityscapes and Mapillary Vistas Dataset)
Python
702
Version:Current
License: Permissive (MIT)
ImageSegmentation by AKSHAYUBHAT
Perform image segmentation and background removal in JavaScript using superpixels
JavaScript
543
Version:Current
License: Permissive (MIT)
vnet.pytorch by mattmacy
A PyTorch implementation for V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation
Python
623
Version:Current
License: Permissive (BSD-3-Clause)
3DUnetCNN by ellisdg
Pytorch 3D U-Net Convolution Neural Network (CNN) designed for medical image segmentation
Python
1630
Version:Current
License: Permissive (MIT)
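The libraries above are neural semantic segmentation models. For OCR on a clean, single line of text, a much simpler classical technique is often shown first: a vertical projection profile, which splits characters at columns that contain no ink. The sketch below is that classical technique, not a U-Net style model. It assumes a binarized image (1 for ink, 0 for background) with rows of equal length, and the function names are illustrative.

```python
from typing import List, Tuple

Binary = List[List[int]]  # 1 = ink, 0 = background (assumed layout)


def column_spans(binary: Binary) -> List[Tuple[int, int]]:
    """Return half-open (start, end) column ranges that contain ink."""
    if not binary:
        return []
    width = len(binary[0])
    # Vertical projection profile: number of ink pixels in each column.
    profile = [sum(row[x] for row in binary) for x in range(width)]

    spans: List[Tuple[int, int]] = []
    start = None
    for x, ink in enumerate(profile):
        if ink and start is None:
            start = x  # a run of inked columns begins
        elif not ink and start is not None:
            spans.append((start, x))  # an empty column closes the run
            start = None
    if start is not None:
        spans.append((start, width))  # run reaches the right edge
    return spans


def crop_characters(binary: Binary) -> List[Binary]:
    """Cut the image into one sub-image per inked column run."""
    return [[row[start:end] for row in binary]
            for start, end in column_spans(binary)]
```

This approach breaks down when characters touch or overlap, which is one reason learned segmentation models are used for harder inputs.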
Post-Processing
go-ocr by maxim2266
A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.
Go
31
Version:v0.4.2
License: Permissive (BSD-3-Clause)
ocromore by UB-Mannheim
Process, enhance and evaluate multiple OCR output.
Python
16
Version:Current
License: Permissive (Apache-2.0)
unpaper by ImageProcessing-ElectronicPublications
Post-processing tool for scanned pages
C
1
Version:0.5.3
License: Strong Copyleft (GPL-2.0)
ocr-post-processing-with-google by PedroBarcha
Given a text, wrap it into phrases and send them to google's search engine. If it yields a "did you mean:", substitute the original phrase for the suggestion. The software was originally developed for correcting OCR output.
Python
2
Version:Current
License: Strong Copyleft (GPL-3.0)
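The phrase-by-phrase correction idea described for ocr-post-processing-with-google can be sketched without any network calls. The code below is not that project's implementation. It replaces the search engine with a hypothetical local suggester built on Python's standard `difflib` module and a small lowercase vocabulary, so all function names and the `cutoff` value are assumptions made for illustration.

```python
import difflib
from typing import Callable, List, Optional

Suggester = Callable[[str], Optional[str]]


def split_into_phrases(text: str, words_per_phrase: int = 3) -> List[str]:
    """Wrap the text into consecutive phrases of a few words each."""
    words = text.split()
    return [" ".join(words[i:i + words_per_phrase])
            for i in range(0, len(words), words_per_phrase)]


def make_vocabulary_suggester(vocabulary: List[str],
                              cutoff: float = 0.75) -> Suggester:
    """Build a stand-in for a "did you mean" service from a lowercase word list."""
    known = set(vocabulary)

    def suggest(phrase: str) -> Optional[str]:
        corrected = []
        changed = False
        for word in phrase.split():
            if word.lower() in known:
                corrected.append(word)
                continue
            # Closest vocabulary entry, if any is similar enough.
            matches = difflib.get_close_matches(
                word.lower(), vocabulary, n=1, cutoff=cutoff)
            if matches:
                corrected.append(matches[0])  # note: original casing is lost
                changed = True
            else:
                corrected.append(word)
        # Mirror a search engine: return None when there is nothing to suggest.
        return " ".join(corrected) if changed else None

    return suggest


def correct_text(text: str, suggest: Suggester,
                 words_per_phrase: int = 3) -> str:
    """Replace each phrase with its suggestion when one is offered."""
    result = []
    for phrase in split_into_phrases(text, words_per_phrase):
        suggestion = suggest(phrase)
        result.append(suggestion if suggestion is not None else phrase)
    return " ".join(result)
```

Because `correct_text` only depends on the `suggest` callable, the local suggester could be swapped for one that queries a remote service, as long as it returns either a replacement phrase or `None`.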
Training a Neural Network
tensorpack by tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
Python
6274
Version:doc-v0.9.0.1
License: Permissive (Apache-2.0)
chainer by chainer
A flexible framework of neural networks for deep learning
Python
5789
Version:v7.8.1.post1
License: Permissive (MIT)
tensorspace by tensorspace-team
Neural network 3D visualization framework, build interactive and intuitive model in browsers, support pre-trained deep learning models from TensorFlow, Keras, TensorFlow.js
JavaScript
4873
Version:v0.6
License: Permissive (Apache-2.0)
textgenrnn by minimaxir
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
Python
4902
Version:v2.0.0
License: Others (Non-SPDX)
ignite by pytorch
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
Python
4277
Version:v0.4.12
License: Permissive (BSD-3-Clause)
rnn_ctc by rakeshvar
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Python
220
Version:Current
License: Permissive (Apache-2.0)
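A network trained with Connectionist Temporal Classification (CTC) outputs one score per label for every time step, including a special blank label. Turning those scores into text needs a decoding step. The sketch below shows greedy (best-path) decoding in plain Python: pick the top label per frame, collapse consecutive repeats, then drop blanks. It is independent of rnn_ctc and Theano, and it assumes the blank label sits at index 0 of a hypothetical `symbols` list.

```python
from typing import List, Sequence


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]],
                      symbols: Sequence[str],
                      blank: int = 0) -> str:
    """Best-path CTC decoding: argmax per frame, merge repeats, remove blanks."""
    # Highest-scoring label index for each time step.
    best_path: List[int] = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]

    decoded: List[str] = []
    previous = None
    for label in best_path:
        # Emit a symbol only when the label changes and is not the blank.
        if label != previous and label != blank:
            decoded.append(symbols[label])
        previous = label
    return "".join(decoded)
```

The blank is what lets the network output doubled letters: the path "o, blank, o" decodes to "oo", while "o, o" collapses to a single "o". Greedy decoding is the simplest option. Beam search over the same scores can give better results at a higher cost.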