Building Optical Character Recognition using reusable libraries
by sarvan Updated: Sep 1, 2021
Solution Kit ย
Optical character recognition (OCR) is a technology solution discovered to automate data extraction. The data is extracted from printed or written text from a scanned document or image file. Once after extraction, convert the text into a machine-readable format for data processing like editing or searchingโthe more accurate your OCR system in processing and identifying the characters in an image, the better. The processing steps for an OCR are: 1. Image Extraction 2. Image Preprocessing 3. Segmentation 4. Training a Neural Network 5. Post-Processing You can customize and create an OCR system using reusable libraries.
Image Preprocessing
icr-character-image-preprocessorby this-is-ari
An Python application used to pre-process images of individual handwritten characters to increase OCR/ICR accuracy
icr-character-image-preprocessorby this-is-ari
Python 36 Version:Current License: Permissive (Apache-2.0)
ocr-tesseract-wrapperby pjalusic
Tiny wrapper around pytesseract with image preprocessing and OCR configurations
ocr-tesseract-wrapperby pjalusic
Python 1 Version:0.0.2 License: Permissive (MIT)
torchioby fepegar
Medical imaging toolkit for deep learning
torchioby fepegar
Python 1743 Version:v0.18.90 License: Permissive (Apache-2.0)
display_ocrby arturaugusto
Real-time image preprocess and OCR.
display_ocrby arturaugusto
Python 200 Version:Current License: Strong Copyleft (GPL-2.0)
Image Extraction
ocr-text-extractionby jasonlfunk
A simple program to extract the text from an image before performing OCR
ocr-text-extractionby jasonlfunk
Python 200 Version:Current License: Permissive (MIT)
ocrby victorqribeiro
Simple app to extract text from pictures using Tesseract
ocrby victorqribeiro
HTML 95 Version:Current License: Permissive (MIT)
php-apache-tikaby vaites
Apache Tika bindings for PHP: extract text and metadata from documents, images and other formats
php-apache-tikaby vaites
PHP 90 Version:v1.2.2 License: Permissive (MIT)
img2txtby mathigatti
Easy formatted text extraction from images using Google Vision API
img2txtby mathigatti
Python 19 Version:Current License: Permissive (MIT)
textractby dbashford
node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
textractby dbashford
HTML 1554 Version:Current License: Permissive (MIT)
pdf-text-extractby nisaacson
Extract text from pdfs that contain searchable pdf text
pdf-text-extractby nisaacson
JavaScript 102 Version:Current License: Permissive (BSD-3-Clause)
pineby sdushantha
๐ท A simple image to text OCR scanner for macOS
pineby sdushantha
Python 47 Version:Current License: Permissive (MIT)
Image Segmentation
Implementation of different kinds of Unet Models for Image Segmentation - Unet , RCNN-Unet, Attention Unet, RCNN-Attention Unet, Nested Unet
Unet-Segmentation-Pytorch-Nest-of-Unetsby bigmb
Python 1405 Version:Current License: Permissive (MIT)
dilationby fyu
Dilated Convolution for Semantic Image Segmentation
dilationby fyu
Python 741 Version:Current License: Permissive (MIT)
LightNetby ansleliu
LightNet: Light-weight Networks for Semantic Image Segmentation (Cityscapes and Mapillary Vistas Dataset)
LightNetby ansleliu
Python 702 Version:Current License: Permissive (MIT)
ImageSegmentationby AKSHAYUBHAT
Perform image segmentation and background removal in javascript using superpixes
ImageSegmentationby AKSHAYUBHAT
JavaScript 543 Version:Current License: Permissive (MIT)
vnet.pytorchby mattmacy
A PyTorch implementation for V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation
vnet.pytorchby mattmacy
Python 623 Version:Current License: Permissive (BSD-3-Clause)
3DUnetCNNby ellisdg
Pytorch 3D U-Net Convolution Neural Network (CNN) designed for medical image segmentation
3DUnetCNNby ellisdg
Python 1630 Version:Current License: Permissive (MIT)
Post-Processing
go-ocrby maxim2266
A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.
go-ocrby maxim2266
Go 31 Version:v0.4.2 License: Permissive (BSD-3-Clause)
ocromoreby UB-Mannheim
Process, enhance and evaluate multiple OCR output.
ocromoreby UB-Mannheim
Python 16 Version:Current License: Permissive (Apache-2.0)
unpaperby ImageProcessing-ElectronicPublications
Post-processing tool for scanned pages
unpaperby ImageProcessing-ElectronicPublications
C 1 Version:0.5.3 License: Strong Copyleft (GPL-2.0)
ocr-post-processing-with-googleby PedroBarcha
Given a text, wrap it into phrases and send them to google's search engine. If it yields a "did you mean:", substitute the original phrase for the suggestion. The software was originally developed for correcting OCR output.
ocr-post-processing-with-googleby PedroBarcha
Python 2 Version:Current License: Strong Copyleft (GPL-3.0)
Training a Neural Network
tensorpackby tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
tensorpackby tensorpack
Python 6274 Version:doc-v0.9.0.1 License: Permissive (Apache-2.0)
chainerby chainer
A flexible framework of neural networks for deep learning
chainerby chainer
Python 5789 Version:v7.8.1.post1 License: Permissive (MIT)
tensorspaceby tensorspace-team
Neural network 3D visualization framework, build interactive and intuitive model in browsers, support pre-trained deep learning models from TensorFlow, Keras, TensorFlow.js
tensorspaceby tensorspace-team
JavaScript 4873 Version:v0.6 License: Permissive (Apache-2.0)
textgenrnnby minimaxir
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
textgenrnnby minimaxir
Python 4902 Version:v2.0.0 License: Others (Non-SPDX)
igniteby pytorch
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
igniteby pytorch
Python 4277 Version:v0.4.12 License: Permissive (BSD-3-Clause)
rnn_ctcby rakeshvar
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
rnn_ctcby rakeshvar
Python 220 Version:Current License: Permissive (Apache-2.0)