Building Optical Character Recognition using reusable libraries

by sarvan Updated: Sep 1, 2021

Solution Kit

Optical character recognition (OCR) is a technology solution discovered to automate data extraction. The data is extracted from printed or written text from a scanned document or image file. Once after extraction, convert the text into a machine-readable format for data processing like editing or searching—the more accurate your OCR system in processing and identifying the characters in an image, the better. The processing steps for an OCR are: 1. Image Extraction 2. Image Preprocessing 3. Segmentation 4. Training a Neural Network 5. Post-Processing You can customize and create an OCR system using reusable libraries.

Image Preprocessing

icr-character-image-preprocessorby this-is-ari

Python

Version:Current

License: Permissive (Apache-2.0)

An Python application used to pre-process images of individual handwritten characters to increase OCR/ICR accuracy

Support

Quality

Security

License

Reuse

icr-character-image-preprocessorby this-is-ari

Python 36 Version:Current License: Permissive (Apache-2.0)

An Python application used to pre-process images of individual handwritten characters to increase OCR/ICR accuracy

Support

Quality

Security

License

Reuse

ocr-tesseract-wrapperby pjalusic

Python

Version:0.0.2

License: Permissive (MIT)

Tiny wrapper around pytesseract with image preprocessing and OCR configurations

Support

Quality

Security

License

Reuse

ocr-tesseract-wrapperby pjalusic

Python 1 Version:0.0.2 License: Permissive (MIT)

Tiny wrapper around pytesseract with image preprocessing and OCR configurations

Support

Quality

Security

License

Reuse

torchioby fepegar

Python

1743

Version:v0.18.90

License: Permissive (Apache-2.0)

Medical imaging toolkit for deep learning

Support

Quality

Security

License

Reuse

torchioby fepegar

Python 1743 Version:v0.18.90 License: Permissive (Apache-2.0)

Medical imaging toolkit for deep learning

Support

Quality

Security

License

Reuse

display_ocrby arturaugusto

Python

200

Version:Current

License: Strong Copyleft (GPL-2.0)

Real-time image preprocess and OCR.

Support

Quality

Security

License

Reuse

display_ocrby arturaugusto

Python 200 Version:Current License: Strong Copyleft (GPL-2.0)

Real-time image preprocess and OCR.

Support

Quality

Security

License

Reuse

Image Extraction

ocr-text-extractionby jasonlfunk

Python

200

Version:Current

License: Permissive (MIT)

A simple program to extract the text from an image before performing OCR

Support

Quality

Security

License

Reuse

ocr-text-extractionby jasonlfunk

Python 200 Version:Current License: Permissive (MIT)

A simple program to extract the text from an image before performing OCR

Support

Quality

Security

License

Reuse

ocrby victorqribeiro

HTML

Version:Current

License: Permissive (MIT)

Simple app to extract text from pictures using Tesseract

Support

Quality

Security

License

Reuse

ocrby victorqribeiro

HTML 95 Version:Current License: Permissive (MIT)

Simple app to extract text from pictures using Tesseract

Support

Quality

Security

License

Reuse

php-apache-tikaby vaites

PHP

Version:v1.2.2

License: Permissive (MIT)

Apache Tika bindings for PHP: extract text and metadata from documents, images and other formats

Support

Quality

Security

License

Reuse

php-apache-tikaby vaites

PHP 90 Version:v1.2.2 License: Permissive (MIT)

Apache Tika bindings for PHP: extract text and metadata from documents, images and other formats

Support

Quality

Security

License

Reuse

img2txtby mathigatti

Python

Version:Current

License: Permissive (MIT)

Easy formatted text extraction from images using Google Vision API

Support

Quality

Security

License

Reuse

img2txtby mathigatti

Python 19 Version:Current License: Permissive (MIT)

Easy formatted text extraction from images using Google Vision API

Support

Quality

Security

License

Reuse

textractby dbashford

HTML

1554

Version:Current

License: Permissive (MIT)

node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!

Support

Quality

Security

License

Reuse

textractby dbashford

HTML 1554 Version:Current License: Permissive (MIT)

node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!

Support

Quality

Security

License

Reuse

pdf-text-extractby nisaacson

JavaScript

102

Version:Current

License: Permissive (BSD-3-Clause)

Extract text from pdfs that contain searchable pdf text

Support

Quality

Security

License

Reuse

pdf-text-extractby nisaacson

JavaScript 102 Version:Current License: Permissive (BSD-3-Clause)

Extract text from pdfs that contain searchable pdf text

Support

Quality

Security

License

Reuse

pineby sdushantha

Python

Version:Current

License: Permissive (MIT)

📷 A simple image to text OCR scanner for macOS

Support

Quality

Security

License

Reuse

pineby sdushantha

Python 47 Version:Current License: Permissive (MIT)

📷 A simple image to text OCR scanner for macOS

Support

Quality

Security

License

Reuse

Image Segmentation

Unet-Segmentation-Pytorch-Nest-of-Unetsby bigmb

Python

1405

Version:Current

License: Permissive (MIT)

Implementation of different kinds of Unet Models for Image Segmentation - Unet , RCNN-Unet, Attention Unet, RCNN-Attention Unet, Nested Unet

Support

Quality

Security

License

Reuse

Unet-Segmentation-Pytorch-Nest-of-Unetsby bigmb

Python 1405 Version:Current License: Permissive (MIT)

Implementation of different kinds of Unet Models for Image Segmentation - Unet , RCNN-Unet, Attention Unet, RCNN-Attention Unet, Nested Unet

Support

Quality

Security

License

Reuse

dilationby fyu

Python

741

Version:Current

License: Permissive (MIT)

Dilated Convolution for Semantic Image Segmentation

Support

Quality

Security

License

Reuse

dilationby fyu

Python 741 Version:Current License: Permissive (MIT)

Dilated Convolution for Semantic Image Segmentation

Support

Quality

Security

License

Reuse

LightNetby ansleliu

Python

702

Version:Current

License: Permissive (MIT)

LightNet: Light-weight Networks for Semantic Image Segmentation (Cityscapes and Mapillary Vistas Dataset)

Support

Quality

Security

License

Reuse

LightNetby ansleliu

Python 702 Version:Current License: Permissive (MIT)

LightNet: Light-weight Networks for Semantic Image Segmentation (Cityscapes and Mapillary Vistas Dataset)

Support

Quality

Security

License

Reuse

ImageSegmentationby AKSHAYUBHAT

JavaScript

543

Version:Current

License: Permissive (MIT)

Perform image segmentation and background removal in javascript using superpixes

Support

Quality

Security

License

Reuse

ImageSegmentationby AKSHAYUBHAT

JavaScript 543 Version:Current License: Permissive (MIT)

Perform image segmentation and background removal in javascript using superpixes

Support

Quality

Security

License

Reuse

vnet.pytorchby mattmacy

Python

623

Version:Current

License: Permissive (BSD-3-Clause)

A PyTorch implementation for V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation

Support

Quality

Security

License

Reuse

vnet.pytorchby mattmacy

Python 623 Version:Current License: Permissive (BSD-3-Clause)

A PyTorch implementation for V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation

Support

Quality

Security

License

Reuse

3DUnetCNNby ellisdg

Python

1630

Version:Current

License: Permissive (MIT)

Pytorch 3D U-Net Convolution Neural Network (CNN) designed for medical image segmentation

Support

Quality

Security

License

Reuse

3DUnetCNNby ellisdg

Python 1630 Version:Current License: Permissive (MIT)

Pytorch 3D U-Net Convolution Neural Network (CNN) designed for medical image segmentation

Support

Quality

Security

License

Reuse

Post-Processing

go-ocrby maxim2266

Version:v0.4.2

License: Permissive (BSD-3-Clause)

A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.

Support

Quality

Security

License

Reuse

go-ocrby maxim2266

Go 31 Version:v0.4.2 License: Permissive (BSD-3-Clause)

A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.

Support

Quality

Security

License

Reuse

ocromoreby UB-Mannheim

Python

Version:Current

License: Permissive (Apache-2.0)

Process, enhance and evaluate multiple OCR output.

Support

Quality

Security

License

Reuse

ocromoreby UB-Mannheim

Python 16 Version:Current License: Permissive (Apache-2.0)

Process, enhance and evaluate multiple OCR output.

Support

Quality

Security

License

Reuse

unpaperby ImageProcessing-ElectronicPublications

Version:0.5.3

License: Strong Copyleft (GPL-2.0)

Post-processing tool for scanned pages

Support

Quality

Security

License

Reuse

unpaperby ImageProcessing-ElectronicPublications

C 1 Version:0.5.3 License: Strong Copyleft (GPL-2.0)

Post-processing tool for scanned pages

Support

Quality

Security

License

Reuse

ocr-post-processing-with-googleby PedroBarcha

Python

Version:Current

License: Strong Copyleft (GPL-3.0)

Given a text, wrap it into phrases and send them to google's search engine. If it yields a "did you mean:", substitute the original phrase for the suggestion. The software was originally developed for correcting OCR output.

Support

Quality

Security

License

Reuse

ocr-post-processing-with-googleby PedroBarcha

Python 2 Version:Current License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

Training a Neural Network

tensorpackby tensorpack

Python

6274

Version:doc-v0.9.0.1

License: Permissive (Apache-2.0)

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Support

Quality

Security

License

Reuse

tensorpackby tensorpack

Python 6274 Version:doc-v0.9.0.1 License: Permissive (Apache-2.0)

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Support

Quality

Security

License

Reuse

chainerby chainer

Python

5789

Version:v7.8.1.post1

License: Permissive (MIT)

A flexible framework of neural networks for deep learning

Support

Quality

Security

License

Reuse

chainerby chainer

Python 5789 Version:v7.8.1.post1 License: Permissive (MIT)

A flexible framework of neural networks for deep learning

Support

Quality

Security

License

Reuse

tensorspaceby tensorspace-team

JavaScript

4873

Version:v0.6

License: Permissive (Apache-2.0)

Neural network 3D visualization framework, build interactive and intuitive model in browsers, support pre-trained deep learning models from TensorFlow, Keras, TensorFlow.js

Support

Quality

Security

License

Reuse

tensorspaceby tensorspace-team

JavaScript 4873 Version:v0.6 License: Permissive (Apache-2.0)

Neural network 3D visualization framework, build interactive and intuitive model in browsers, support pre-trained deep learning models from TensorFlow, Keras, TensorFlow.js

Support

Quality

Security

License

Reuse

textgenrnnby minimaxir

Python

4902

Version:v2.0.0

License: Others (Non-SPDX)

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

Support

Quality

Security

License

Reuse

textgenrnnby minimaxir

Python 4902 Version:v2.0.0 License: Others (Non-SPDX)

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

Support

Quality

Security

License

Reuse

igniteby pytorch

Python

4277

Version:v0.4.12

License: Permissive (BSD-3-Clause)

High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

Support

Quality

Security

License

Reuse

igniteby pytorch

Python 4277 Version:v0.4.12 License: Permissive (BSD-3-Clause)

High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

Support

Quality

Security

License

Reuse

rnn_ctcby rakeshvar

Python

220

Version:Current

License: Permissive (Apache-2.0)

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Support

Quality

Security

License

Reuse

rnn_ctcby rakeshvar

Python 220 Version:Current License: Permissive (Apache-2.0)

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Support

Quality

Security

License

Reuse

See similar Kits and Libraries

Open Weaver – Develop Applications Faster with Open Source

Terms
Privacy policy

Building Optical Character Recognition using reusable libraries

Open Weaver – Develop Applications Faster with Open Source

kandi

Community and Support

Company

Follow