qt-box-editor | QT4 editor of tesseract-ocr box files | Computer Vision library
kandi X-RAY | qt-box-editor Summary
QT Box Editor is a tool for adjusting [tesseract-ocr] box files. The aim of this project is to provide an easy and efficient way to edit them regardless of file size. Release information can be found in the CHANGELOG file. Code and artwork contributions are welcome. QT Box Editor is the successor of the [tesseract-gui project], which is no longer developed. The name of the application was changed due to a name collision with project
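For readers unfamiliar with the format, a Tesseract box file is plain text with one line per symbol: the symbol, then its left, bottom, right, and top pixel coordinates (origin at the bottom-left corner of the image), then a page number. The Python sketch below parses and re-serializes such lines. It assumes the six-field layout with a trailing page number, and the names `Box`, `parse_box_line`, and `format_box_line` are hypothetical, not part of qt-box-editor or Tesseract.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """One symbol entry of a box file (hypothetical helper type)."""
    symbol: str
    left: int
    bottom: int
    right: int
    top: int
    page: int


def parse_box_line(line: str) -> Box:
    # Split from the right so that a multi-character symbol stays intact.
    # Assumes the six-field layout: symbol left bottom right top page.
    symbol, left, bottom, right, top, page = line.rstrip("\n").rsplit(" ", 5)
    return Box(symbol, int(left), int(bottom), int(right), int(top), int(page))


def format_box_line(box: Box) -> str:
    # Serialize back to the same space-separated layout.
    return f"{box.symbol} {box.left} {box.bottom} {box.right} {box.top} {box.page}"


if __name__ == "__main__":
    box = parse_box_line("s 734 494 751 519 0")
    box.right += 2  # widen the box by two pixels, as an editor might
    print(format_box_line(box))
```

Adjusting a box, which is the job of an editor like this one, then amounts to changing one or more of the four coordinates and writing the line back.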
qt-box-editor Key Features
qt-box-editor Examples and Code Snippets
Community Discussions
Trending Discussions on qt-box-editor
QUESTION
I am having trouble running jTessBoxEditor 1.7.3 on Ubuntu 16.04 64-bit.
I installed openjdk-9-jdk and got this message:
...
ANSWER
Answered 2017-Sep-13 at 13:10
Try launching it with Oracle JDK/JRE 8.
QUESTION
With regard to this question and this question, where I asked how to download thousands of PDFs and process them to extract their text with OCR, I am hitting a brick wall again when it comes to enhancing the text output.
I am interested in extracting the text of a batch of PDFs in order to search for surnames in it (I do not necessarily need to be able to read the rest of the text). The PDFs represent old newspaper articles, published between 1810 and 1832 and written in German Fraktur. This font seems to be particularly challenging for tesseract.
Q: How can I further improve the image quality so that tesseract has at least a chance of finding the surnames in the text? Which procedure would you suggest?
If we take this pdf as an example, I receive the following image when applying
...
ANSWER
Answered 2017-Jul-10 at 09:20
My father had a similar problem with his old newspaper clippings, and I had moderately good success by preprocessing with GhostScript and then applying Tesseract. Your mileage may vary. My commands (Windows) were
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install qt-box-editor
Support