gocr | OCR implementation with Golang | Computer Vision library
kandi X-RAY | gocr Summary
kandi X-RAY | gocr Summary
Created by Withee Poositasai. Simple Thai OCR Project, CPE489 Image Processing and Computer Vision, KMUTT.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
- Main entry point
- SplitLine splits an array of rows into an array of line numbers
- GetSegmentChar returns the SegmentChar of the image array
- writeGlyp draws a string to the glyph .
- Resize resizes the given dimensions to the given dimensions .
- MedianFilter returns the median filter
- GetImgArray returns an array of image dimensions
- getThresholdValue returns the threshold value
- GenTemplate generate glyphs
- GetImgMat builds an image mat from an array of dimensions .
gocr Key Features
gocr Examples and Code Snippets
Community Discussions
Trending Discussions on gocr
QUESTION
Please allow me to ask a rather newbie question. So far, I have been using local tools like imagemagick or GOCR to perform the job, but that is rather old-fashioned, and I am urged to "move to google cloud AI".
The setupI have a (training) data set of various documents (as JPG and PDF) of different kinds, and by certain features (like prevailing color, repetitive layout) I intend to classify them, e.g. as invoice type 1
, invoice type 2
, not an invoice
. In a 2nd step, I would like to OCR certain predefined areas of each document and extract e.g. the address of the company sending the invoice and the date.
- In a modern platform as a service (pass), I have already set up an UI where I can upload new files. These are then locally stored in a directory with filenames (or in a MongoDB). Meta info like upload timestamp, user, original file name is stored in a DB.
- The newly uploaded file should should then be submitted to google cloud which should do the classification step, and deliver back the label to be saved in the database.
- The document pages should be auto-cropped, i.e. black or white margins are removed, most probably with google cloud as well. The parameters of the crop should be persisted in the DB.
- In case it is e.g. an invoice, OCR should be performed (again by google cloud) for certain regions of the documents, e.g. a bounding box of spanning from the mid of the page to the right margin in the upper 10% of the cropped page. The results of the OCR should be again persisted locally.
I seem to be missing the correct search term to figure out how to do it with google cloud. Is there an google-API (e.g. REST), I can use to upload and which gives me back the results of steps 2 to 4?
...ANSWER
Answered 2021-Feb-26 at 15:49I think that your best option here is to use Document AI (REST API and Libraries).
Using Document AI, you can:
- Convert images to text
- Classify documents
- Analyze and extract entities
Additionally, for your use case, we have a new Document AI feature that is still in preview and has limited access which is the Invoice parser.
Invoice parser is similar to Form parser but for invoices instead of forms. Check out the Invoice parser page and you will see what I mean by preview and limited access.
AFIK, there isn't any GCP tool for image edition.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install gocr
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page