PaddleOCR | Awesome multilingual OCR toolkits based on PaddlePaddle | Machine Learning library

by PaddlePaddle Python Version: 2.7.5 License: Apache-2.0

X-Ray Key Features Code Snippets(3)Community Discussions(3)Vulnerabilities Install Support

kandi X-RAY | PaddleOCR Summary

PaddleOCR is a Python library typically used in Artificial Intelligence, Machine Learning, Deep Learning, Pytorch applications. PaddleOCR has no bugs, it has no vulnerabilities, it has build file available, it has a Permissive License and it has medium support. You can install using 'pip install PaddleOCR' or download it from GitHub, PyPI.

PaddleOCR aims to create multilingual, awesome, leading, and practical OCR tools that help users train better models and apply them into practice.

Support

Quality

Security

License

Reuse

Support

PaddleOCR has a medium active ecosystem.

It has 31086 star(s) with 6338 fork(s). There are 404 watchers for this library.

It had no major release in the last 12 months.

There are 1719 open issues and 5651 have been closed. On average issues are closed in 83 days. There are 154 open pull requests and 0 closed requests.

It has a neutral sentiment in the developer community.

The latest version of PaddleOCR is 2.7.5

Quality

PaddleOCR has 0 bugs and 0 code smells.

Security

PaddleOCR has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

PaddleOCR code analysis shows 0 unresolved vulnerabilities.

There are 0 security hotspots that need review.

License

PaddleOCR is licensed under the Apache-2.0 License. This license is Permissive.

Permissive licenses have the least restrictions, and you can use them in most projects.

Reuse

PaddleOCR releases are available to install and integrate.

Deployable package is available in PyPI.

Build file is available. You can build the component from source.

PaddleOCR saves you 6686 person hours of effort in developing the same functionality from scratch.

It has 33455 lines of code, 1199 functions and 201 files.

It has medium code complexity. Code complexity directly impacts maintainability of the code.

Top functions reviewed by kandi - BETA

kandi has reviewed PaddleOCR and discovered the below as its top functions. This is intended to give you an instant insight into PaddleOCR implemented functionality, and help decide if they suit your requirements.

Train a model
Convert preds to float32
Build the laloader
Evaluate a model
Create a predictor
Get output tensors
Return the current GPU id
Evaluate the image
Return the intersection of two polygons
Run the analysis
Recognize the image
Run soft_nms algorithm
Get the warp transformation matrix
Sort boxes according to layout
Generate one - hot attention layer
Process start tag
Match all match results
Recognize the table
Evaluate E2EE evaluation
Export a single model
Mouse move event handler
Preprocess training
Export the label
Load function label annotations
Evaluate E2E
Generate a configuration dictionary

Get all kandi verified functions for this library.

PaddleOCR Key Features

No Key Features are available at this moment for PaddleOCR.

PaddleOCR Examples and Code Snippets

PaddleSdk,使用Paddle的C导出dll,可自行编译Paddle官方预测库源码,Demo的模型改至PaddleOCR自己训练的识别模型.

Lines of Code : 77

License : No License

Copy

PD_CpuMathLibraryNumThreads
PD_CudnnEnabled
PD_DeleteAnalysisConfig
PD_DeletePaddleBuf
PD_DeletePaddleTensor
PD_DeletePass
PD_DeletePredictor
PD_DeleteZeroCopyTensor
PD_DestroyZeroCopyTensor
PD_DisableGlogInfo
PD_DisableGpu
PD_EnableCUDNN
PD_EnableMK

本地使用

HTML

Lines of Code : 37

License : No License

Copy

python server.py

'''
 * Debugger is active!
 * Debugger PIN: 109-572-001
 * Running on http://127.0.0.1:8090/ (Press CTRL+C to quit)
'''

python test-post.py

'''
{
  "服务状态": "success",
  "识别时间": "3.2900s",
  "识别结果": [
    "健康宝",
    "2022年06月02日",

车牌识别任务二次开发-模型训练

Python

Lines of Code : 13

License : Permissive (Apache-2.0)

Copy

git clone git@github.com:PaddlePaddle/PaddleOCR.git

#检测预训练模型：
mkdir models
cd models
wget https://paddleocr.bj.bcebos.com/PP-OCRv3/chinese/ch_PP-OCRv3_det_distill_train.tar
tar -xf ch_PP-OCRv3_det_distill_train.tar

#识别预训练模型：
wget https://paddleocr.

Community Discussions

Trending Discussions on PaddleOCR

sed replace line by escaping special chars

OCR - How to recognize numbers inside square boxes using python?

PaddleOCR error OSError: [WinError 126] The specified module could not be found

QUESTION

sed replace line by escaping special chars

Asked 2022-Mar-07 at 15:21

I am trying to replace lines using sed, and usually this works fine, but I am now encountering a string which does not seem to play ball with sed :(

file: test.py

...

ANSWER

Answered 2022-Mar-07 at 14:41

I think your first " should be a '

Also, you need to escape the \ which aren't part of the sed syntax e.g.:

Source https://stackoverflow.com/questions/71382615

QUESTION

OCR - How to recognize numbers inside square boxes using python?

Asked 2021-Nov-01 at 15:01

one problem with optical character recognition (ocr) is it can't recognize numbers properly when numbers are inside square boxes. one failure example with tesseract is discussed here : Tesseract - How can I recognize numbers in box? i was testing with paddleocr here : https://www.paddlepaddle.org.cn/hub/scene/ocr you can quickly try that api too,,for this input image :

it returns nothing..

again when i try image like this :

it returns all the numbers successfully.most of the times these number recognition(both printed and handwritten) failing when they are inside square boxes.for recognizing numbers inside square boxes we need to convert these so called numbers in box image into numbers in image by removing all the square boxes. i have some images like below :

see, the full square box outside numbers are not fully visible,,only some part of the square boxes are visible.i want to convert these images into image where i will have only the numbers by removing square boxes or some part of square boxes that is present in these images after then hopefully number/digit recognition will work. i tried this code :

...

ANSWER

Answered 2021-Nov-01 at 15:01

the code below for me is doing decent job but it's hyper parameter sensitive :

Source https://stackoverflow.com/questions/69736363

QUESTION

PaddleOCR error OSError: [WinError 126] The specified module could not be found

Asked 2021-Oct-01 at 15:10

I'm currently trying to run a piece of code using PaddleOCR, but I'm stuck at importing PaddleOCR. It gives me the error OSError: [WinError 126] The specified module could not be found.

...

ANSWER

Answered 2021-Oct-01 at 15:10

conda install -c conda-forge shapely

resolved by this line of code

Source https://stackoverflow.com/questions/69408018

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install PaddleOCR

You can install using 'pip install PaddleOCR' or download it from GitHub, PyPI.
You can use PaddleOCR like any standard Python library. You will need to make sure that you have a development environment consisting of a Python distribution including header files, a compiler, pip, and git installed. Make sure that your pip, setuptools, and wheel are up to date. When using pip it is generally recommended to install packages in a virtual environment to avoid changes to the system.