pdfc | Simple python script to compress PDF | Document Editor library

by theeko74 Python Version: Current License: No License

X-Ray Key Features Code Snippets Community Discussions(1)Vulnerabilities Install Support

kandi X-RAY | pdfc Summary

pdfc is a Python library typically used in Editor, Document Editor applications. pdfc has no bugs and it has low support. However pdfc has 1 vulnerabilities and it build file is not available. You can download it from GitHub.

Simple python script to compress PDF.

Support

Quality

Security

License

Reuse

Support

pdfc has a low active ecosystem.

It has 283 star(s) with 105 fork(s). There are 6 watchers for this library.

It had no major release in the last 6 months.

There are 1 open issues and 6 have been closed. On average issues are closed in 314 days. There are 6 open pull requests and 0 closed requests.

It has a neutral sentiment in the developer community.

The latest version of pdfc is current.

Quality

pdfc has 0 bugs and 0 code smells.

Security

pdfc has 1 vulnerability issues reported (1 critical, 0 high, 0 medium, 0 low).

pdfc code analysis shows 0 unresolved vulnerabilities.

There are 0 security hotspots that need review.

License

pdfc does not have a standard license declared.

Check the repository for any license declaration and review the terms closely.

Without a license, all rights are reserved, and you cannot use the library in your applications.

Reuse

pdfc releases are not available. You will need to build from source code and install.

pdfc has no build file. You will be need to create the build yourself to build the component from source.

Installation instructions, examples and code snippets are available.

It has 68 lines of code, 3 functions and 1 files.

It has low code complexity. Code complexity directly impacts maintainability of the code.

Top functions reviewed by kandi - BETA

kandi has reviewed pdfc and discovered the below as its top functions. This is intended to give you an instant insight into pdfc implemented functionality, and help decide if they suit your requirements.

Compress a PDF file
Return the path to Ghostscript executable

Get all kandi verified functions for this library.

pdfc Key Features

No Key Features are available at this moment for pdfc.

pdfc Examples and Code Snippets

No Code Snippets are available at this moment for pdfc.

Community Discussions

Trending Discussions on pdfc

Difference between text as image and graphics as image

QUESTION

Difference between text as image and graphics as image

Asked 2017-Sep-20 at 06:34

The question seems to be weird, but I need to ask this, since I am witnessing a quite interesting output when I compare text as image and graphics as image.

Ideally I am in process of identifying an tool, or algorithm to compare two pdfs, generate output which will highlight the difference between them.

There are possibilities in pdfs, which will have text as image format (legacy text on papers, are converted to pdfs).

and we are doing migration of those legacy pdfs, and finally we are comparing with legacy and converted pdf output.

I am evaluating couple of tools like Adobe dc pro, i-net pdfc and power pdf etc, for comparing two pdfs.

While evaluating, I am able to see graphic images are getting compared(not accurate either) on either side of the pdfs. Where as text as images are completely ignored, unanimously same results in all the tools.

But I am more interested in text as image, since we deal more of legacy text pdfs.

Below, is attached graphic image comparison result, where it could able to capture the differences between the images.

But when I compare text image, differences are not highlighted in the tool.

What I understand from this, text is not compared as image graphics, and tool is completely ignoring the comparison. I would like have clarification whether my assumption is correct.

Secondly, I would like to know how to compare text image in pdfs to generate the differences?.

...

ANSWER

Answered 2017-Sep-20 at 06:34

I'm working for the company that is author of i-net PDFC so I'll answer your first question as well:

Your assumption is correct. i-net PDFC is able to compare images and shapes, but it cannot detect if some content completely changed it's meaning, e.G. a line shape that is used to draw a letter or in your case an image that has to be recognized as text. Recognizing ASCII art as image won't work for the same reason either. Such cases will always be detected as differences even though their visual appearance is similar.

On your second question: Using an OCR conversion tool for one or both documents is a common solution to this problem. A simple image comparison of the compared pages in unlikely to work due to the different font styles and line wrappings in the converted file. Please note that most OCR applications will use the rendered page images for the recognition. This may lead to incorrect recognition results even if there are no images in the PDF file.

i-net Software is aware of this general issue and an OCR module is currently in development. It'll provide an option to apply the recognition solely to the images in the PDF files.

Source https://stackoverflow.com/questions/45941373

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

CVE-2020-11431 CRITICAL

The documentation component in i-net Clear Reports 16.0 to 19.2, HelpDesk 8.0 to 8.3, and PDFC 4.3 to 6.2 allows a remote unauthenticated attacker to read arbitrary system files and directories on the target server via Directory Traversal.

https://www.inetsoftware.de/support/news/i-net-clear-reports-security-advisory-2020-apr-06

https://www.inetsoftware.de/support/news/i-net-helpdesk-sicherheitsankuendigung-2020-apr-06

https://www.inetsoftware.de/support/news/i-net-pdfc-security-advisory-2020-apr-06

https://www.inetsoftware.de/documentation/clear-reports/release-notes/releases/changes_19.2

Install pdfc

Install dependency Ghostscript. On MacOSX: brew install ghostscript On Windows: install binaries via [official website] (https://www.ghostscript.com/). Create a symbolic link if you want to run it everywhere in bash ln -s pdf_creator.py pdfc. Add in PATH environment variable On MacOSX: echo export=/absolute/path/of/the/folder/script/:$PATH >> ~/.bash_profile.
Install dependency Ghostscript. On MacOSX: brew install ghostscript On Windows: install binaries via [official website] (https://www.ghostscript.com/)
Create a symbolic link if you want to run it everywhere in bash ln -s pdf_creator.py pdfc
Add in PATH environment variable On MacOSX: echo export=/absolute/path/of/the/folder/script/:$PATH >> ~/.bash_profile

Support

For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .

Find more information at: