pdfc | Simple python script to compress PDF | Document Editor library

 by   theeko74 Python Version: Current License: No License

kandi X-RAY | pdfc Summary

kandi X-RAY | pdfc Summary

pdfc is a Python library typically used in Editor, Document Editor applications. pdfc has no bugs and it has low support. However pdfc has 1 vulnerabilities and it build file is not available. You can download it from GitHub.

Simple python script to compress PDF.
Support
    Quality
      Security
        License
          Reuse

            kandi-support Support

              pdfc has a low active ecosystem.
              It has 283 star(s) with 105 fork(s). There are 6 watchers for this library.
              OutlinedDot
              It had no major release in the last 6 months.
              There are 1 open issues and 6 have been closed. On average issues are closed in 314 days. There are 6 open pull requests and 0 closed requests.
              It has a neutral sentiment in the developer community.
              The latest version of pdfc is current.

            kandi-Quality Quality

              pdfc has 0 bugs and 0 code smells.

            kandi-Security Security

              OutlinedDot
              pdfc has 1 vulnerability issues reported (1 critical, 0 high, 0 medium, 0 low).
              pdfc code analysis shows 0 unresolved vulnerabilities.
              There are 0 security hotspots that need review.

            kandi-License License

              pdfc does not have a standard license declared.
              Check the repository for any license declaration and review the terms closely.
              OutlinedDot
              Without a license, all rights are reserved, and you cannot use the library in your applications.

            kandi-Reuse Reuse

              pdfc releases are not available. You will need to build from source code and install.
              pdfc has no build file. You will be need to create the build yourself to build the component from source.
              Installation instructions, examples and code snippets are available.
              It has 68 lines of code, 3 functions and 1 files.
              It has low code complexity. Code complexity directly impacts maintainability of the code.

            Top functions reviewed by kandi - BETA

            kandi has reviewed pdfc and discovered the below as its top functions. This is intended to give you an instant insight into pdfc implemented functionality, and help decide if they suit your requirements.
            • Compress a PDF file
            • Return the path to Ghostscript executable
            Get all kandi verified functions for this library.

            pdfc Key Features

            No Key Features are available at this moment for pdfc.

            pdfc Examples and Code Snippets

            No Code Snippets are available at this moment for pdfc.

            Community Discussions

            QUESTION

            Difference between text as image and graphics as image
            Asked 2017-Sep-20 at 06:34

            The question seems to be weird, but I need to ask this, since I am witnessing a quite interesting output when I compare text as image and graphics as image.

            Ideally I am in process of identifying an tool, or algorithm to compare two pdfs, generate output which will highlight the difference between them.

            There are possibilities in pdfs, which will have text as image format (legacy text on papers, are converted to pdfs).

            and we are doing migration of those legacy pdfs, and finally we are comparing with legacy and converted pdf output.

            I am evaluating couple of tools like Adobe dc pro, i-net pdfc and power pdf etc, for comparing two pdfs.

            While evaluating, I am able to see graphic images are getting compared(not accurate either) on either side of the pdfs. Where as text as images are completely ignored, unanimously same results in all the tools.

            But I am more interested in text as image, since we deal more of legacy text pdfs.

            Below, is attached graphic image comparison result, where it could able to capture the differences between the images.

            But when I compare text image, differences are not highlighted in the tool.

            What I understand from this, text is not compared as image graphics, and tool is completely ignoring the comparison. I would like have clarification whether my assumption is correct.

            Secondly, I would like to know how to compare text image in pdfs to generate the differences?.

            ...

            ANSWER

            Answered 2017-Sep-20 at 06:34

            I'm working for the company that is author of i-net PDFC so I'll answer your first question as well:

            Your assumption is correct. i-net PDFC is able to compare images and shapes, but it cannot detect if some content completely changed it's meaning, e.G. a line shape that is used to draw a letter or in your case an image that has to be recognized as text. Recognizing ASCII art as image won't work for the same reason either. Such cases will always be detected as differences even though their visual appearance is similar.

            On your second question: Using an OCR conversion tool for one or both documents is a common solution to this problem. A simple image comparison of the compared pages in unlikely to work due to the different font styles and line wrappings in the converted file. Please note that most OCR applications will use the rendered page images for the recognition. This may lead to incorrect recognition results even if there are no images in the PDF file.

            i-net Software is aware of this general issue and an OCR module is currently in development. It'll provide an option to apply the recognition solely to the images in the PDF files.

            Source https://stackoverflow.com/questions/45941373

            Community Discussions, Code Snippets contain sources that include Stack Exchange Network

            Vulnerabilities

            The documentation component in i-net Clear Reports 16.0 to 19.2, HelpDesk 8.0 to 8.3, and PDFC 4.3 to 6.2 allows a remote unauthenticated attacker to read arbitrary system files and directories on the target server via Directory Traversal.

            Install pdfc

            Install dependency Ghostscript. On MacOSX: brew install ghostscript On Windows: install binaries via [official website] (https://www.ghostscript.com/). Create a symbolic link if you want to run it everywhere in bash ln -s pdf_creator.py pdfc. Add in PATH environment variable On MacOSX: echo export=/absolute/path/of/the/folder/script/:$PATH >> ~/.bash_profile.
            Install dependency Ghostscript. On MacOSX: brew install ghostscript On Windows: install binaries via [official website] (https://www.ghostscript.com/)
            Create a symbolic link if you want to run it everywhere in bash ln -s pdf_creator.py pdfc
            Add in PATH environment variable On MacOSX: echo export=/absolute/path/of/the/folder/script/:$PATH >> ~/.bash_profile

            Support

            For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .
            Find more information at:

            Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items

            Find more libraries
            CLONE
          • HTTPS

            https://github.com/theeko74/pdfc.git

          • CLI

            gh repo clone theeko74/pdfc

          • sshUrl

            git@github.com:theeko74/pdfc.git

          • Stay Updated

            Subscribe to our newsletter for trending solutions and developer bootcamps

            Agree to Sign up and Terms & Conditions

            Share this Page

            share link