jwarc | Java library for reading and writing WARC files with a typed API
kandi X-RAY | jwarc Summary
kandi X-RAY | jwarc Summary
jwarc is a Java library. jwarc has no bugs, it has no vulnerabilities, it has build file available, it has a Permissive License and it has low support. You can download it from GitHub, Maven.
A Java library for reading and writing WARC files. This library includes a high level API modeling the standard record types as individual classes with typed accessors. The API is exensible and you can register extension record types and accessors for extension header fields. It uses a finite state machine parser generated from a strict grammar using Ragel. Gzipped records are automatically decompressed. The parser interprets ARC/1.1 record as if they are a WARC dialect and populates the appropriate WARC headers. All I/O is performed using NIO and an an effort is made to minimize data copies and share buffers whenever feasible. Direct buffers and even memory-mapped files can be used, but only with uncompressed WARCS until they're supported by Inflater (coming in JDK 11).
A Java library for reading and writing WARC files. This library includes a high level API modeling the standard record types as individual classes with typed accessors. The API is exensible and you can register extension record types and accessors for extension header fields. It uses a finite state machine parser generated from a strict grammar using Ragel. Gzipped records are automatically decompressed. The parser interprets ARC/1.1 record as if they are a WARC dialect and populates the appropriate WARC headers. All I/O is performed using NIO and an an effort is made to minimize data copies and share buffers whenever feasible. Direct buffers and even memory-mapped files can be used, but only with uncompressed WARCS until they're supported by Inflater (coming in JDK 11).
Support
Quality
Security
License
Reuse
Support
jwarc has a low active ecosystem.
It has 39 star(s) with 5 fork(s). There are 5 watchers for this library.
It had no major release in the last 12 months.
There are 12 open issues and 33 have been closed. On average issues are closed in 46 days. There are no pull requests.
It has a neutral sentiment in the developer community.
The latest version of jwarc is 0.28.3
Quality
jwarc has 0 bugs and 0 code smells.
Security
jwarc has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.
jwarc code analysis shows 0 unresolved vulnerabilities.
There are 0 security hotspots that need review.
License
jwarc is licensed under the Apache-2.0 License. This license is Permissive.
Permissive licenses have the least restrictions, and you can use them in most projects.
Reuse
jwarc releases are available to install and integrate.
Deployable package is available in Maven.
Build file is available. You can build the component from source.
Installation instructions, examples and code snippets are available.
It has 7897 lines of code, 596 functions and 99 files.
It has medium code complexity. Code complexity directly impacts maintainability of the code.
Top functions reviewed by kandi - BETA
kandi has reviewed jwarc and discovered the below as its top functions. This is intended to give you an instant insight into jwarc implemented functionality, and help decide if they suit your requirements.
- Main entry point
- Runs the parser
- Command - line entry point
- Parse a media type string
- Dedupe tool
- Determines the output path of a file
- Determine a WarcRecord to use
- Deduplicate a Warc file
- Responds to the archive
- Iterates over the CdxReader
- Creates a channel to a channel using the given prefix
- Serialize this HTTP header to the output stream
- Serialize this HTTP status message to an Appendable
- Parses the http response captured by this record
- Reads the body to the digest
- Returns the array of illegal characters
- Handles incoming request
- Runs test tool
- Writes the timestamps to the archive
- Initializes the default class constructor
- Reads bytes into destination
- Consume body
- Test program
- Reads bytes into the destination
- Writes a byte buffer
- Creates a JavaScript handler for a resource
Get all kandi verified functions for this library.
jwarc Key Features
No Key Features are available at this moment for jwarc.
jwarc Examples and Code Snippets
No Code Snippets are available at this moment for jwarc.
Community Discussions
No Community Discussions are available at this moment for jwarc.Refer to stack overflow page for discussions.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install jwarc
Experimental native binary versions of the jwarc command-line tool are now available from the releases page for Linux and MacOS. These start faster than the cross-platform jar version and do not require Java to be installed.
Support
For any new features, suggestions and bugs create an issue on GitHub.
If you have any questions check and ask questions on community page Stack Overflow .
Find more information at:
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page