gecco | Easy to use lightweight web crawler | Crawler library
kandi X-RAY | gecco Summary
Easy to use lightweight web crawler
Top functions reviewed by kandi - BETA
- Perform a download
- To byte array input stream
- Convert relativePath to absolutePath
- Get the content from an input stream
- Run spider
- Matches a regex against a regex
- Returns the spider for the given request
- Perform the HTTP download
- Get a sub-request
- Starts the request queue
- Returns the HTTP request
- Intercept the method invocation
- Create a HtmlRender object
- Checks if content is HTML table
- Get the proxy
- Create downloader
- Add a JSON path annotation
- Entry point for testing
- Removes a field
- Create a JSONRender instance
- Compares this object to another
- Inject a SpiderBean
- Perform an HTTP request
- Main entry point
- Pushes a request to the queue
- Intercept the download request
Community Discussions
Trending Discussions on gecco
QUESTION
I am looking at this project: https://github.com/xtuhcy/gecco
It includes some demo code: https://github.com/xtuhcy/gecco/blob/master/src/test/java/com/geccocrawler/gecco/demo/MyGithub.java
I cloned the repo.
I'm trying to run the demo code but I can't figure out how.
I don't really understand how to use maven.
I managed to install maven and install the dependencies for the project.
mvn verify and mvn test both run without error.
I found this article:
http://www.vineetmanohar.com/2009/11/3-ways-to-run-java-main-from-maven/
Which suggests doing something like this:
mvn exec:java -Dexec.mainClass="com.vineetmanohar.module.Main"
But I can't figure out what the value of mainClass should be.
I have tried:
...
ANSWER
Answered 2019-Nov-10 at 10:28
Ugh, you've done everything right. The issue is that the class you're trying to run is in src/test/java... and not in src/main/java/. I'll explain.
Maven is a fairly advanced build tool. It distinguishes between sources that will eventually reach "production" and the tests that are executed during the build process.
More specifically, it provides different classpaths for them: for tests, for example, you'll want a testing framework on the classpath, while for production it's irrelevant.
Tests should be tests, not something with a "main" method (unlike regular classes, tests do not have a main method and are driven by a testing framework). This is Maven's "by-the-book" approach.
But this project, for some reason, keeps classes with a main method in the folder that should contain tests (I admit, I've seen this from time to time in other projects as well).
So you need to run the class with the test "classpath" (otherwise Maven won't see this class at all). Here is how it's done:
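The command that followed is not preserved in this copy, so what follows is only a sketch and not the answerer's exact text. It assumes the exec-maven-plugin's `exec.mainClass` and `exec.classpathScope` user properties; the helper class `RunTestScopedMain` and its methods are hypothetical names introduced for illustration. It shows how the value of mainClass can be derived from the demo file's path in the repository, and when the test classpath scope has to be requested.

```java
import java.util.List;

public class RunTestScopedMain {

    // Source roots that Maven uses by convention.
    private static final List<String> SOURCE_ROOTS =
            List.of("src/main/java/", "src/test/java/");

    /** Derives the fully qualified class name from a source file path. */
    public static String toMainClass(String sourcePath) {
        String normalized = sourcePath.replace('\\', '/');
        for (String root : SOURCE_ROOTS) {
            int idx = normalized.indexOf(root);
            if (idx >= 0 && normalized.endsWith(".java")) {
                // Drop the source root and the ".java" suffix, then turn folders into packages.
                String relative = normalized.substring(
                        idx + root.length(), normalized.length() - ".java".length());
                return relative.replace('/', '.');
            }
        }
        throw new IllegalArgumentException("Not under a Maven source root: " + sourcePath);
    }

    /** True when the class lives under src/test/java and so needs the test classpath. */
    public static boolean needsTestClasspath(String sourcePath) {
        return sourcePath.replace('\\', '/').contains("src/test/java/");
    }

    /** Builds the mvn command line; the classpath scope is added only for test sources. */
    public static String buildCommand(String sourcePath) {
        StringBuilder cmd = new StringBuilder("mvn exec:java -Dexec.mainClass=\"")
                .append(toMainClass(sourcePath))
                .append('"');
        if (needsTestClasspath(sourcePath)) {
            // Assumption: exec-maven-plugin property that selects the test classpath.
            cmd.append(" -Dexec.classpathScope=test");
        }
        return cmd.toString();
    }

    public static void main(String[] args) {
        // Path taken from the demo URL in the question.
        System.out.println(
                buildCommand("src/test/java/com/geccocrawler/gecco/demo/MyGithub.java"));
    }
}
```

Running the sketch prints the command to try from the project root. The test classes must already be compiled, which the earlier mvn test run takes care of.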
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install gecco
You can use gecco like any standard Java library. Include the jar files in your classpath. You can use any IDE to run and debug the gecco component as you would any other Java program. Best practice is to use a build tool that supports dependency management, such as Maven or Gradle. For Maven installation, refer to maven.apache.org. For Gradle installation, refer to gradle.org.