jplag | JPlag - Detecting Software Plagiarism | Computer Vision library
kandi X-RAY | jplag Summary
JPlag - Detecting Software Plagiarism
Top functions reviewed by kandi - BETA
- Checks if the token stream is a binary operator
- Finds the index of the next operator
- Checks if token is a character from bit set
- Fills the token stream
- Starts the JPlag
- Calculate the community strength of a list of integers
- Builds the options instance from the given namespace
- Compute the hashes for all subsequences in the specified window
- Adds the start index to the map
- Performs the clustering
- Visits a terminal node
- Converts a match into a report match
- Overrides the visitor to look for methods
- Checks if the next op is next to the token stream
- Compares the submissions
- Visits a new array
- Prints all errors
- Checks if the token is an operator
- Visits an if node
- Returns true if the token stream is a prefix operator
- Returns true if the token is a postfix operator
- Preprocess the similarity matrix
- Parse the command line argument parser with the given parser
- Overrides the visitor to collect classes
- Determines whether the given token stream starts at the given index
- Reads a character from the stream
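One entry in the list above describes computing hashes for all subsequences in a window. As a rough illustration of that general idea, here is a minimal rolling-hash sketch over an array of token IDs. It is not JPlag's actual code: the class name `WindowHashes`, the method `hashWindows`, the base 31, and the sample token values are all assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration; not taken from the JPlag code base.
public class WindowHashes {
    // Assumed polynomial base; any small odd constant would work here.
    private static final int BASE = 31;

    /**
     * Returns one hash per contiguous subsequence of length windowSize.
     * Each hash after the first is derived from the previous one in O(1).
     * Integer overflow wraps around, which keeps the arithmetic consistent.
     */
    public static List<Integer> hashWindows(int[] tokens, int windowSize) {
        List<Integer> hashes = new ArrayList<>();
        if (windowSize <= 0 || tokens.length < windowSize) {
            return hashes; // no complete window fits
        }

        // BASE^(windowSize - 1): the weight of the token leaving the window.
        int highestPower = 1;
        for (int i = 1; i < windowSize; i++) {
            highestPower *= BASE;
        }

        // Hash of the first window, computed directly.
        int hash = 0;
        for (int i = 0; i < windowSize; i++) {
            hash = hash * BASE + tokens[i];
        }
        hashes.add(hash);

        // Slide the window: drop the oldest token, shift, add the newest.
        for (int i = windowSize; i < tokens.length; i++) {
            hash = (hash - tokens[i - windowSize] * highestPower) * BASE + tokens[i];
            hashes.add(hash);
        }
        return hashes;
    }

    public static void main(String[] args) {
        // Made-up token IDs; the first and last windows hold the same tokens.
        int[] tokens = {3, 1, 4, 1, 5, 3, 1, 4};
        System.out.println(hashWindows(tokens, 3));
    }
}
```

Identical token subsequences always produce identical hashes, so equal values can be used to look up candidate matches quickly. Different subsequences can collide, so a real matcher would still compare the tokens themselves before reporting a match.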
Community Discussions
Trending Discussions on jplag
QUESTION
I need to develop an application that detects plagiarism in source code using a cognitive system such as Watson. At the moment I am using JPlag (https://github.com/jplag/jplag) to analyze source code and produce tokenized files.
Here is my problem. Watson is widely used for analyzing unstructured files, whereas tokenized files are highly structured and cannot be analyzed with most of the services IBM Watson provides. For example, Watson Discovery can extract sentiment or topics from an email or publication and find links between the files that make up the corpus. Unfortunately, Watson cannot read the tokenized files for this purpose (or I do not know how to make it do so).
I have found Watson Analytics, which should be able to find correlations in numeric and structured files, but I can't access the trial, so I can't test it.
I wonder whether I can use Watson Analytics for my purpose. Should I keep pursuing this approach, or have I missed a better option?
ANSWER
Answered 2017-Sep-26 at 12:46
Have you tried Watson Content Analytics? It supports tokenization and will hopefully help you find plagiarism in the source code files.
I am not 100% sure it is what you are looking for, but here is more information about it: https://www.ibm.com/support/knowledgecenter/en/SS5RWK_3.5.0/com.ibm.discovery.es.ta.doc/iiysputai.htm
Watson Analytics, however, might not be a good fit for your project.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install jplag
You can use jplag like any standard Java library. Include the jar files in your classpath. You can also use any IDE to run and debug the jplag component as you would any other Java program. Best practice is to use a build tool that supports dependency management, such as Maven or Gradle. For Maven installation, refer to maven.apache.org. For Gradle installation, refer to gradle.org.