Dependencies
This program is written in Python 3, so Python 3 must be installed. You also need to install the following packages with pip3:
Details
This program detects plagiarism using the MinHash algorithm.
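For readers unfamiliar with MinHash, the following is a minimal sketch of the general technique, not this program's actual implementation. The function names, the 5-character shingle size, and the use of 64 seeded BLAKE2b hashes are all assumptions chosen for illustration.

```python
import hashlib
import re


def shingles(text, k=5):
    """Return the set of k-character shingles of whitespace-normalized text."""
    normalized = re.sub(r"\s+", " ", text).strip()
    if len(normalized) < k:
        return {normalized} if normalized else set()
    return {normalized[i:i + k] for i in range(len(normalized) - k + 1)}


def _seeded_hash(shingle, seed):
    """Hash a shingle to a 64-bit integer; each seed acts as a different hash function."""
    data = f"{seed}:{shingle}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(items, num_hashes=64):
    """For each hash function, keep the smallest hash value over all items."""
    return [
        min((_seeded_hash(item, seed) for item in items), default=0)
        for seed in range(num_hashes)
    ]


def estimate_similarity(sig_a, sig_b):
    """Fraction of matching signature slots, an estimate of Jaccard similarity."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)
```

Two files would be compared by building a signature for each and calling `estimate_similarity` on the pair. More hash functions give a tighter estimate at the cost of more computation.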
Usage
Run the program as shown below (the same help text is available via ./plagiarism.py -h).
Terminology
template file: A file that is distributed to everyone in common. For instance, a hello world example always includes certain boilerplate; the template file is used to remove such shared content. This value should be given as
~/dir1/dir2/template.c.
output file: The location where the CSV file with the full comparison results is stored.
This value should be given as
~/dir1/dir2/output.csv.
working path: The path that contains all the files you want to compare.
This value should be given as
~/dir1/.
remove regex: A pattern matching the content the user wants to delete. Typically, it is used to strip comments from the source code.
summary file: The location where the CSV file with the comparison summary is stored.
This value should be given as
~/dir1/dir2/summary.csv.
graph weight: The threshold value that determines which targets are drawn in the graph.
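To make the template file and remove regex terms more concrete, here is a rough sketch of how such preprocessing might work. It is an illustration under assumptions, not the program's own code: the helper names `strip_pattern` and `strip_template`, the line-by-line template matching, and the `C_COMMENTS` pattern are all hypothetical.

```python
import re

# Hypothetical remove regex for C-style comments. It is deliberately simple
# and would also match comment markers that appear inside string literals.
C_COMMENTS = r"//[^\n]*|/\*.*?\*/"


def strip_pattern(source, remove_regex):
    """Delete every match of remove_regex from the source text."""
    return re.sub(remove_regex, "", source, flags=re.DOTALL)


def strip_template(source, template):
    """Drop source lines that also appear (ignoring indentation) in the template."""
    template_lines = {
        line.strip() for line in template.splitlines() if line.strip()
    }
    kept = [
        line for line in source.splitlines()
        if line.strip() not in template_lines
    ]
    return "\n".join(kept)


template = "#include <stdio.h>\nint main() {\n    return 0;\n}\n"
submission = (
    "#include <stdio.h>\n"
    "// my solution\n"
    "int main() {\n"
    '    printf("hi"); /* greet */\n'
    "    return 0;\n"
    "}\n"
)

# Remove comments first, then remove the shared template lines.
cleaned = strip_template(strip_pattern(submission, C_COMMENTS), template)
print(cleaned.strip())
```

After both steps, only the content unique to the submission remains, so shared boilerplate and comments do not inflate the similarity between files.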




