Předzpracování zdrojových kódů pro účely detekce plagiátů
Loading...
Downloads
2
Date issued
Authors
Havlas, Jan
Journal Title
Journal ISSN
Volume Title
Publisher
Vysoká škola báňská - Technická univerzita Ostrava
Location
Signature
Abstract
This work deals with plagiarism and plagiarism software.
The objective is to understand what these terms mean and generally become familiar with this issue.
The next step is to design and implement an application that would preprocess source codes of programs.
The output will be tokenized or n-gram document, which is used for subsequent detection of plagiarism.
The application will work with source code in C# and besides tokenized output will also have the opportunity tokenized code to compare with others.
The last step will be to test this application on the selected data collection.
In this case it is the student projects of VSB and then will be evaluation.
Description
Import 22/07/2015
Subject(s)
Preprocessing, Comparison, Source code, Detection, Tokenization, Levenshtein, Ngram, Plagiarism, Plagiarism software, Visualization of results