Předzpracování zdrojových kódů pro účely detekce plagiátů

Loading...
Thumbnail Image

Downloads

2

Date issued

Authors

Havlas, Jan

Journal Title

Journal ISSN

Volume Title

Publisher

Vysoká škola báňská - Technická univerzita Ostrava

Location

Signature

Abstract

This work deals with plagiarism and plagiarism software. The objective is to understand what these terms mean and generally become familiar with this issue. The next step is to design and implement an application that would preprocess source codes of programs. The output will be tokenized or n-gram document, which is used for subsequent detection of plagiarism. The application will work with source code in C# and besides tokenized output will also have the opportunity tokenized code to compare with others. The last step will be to test this application on the selected data collection. In this case it is the student projects of VSB and then will be evaluation.

Description

Import 22/07/2015

Subject(s)

Preprocessing, Comparison, Source code, Detection, Tokenization, Levenshtein, Ngram, Plagiarism, Plagiarism software, Visualization of results

Citation