Shlukování na základě hustoty pro velká data

Bill, Vojtěch

Shlukování na základě hustoty pro velká data

Files

BIL0059_FEI_N2647_2612T025_2018.pdf (3.17 MB)

BIL0059_FEI_N2647_2612T025_2018_priloha.zip (19.98 MB)

BIL0059_FEI_N2647_2612T025_2018_posudek_vedouci_Platos_Jan.pdf (51.32 KB)

BIL0059_FEI_N2647_2612T025_2018_posudek_oponent_Drazdilova_Pavla.pdf (48.83 KB)

Downloads

17

Date issued

2018

Authors

Bill, Vojtěch

Publisher

Vysoká škola báňská - Technická univerzita Ostrava

Abstract

This diploma thesis focuses on clustering with special interest in density based cluster analysis for big data. In the beginnig, there is a theory behind clustering and mainly behind density based cluster analysis and the DBSCAN algorithm. Significant part of the first half of this theses consists of the data structures for efficient data storage and quering. In the second part, we propose our own version of DBSCAN with kd-tree used as a data structure and with parallel aproach of some of DBSCAN’s steps. We than measure the impact of parallelizing the DBSCAN algorithm and compare the basic approach of querying data using brute force in contrast to kd-tree. In the final part we propose possible enhancements and functionality for further improvement.

Subject(s)

clustering, DBSCAN, data structure, k-d tree, parallelization, OpenMP

Item identifier

http://hdl.handle.net/10084/128336

Collections

Vysokoškolské kvalifikační práce Fakulty elektrotechniky a informatiky / Theses and dissertations of Faculty of Electrical Engineering and Computer Science (FEI)

Show full item record

Shlukování na základě hustoty pro velká data

Files

Downloads

Date issued

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Location

Signature

Abstract

Description

Subject(s)

Citation

Item identifier

Collections