Neural Networks in High Performance Computing Environment

Vojáček, Lukáš

dc.contributor.advisor	Dvorský, Jiří	cs
dc.contributor.author	Vojáček, Lukáš	cs
dc.date.accessioned	2015-11-04T08:59:26Z
dc.date.available	2015-11-04T08:59:26Z
dc.date.issued	2015	cs
dc.identifier.other	OSD002	cs
dc.identifier.uri	http://hdl.handle.net/10084/110921
dc.description	Import 04/11/2015	cs
dc.description.abstract	Roste příležitost analyzovat velké datové zdroje, ale uvědomili jsme si, že existuje nedostatek efektivních úloh zabývající dolováním dat se zaměřením na velké řídké datové soubory s velikou dimenzí. Tato práce se zaměřuje na tuto problematiku a na efektivní shlukování pomocí modelů umělé inteligence. Pro snížení dimenze dat, navrhujeme efektivní shlukovací metody, který využívá vlastnosti neuronových sítí, zaměřujeme se na Self Organizing Maps (SOM) a Growing Neural Gas (GNG). Problémy s časovou náročností řešíme použitím paralelizace SOM a GNG algoritmů. Zaměřili jsme se na zrychlení uvedených algoritmů a dále že budou vhodné pro kolekce dat s určitou úrovní řídkost. Efektivní zrychlení je dosaženo zdokonalením fáze, která se zabývá hledání vítězného neuronu a dále fáze, která aktualizuje váhy u neuronů. Prezentované výsledky ukazují dostatečné zrychlení a to i při zachování kvality výstupní neuronové sítě. Dále navrhujeme opět paralelizace SOM a GNG algoritmů, ale nyní bez omezení na totožné výsledky jako u standardní algoritmů. U SOM algoritmu se jedná o kombinaci Euklidovy vzdálenosti a kosinovy podobnosti u obou zmíněných fází. Druhou možností u fáze aktualizací vah je kombinace standardního SOM algoritmu s batch SOM algoritmem. U GNG algoritmu se jedná o předzpracování dat SOM algoritmem s kombinací Spanning tree algoritmem. Nakonec jsme se zaměřili na možnost migrace paralelních aplikací napsaných v jazyce C\# mezi Windows HPC a Linux HPC, včetně možných časových ztrát.	cs
dc.description.abstract	With increasing opportunities for analysing large data sources, we have noticed a lack of effective processing in data mining tasks working with large sparse datasets of high dimensions. This work focuses on this issue and on effective clustering using models of artificial intelligence. We propose an effective clustering algorithm to exploit the features of neural networks, especially Self Organizing Maps (SOM) and Growing Neural Gas (GNG), for the reduction of data dimensionality. The issue of computational complexity is resolved by using a parallelization of the standard SOM and GNG algorithm. We have focused on the acceleration of the presented algorithm using a version suitable for data collections with a certain level of sparsity. Effective acceleration is achieved by improving the winning neuron finding phase and the weight actualization phase. The output presented here demonstrates sufficient acceleration of the standard SOM and GNG algorithm while preserving the appropriate accuracy. Next, we again proposed a parallelization of the SOM and GNG algorithms, but now with out restrictions to identical results as in the case of the standard algorithms. In the SOM algorithm there is a combination of Euclidean distance and cosine similarity in the winning neuron finding phase and the weight actualization phase. Another option is a combination of standard SOM and batch SOM in the weight actualization phase. In the GNG algorithm there is a pre-processing by Self Organizing Maps with Spanning tree algorithm. Finally, we focused on the possibility of migration of parallel applications written in C# language between Windows HPC and Linux HPC, including a study of possible time losses.	en
dc.format	95 s. : il.	cs
dc.format.extent	1880474 bytes	cs
dc.format.mimetype	application/pdf	cs
dc.language.iso	cs	cs
dc.publisher	Vysoká škola báňská - Technická univerzita Ostrava	cs
dc.subject	neuronové sítě, Self Organizing Maps, Growing Neural Gas, high dimension datasets, large sparse datasets, high performance computing, shlukování	cs
dc.subject	neural networks, Self Organizing Maps, Growing Neural Gas, high dimension datasets, large sparse datasets, high performance computing, clustering	en
dc.title	Neural Networks in High Performance Computing Environment	cs
dc.title.alternative	Neuronové sítě v prostředí High Performance Computing	cs
dc.type	Disertační práce	cs
dc.identifier.signature	201500931	cs
dc.identifier.location	ÚK/Sklad diplomových prací	cs
dc.contributor.referee	Vondrák, Ivo	cs
dc.contributor.referee	Saeed, Khalid	cs
dc.contributor.referee	Karagoz, Pinar	cs
dc.date.accepted	2015-10-16	cs
dc.thesis.degree-name	Ph.D.	cs
dc.thesis.degree-level	Doktorský studijní program	cs
dc.thesis.degree-grantor	Vysoká škola báňská - Technická univerzita Ostrava. Fakulta elektrotechniky a informatiky	cs
dc.description.category	Prezenční	cs
dc.description.department	460 - Katedra informatiky	cs
dc.thesis.degree-program	Informatika, komunikační technologie a aplikovaná matematika	cs
dc.thesis.degree-branch	Informatika	cs
dc.description.result	vyhověl	cs
dc.identifier.sender	S2724	cs
dc.identifier.thesis	VOJ189_FEI_P1807_1801V001_2015
dc.rights.access	openAccess

Soubory tohoto záznamu

Název:: VOJ189_FEI_P1807_1801V001_2015.PDF
Velikost:: 1.793Mb
Formát:: PDF

Zobrazit/otevřít

Název:: VOJ189_FEI_P1807_1801V001_2015 ...
Velikost:: 1.723Mb
Formát:: PDF

Zobrazit/otevřít

Název:: VOJ189_FEI_P1807_1801V001_2015 ...
Velikost:: 99.80Kb
Formát:: PDF
Popis:: Posudek oponenta – Karagoz, Pinar

Zobrazit/otevřít

Název:: VOJ189_FEI_P1807_1801V001_2015 ...
Velikost:: 110.6Kb
Formát:: PDF
Popis:: Posudek oponenta – Saeed, Khalid

Zobrazit/otevřít

Název:: VOJ189_FEI_P1807_1801V001_2015 ...
Velikost:: 83.09Kb
Formát:: PDF
Popis:: Posudek oponenta – Vondrák, Ivo

Zobrazit/otevřít

Tento záznam se objevuje v následujících kolekcích

Vysokoškolské kvalifikační práce Fakulty elektrotechniky a informatiky / Theses and dissertations of Faculty of Electrical Engineering and Computer Science (FEI) [13253]
Kolekce obsahuje vysokoškolské kvalifikační práce Fakulty elektrotechniky a informatiky.

Zobrazit minimální záznam