Geometrical and topological approaches to Big Data

Snášel, Václav; Nowaková, Jana; Xhafa, Fatos; Barolli, Leonard

dc.contributor.author	Snášel, Václav
dc.contributor.author	Nowaková, Jana
dc.contributor.author	Xhafa, Fatos
dc.contributor.author	Barolli, Leonard
dc.date.accessioned	2017-01-05T12:35:05Z
dc.date.available	2017-01-05T12:35:05Z
dc.date.issued	2017
dc.identifier.citation	Future Generation Computer Systems. 2017, vol. 67, p. 286-296.	cs
dc.identifier.issn	0167-739X
dc.identifier.issn	1872-7115
dc.identifier.uri	http://hdl.handle.net/10084/116567
dc.description.abstract	Modern data science uses topological methods to find the structural features of data sets before further supervised or unsupervised analysis. Geometry and topology are very natural tools for analysing massive amounts of data since geometry can be regarded as the study of distance functions. Mathematical formalism, which has been developed for incorporating geometric and topological techniques, deals with point cloud data sets, i.e. finite sets of points. It then adapts tools from the various branches of geometry and topology for the study of point cloud data sets. The point clouds are finite samples taken from a geometric object, perhaps with noise. Topology provides a formal language for qualitative mathematics, whereas geometry is mainly quantitative. Thus, in topology, we study the relationships of proximity or nearness, without using distances. A map between topological spaces is called continuous if it preserves the nearness structures. Geometrical and topological methods are tools allowing us to analyse highly complex data. These methods create a summary or compressed representation of all of the data features to help to rapidly uncover particular patterns and relationships in data. The idea of constructing summaries of entire domains of attributes involves understanding the relationship between topological and geometric objects constructed from data using various features. A common thread in various approaches for noise removal, model reduction, feasibility reconstruction, and blind source separation, is to replace the original data with a lower dimensional approximate representation obtained via a matrix or multi-directional array factorization or decomposition. Besides those transformations, a significant challenge of feature summarization or subset selection methods for Big Data will be considered by focusing on scalable feature selection. Lower dimensional approximate representation is used for Big Data visualization. 
The cross-field between topology and Big Data will bring huge opportunities, as well as challenges, to Big Data communities. This survey aims at bringing together state-of-the-art research results on geometrical and topological methods for Big Data.	cs
dc.format.extent	1185851 bytes
dc.format.mimetype	application/pdf
dc.language.iso	en	cs
dc.publisher	Elsevier	cs
dc.relation.ispartofseries	Future Generation Computer Systems	cs
dc.relation.uri	http://dx.doi.org/10.1016/j.future.2016.06.005	cs
dc.rights	© 2016 The Author(s). Published by Elsevier B.V.	cs
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	cs
dc.subject	Industry 4.0	cs
dc.subject	big data	cs
dc.subject	topological data analysis	cs
dc.subject	persistent homology	cs
dc.subject	dimensionality reduction	cs
dc.subject	big data visualization	cs
dc.title	Geometrical and topological approaches to Big Data	cs
dc.type	article	cs
dc.identifier.doi	10.1016/j.future.2016.06.005
dc.rights.access	openAccess
dc.type.version	publishedVersion	cs
dc.type.status	Peer-reviewed	cs
dc.description.source	Web of Science	cs
dc.description.volume	67	cs
dc.description.lastpage	296	cs
dc.description.firstpage	286	cs
dc.identifier.wos	000389555700023

Files in this item

Name: 0167-739X-2017v67p286.pdf
Size: 1.130 MB
Format: PDF


This item appears in the following collections

Publications of VŠB-TUO in Web of Science [7798]
The collection contains bibliographic records of articles by VŠB-TUO academic staff published in journals indexed in Web of Science from 1990 to the present.
Articles from Impact Factor Journals [6377]
Journal articles (from 2008 onward) from journals that had an impact factor at the time the article was published.
OpenAIRE [5085]
A collection intended for harvesting by the OpenAIRE infrastructure; it contains open-access publications and, where applicable, other publications resulting from projects of the European Commission framework programmes (FP7, H2020, Horizon Europe).
Publications of Department of Computer Science (460) [562]
The collection contains bibliographic records of publications (articles) by academic staff of the Department of Computer Science (460) in journals and in Lecture Notes in Computer Science registered in Web of Science from 2003 to the present.
