Detekce klíčových slov v odborných článcích

Blažek, Ondřej

dc.contributor.advisor	Kudělka, Miloš	cs
dc.contributor.author	Blažek, Ondřej	cs
dc.date.accessioned	2013-06-26T11:20:25Z
dc.date.available	2013-06-26T11:20:25Z
dc.date.issued	2013	cs
dc.identifier.other	OSD002	cs
dc.identifier.uri	http://hdl.handle.net/10084/99015
dc.description	Import 26/06/2013	cs
dc.description.abstract	Předmětem této bakalářské práce je jedna typická úloha vědecké disciplíny zvané dolování z textu (text mining). Konkrétně tedy detekce klíčových slov dokumentů, jenž mohou sloužit například pro rozdělení dokumentů do kategorií. Teoretická část je rozdělena na dvě části, kde část první je věnována základním pojmům a jejich objasnění v této problematice. Jedná se především o způsob, jak vhodně reprezentovat dokumenty ve vektorovém prostoru. Druhá část se věnuje průzkumu existujících metod pro určení kategorií dokumentů a detekci klíčových slov, na jejichž základě jsou především tyto kategorie sloučeny.	cs
dc.description.abstract	The subject of this thesis is one typical role of a scientific discipline called text mining. Specifically it is a keyword spotting documents, which can be used for example for the distribution of documents into categories. The theoretical part is divided into two parts where the first part is devoted to the basic concepts and explains them in this issue. This is essentially a way to properly represent documents in a vector space. The second part deals with the exploration of existing methods for determining the categories of documents and keywords detection on the basis of those categories are merged. An important part of the work is its own implementation, which describes the steps of my process. For example we can find here steps to create a vector that will represent the document and clustering a set of documents into a given number of categories, based on their similarity. This clustering is used as a tool for categorization, which subsequently due to frequency analysis, keywords of categories are detected.	en
dc.format.extent	3873080 bytes	cs
dc.format.mimetype	application/pdf	cs
dc.language.iso	cs	cs
dc.publisher	Vysoká škola báňská - Technická univerzita Ostrava	cs
dc.subject	kategorizace, tématizace, dolování textu, klíčová slova	cs
dc.subject	categorization , thematization , text mining , key words	en
dc.title	Detekce klíčových slov v odborných článcích	cs
dc.title.alternative	Keywords Detection in Research Papers	en
dc.type	Bakalářská práce	cs
dc.contributor.referee	Horák, Zdeněk	cs
dc.date.accepted	2013-06-05	cs
dc.thesis.degree-name	Bc.	cs
dc.thesis.degree-level	Bakalářský studijní program	cs
dc.thesis.degree-grantor	Vysoká škola báňská - Technická univerzita Ostrava. Fakulta elektrotechniky a informatiky	cs
dc.description.department	460 - Katedra informatiky	cs
dc.thesis.degree-program	Informační a komunikační technologie	cs
dc.thesis.degree-branch	Informatika a výpočetní technika	cs
dc.description.result	velmi dobře	cs
dc.identifier.sender	S2724	cs
dc.identifier.thesis	BLA0045_FEI_B2647_2612R025_2013
dc.rights.access	openAccess

Soubory tohoto záznamu

Název:: BLA0045_FEI_B2647_2612R025_2013.pdf
Velikost:: 3.693Mb
Formát:: PDF

Zobrazit/otevřít

Název:: BLA0045_FEI_B2647_2612R025_201 ...
Velikost:: 568.7Kb
Formát:: Neznámý

Zobrazit/otevřít

Název:: BLA0045_FEI_B2647_2612R025_201 ...
Velikost:: 49.41Kb
Formát:: PDF
Popis:: Posudek vedoucího – Kudělka, Miloš

Zobrazit/otevřít

Název:: BLA0045_FEI_B2647_2612R025_201 ...
Velikost:: 49.61Kb
Formát:: PDF
Popis:: Posudek oponenta – Horák, Zdeněk

Zobrazit/otevřít

Tento záznam se objevuje v následujících kolekcích

Vysokoškolské kvalifikační práce Fakulty elektrotechniky a informatiky / Theses and dissertations of Faculty of Electrical Engineering and Computer Science (FEI) [13253]
Kolekce obsahuje vysokoškolské kvalifikační práce Fakulty elektrotechniky a informatiky.

Zobrazit minimální záznam