dc.contributor.author | Staš, Ján | |
dc.contributor.author | Hládek, Daniel | |
dc.contributor.author | Juhár, Jozef | |
dc.contributor.author | Zlacký, Daniel | |
dc.date.accessioned | 2012-12-06T10:04:56Z | |
dc.date.available | 2012-12-06T10:04:56Z | |
dc.date.issued | 2012 | |
dc.identifier.citation | Advances in electrical and electronic engineering. 2012, vol. 10, no. 4, p. 291-296. | cs |
dc.identifier.issn | 1804-3119 | |
dc.identifier.uri | http://hdl.handle.net/10084/95813 | |
dc.description.abstract | The inflection of the Slovak language causes a large number of unique word forms, which produces not only a large vocabulary, but also a number of out-of-vocabulary words. Morph-based language models solve this problem by decomposition of inflected word forms into small sub-word units and resolve the general problem of sparsity the training data. In this paper, we present several rule-based and data-driven approaches to the automatic segmentation of words into morphs. These data are later used in the modeling of the Slovak language for large vocabulary continuous speech recognition. Preliminary results show a significant decrease in the number of out-of-vocabulary words and reduction of resultant language model perplexity. | cs |
dc.format.extent | 508017 bytes | cs |
dc.format.mimetype | application/pdf | cs |
dc.language.iso | en | cs |
dc.publisher | Vysoká škola báňská - Technická univerzita Ostrava | cs |
dc.relation.ispartofseries | Advances in electrical and electronic engineering | cs |
dc.relation.uri | http://advances.utc.sk/index.php/AEEE/article/view/717/812 | cs |
dc.rights | © Vysoká škola báňská - Technická univerzita Ostrava | |
dc.rights | Creative Commons Attribution 3.0 Unported (CC BY 3.0) | |
dc.subject | Automatic word segmentation | cs |
dc.subject | language modeling | cs |
dc.subject | morphological analysis | cs |
dc.subject | speech recognition | cs |
dc.title | Analysis of morph-based language modeling and speech recognition in Slovak | cs |
dc.type | article | cs |
dc.rights.access | openAccess | |
dc.type.version | publishedVersion | cs |
dc.type.status | Peer-reviewed | cs |