Zobrazit minimální záznam

dc.contributor.authorStaš, Ján
dc.contributor.authorHládek, Daniel
dc.contributor.authorJuhár, Jozef
dc.contributor.authorZlacký, Daniel
dc.date.accessioned2012-12-06T10:04:56Z
dc.date.available2012-12-06T10:04:56Z
dc.date.issued2012
dc.identifier.citationAdvances in electrical and electronic engineering. 2012, vol. 10, no. 4, p. 291-296.cs
dc.identifier.issn1804-3119
dc.identifier.urihttp://hdl.handle.net/10084/95813
dc.description.abstractThe inflection of the Slovak language causes a large number of unique word forms, which produces not only a large vocabulary, but also a number of out-of-vocabulary words. Morph-based language models solve this problem by decomposition of inflected word forms into small sub-word units and resolve the general problem of sparsity the training data. In this paper, we present several rule-based and data-driven approaches to the automatic segmentation of words into morphs. These data are later used in the modeling of the Slovak language for large vocabulary continuous speech recognition. Preliminary results show a significant decrease in the number of out-of-vocabulary words and reduction of resultant language model perplexity.cs
dc.format.extent508017 bytescs
dc.format.mimetypeapplication/pdfcs
dc.language.isoencs
dc.publisherVysoká škola báňská - Technická univerzita Ostravacs
dc.relation.ispartofseriesAdvances in electrical and electronic engineeringcs
dc.relation.urihttp://advances.utc.sk/index.php/AEEE/article/view/717/812cs
dc.rights© Vysoká škola báňská - Technická univerzita Ostrava
dc.rightsCreative Commons Attribution 3.0 Unported (CC BY 3.0)
dc.subjectAutomatic word segmentationcs
dc.subjectlanguage modelingcs
dc.subjectmorphological analysiscs
dc.subjectspeech recognitioncs
dc.titleAnalysis of morph-based language modeling and speech recognition in Slovakcs
dc.typearticlecs
dc.rights.accessopenAccess
dc.type.versionpublishedVersioncs
dc.type.statusPeer-reviewedcs


Soubory tohoto záznamu

Tento záznam se objevuje v následujících kolekcích

  • AEEE. 2012, vol. 10 [57]
  • OpenAIRE [2318]
    Kolekce určená pro sklízení infrastrukturou OpenAIRE; obsahuje otevřeně přístupné publikace, případně další publikace, které jsou výsledkem projektů rámcových programů Evropské komise (7. RP, H2020).

Zobrazit minimální záznam