dc.contributor.author: Dubey, Rajesh Kumar
dc.contributor.author: Kumar, Arun
dc.date.accessioned: 2017-11-30T07:25:14Z
dc.date.available: 2017-11-30T07:25:14Z
dc.date.issued: 2017
dc.identifier.citation: Advances in electrical and electronic engineering. 2017, vol. 15, no. 3, p. 400-407 : ill. [cs]
dc.identifier.issn: 1336-1376
dc.identifier.issn: 1804-3119
dc.identifier.uri: http://hdl.handle.net/10084/122089
dc.description.abstract: Single time-instance features, where the entire speech utterance is used for feature computation, are neither accurate nor adequate for capturing the time-localized information of short-time transient distortions and distinguishing them from the plosive sounds of speech, particularly when the speech is degraded by impulsive noise. Features are therefore estimated at multiple time-instances. Only the active speech segments of the degraded speech are used, with features computed at multiple time-instances on a per-frame basis. Here, active speech means both voiced and unvoiced frames, excluding silence. The features of different combinations of multiple contiguous active speech segments are computed and called multiple time-instances features. A GMM is trained jointly on these features and the subjective MOS of the corresponding speech utterance to obtain the GMM parameters. These parameters and the multiple time-instances features of the test speech are used to compute the objective MOS values of the different combinations of multiple contiguous active speech segments. The overall objective MOS of the test speech utterance is obtained by assigning equal weight to these objective MOS values. The algorithm outperforms Recommendation ITU-T P.563 and recently published algorithms. [cs]
dc.format.extent: 862748 bytes
dc.format.mimetype: application/pdf
dc.language: Not specified [cs]
dc.language.iso: en [cs]
dc.publisher: Vysoká škola báňská - Technická univerzita Ostrava [cs]
dc.relation.ispartofseries: Advances in electrical and electronic engineering [cs]
dc.relation.uri: http://dx.doi.org/10.15598/aeee.v15i3.2330
dc.rights: © Vysoká škola báňská - Technická univerzita Ostrava
dc.rights: Attribution 4.0 International [*]
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/ [*]
dc.subject: auditory feature [cs]
dc.subject: degraded speech [cs]
dc.subject: speech quality [cs]
dc.title: Multiple time-instances features of degraded speech for single ended quality measurement [cs]
dc.type: article [cs]
dc.identifier.doi: 10.15598/aeee.v15i3.2330
dc.rights.access: openAccess
dc.type.version: publishedVersion
dc.type.status: Peer-reviewed
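
The final step described in the abstract, equal weighting of the objective MOS values over combinations of contiguous active speech segments, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that "combinations" means every contiguous run of active segments (the record does not say which combinations are used), and `score_fn` is a hypothetical stand-in for the GMM-based mapping from features to an objective MOS.

```python
from statistics import fmean
from typing import Callable, List, Sequence, Tuple


def contiguous_combinations(n_segments: int) -> List[Tuple[int, int]]:
    """List (start, end) pairs, end exclusive, for every contiguous run of segments.

    Assumption: all contiguous runs are used; the abstract does not specify this.
    """
    return [
        (start, end)
        for start in range(n_segments)
        for end in range(start + 1, n_segments + 1)
    ]


def overall_mos(
    segments: Sequence,
    score_fn: Callable[[Sequence], float],
) -> float:
    """Average the per-combination objective MOS values with equal weight.

    `score_fn` is hypothetical: it stands in for whatever model maps the
    features of one run of active segments to an objective MOS value.
    """
    spans = contiguous_combinations(len(segments))
    if not spans:
        raise ValueError("at least one active speech segment is required")
    scores = [score_fn(segments[start:end]) for start, end in spans]
    return fmean(scores)  # equal weight for every combination
```

With three active segments this evaluates six runs (three of length one, two of length two, one of length three) and returns their plain arithmetic mean.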


Except where otherwise noted, this item's license is described as © Vysoká škola báňská - Technická univerzita Ostrava