Multiple time-instances features of degraded speech for single ended quality measurement

dc.contributor.authorDubey, Rajesh Kumar
dc.contributor.authorKumar, Arun
dc.date.accessioned2017-11-30T07:25:14Z
dc.date.available2017-11-30T07:25:14Z
dc.date.issued2017
dc.description.abstractThe use of single time-instance features, where entire speech utterance is used for feature computation, is not accurate and adequate in capturing the time localized information of short-time transient distortions and their distinction from plosive sounds of speech, particularly degraded by impulsive noise. Hence, the importance of estimating features at multiple time-instances is sought. In this, only active speech segments of degraded speech are used for features computation at multiple time-instances on per frame basis. Here, active speech means both voiced and unvoiced frames except silence. The features of different combinations of multiple contiguous active speech segments are computed and called multiple time-instances features. The joint GMM training has been done using these features along with the subjective MOS of the corresponding speech utterance to obtain the parameters of GMM. These parameters of GMM and multiple time-instances features of test speech are used to compute the objective MOS values of different combinations of multiple contiguous active speech segments. The overall objective MOS of the test speech utterance is obtained by assigning equal weight to the objective MOS values of the different combinations of multiple contiguous active speech segments. This algorithm outperforms the Recommendation ITU-T P.563 and recently published algorithms.cs
dc.format.extent862748 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.citationAdvances in electrical and electronic engineering. 2017, vol. 15, no. 3, p. 400-407 : ill.cs
dc.identifier.doi10.15598/aeee.v15i3.2330
dc.identifier.issn1336-1376
dc.identifier.issn1804-3119
dc.identifier.urihttp://hdl.handle.net/10084/122089
dc.languageNeuvedenocs
dc.language.isoencs
dc.publisherVysoká škola báňská - Technická univerzita Ostravacs
dc.relation.ispartofseriesAdvances in electrical and electronic engineeringcs
dc.relation.urihttp://dx.doi.org/10.15598/aeee.v15i3.2330
dc.rights© Vysoká škola báňská - Technická univerzita Ostrava
dc.rights© Vysoká škola báňská - Technická univerzita Ostrava
dc.rightsAttribution 4.0 International*
dc.rights.accessopenAccess
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectauditory featurecs
dc.subjectdegraded speechcs
dc.subjectspeech qualitycs
dc.titleMultiple time-instances features of degraded speech for single ended quality measurementcs
dc.typearticlecs
dc.type.statusPeer-reviewed
dc.type.versionpublishedVersion

Files

Original bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
2330-12521-1-PB.pdf
Size:
842.53 KB
Format:
Adobe Portable Document Format
Description:
publishedVersion

License bundle

Now showing 1 - 1 out of 1 results
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: