Observation uncertainty measures for sparse imputation
2010, Interspeech 2010
https://doi.org/10.21437/INTERSPEECH.2010-621Abstract
Missing data imputation estimates the clean speech features for automatic speech recognition in noisy environments. The estimates are usually considered equally reliable while in reality, the estimation accuracy varies from feature to feature. In this work, we propose uncertainty measures to characterise the expected accuracy of a sparse imputation (SI) based missing data method. In experiments on noisy large vocabulary speech data, using observation uncertainties derived from the proposed measures improved the speech recognition performance on features estimated with SI. Relative error reductions up to 15 % compared to the baseline system using SI without uncertainties were achieved with the best measures.
References (11)
- References
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho," Robust au- tomatic speech recognition with missing and unreliable acoustic data", Speech Communication 34 (3): 267-285, 2001.
- J. Gemmeke and B. Cranen, "Missing data imputation using com- pressive sensing techniques for connected digit recognition", in Proc. DSP, Santorini, Greece, pp. 1-8, 2009.
- J. F. Gemmeke, B. Cranen, and U. Remes, "Sparse imputa- tion for large vocabulary noise robust ASR", accepted for pub- lication in Computer Speech and Language, preprint available: http://www.amadana.nl/publications.html
- L. Deng, J. Droppo, and A. Acero, "Dynamic compensation of HMM variances using the feature enhancement uncertainty com- puted from a parametric model of speech distortion", IEEE Trans. SAP 13 (3): 412-421, 2005.
- J. A. Arrowood and M. A. Clements, "Using observation uncer- tainty in HMM decoding", in Proc. ICSLP, Denver, Colorado, USA, pp. 1561-1564, 2002.
- S. Srinivasan and D. L. Wang, "A supervised learning approach to uncertainty decoding for robust speech recognition", in Proc. ICASSP, Toulouse, France, pp. 297-300, 2006.
- E. J. Candés and M. B. Wakin, "An introduction to compres- sive sampling", IEEE Signal Processing Magazine, 25 (2): 21-30, 2008.
- H. Liao and M. J. F. Gales, "Issues with uncertainty decoding for noise robust automatic speech recognition", Speech Communica- tion 50 (4): 265-277, 2008.
- T. Hirsimäki, M. Creutz, V. Siivola, M. Kurimo, S. Virpioja, and J. Pylkkönen, "Unlimited vocabulary speech recognition with morph language models applied to Finnish", Computer Speech and Language 20 (4): 515-541, 2006.
- J. Wright, A. Y. Yang, A. Ganesh, S. Shankar Sastry and Y. Ma, "Robust Face Recognition via Sparse Representation", IEEE Transactions on Pattern Analysis and Machine Intelligence,31 (2): 210-227, 2009.