Emotion Recognition from Hindi Speech using MFCC and Sparse DTW
2015, International Journal of Engineering Research and
https://doi.org/10.17577/IJERTV4IS060003Abstract
Recently increasing attention has been directed to the study of emotional content of speech signals, and hence, many systems have been proposed to identify the emotional content of a spoken utterance. The project of Emotion Recognition from Hindi Speech address to three main aspects of speech recognition system. The first one is the choice of suitable features for speech representation. Using Sparse DTW for feature recognition has improved space efficiency and time complexity. Implementation of automatic emotion recognition system (using MATLAB) provides an accuracy of over 75% for 5 emotions namely: happy, sad, surprise, anger and neutral over a database containing large variety of speakers.
References (8)
- REFERENCES
- L. R. Rabiner, M. R. Sambur, "An algorithm for determining the endpoints of isolated utterances", Bell System Technical Journal, 54, p. 297-315, Feb. 1975.
- T. B. Amin, I. Mahmood, "Speech Recognition Using Dynamic Time Warping", 2nd International Conference on Advances in Space Technologies, Proceedings of ICAST, vol. 2, pp. 74-79, November, 2008.
- K. Yamamoto, F. Jabloun, K. Reinhard and A. Kawamura, "Robust method for end point detection using discriminative feature extraction", IEEE Proceedings, Europe, 2006.
- K. R. Aida-Zade, C. Ardil and S.S. Rustamov , " Investigation of Combined use of MFCC and LPC Features in Speech Recognition Systems ", World Academy of Science, Engineering and Technology, 2006.
- Digital Signal Processing Mini-Project, "An Automatic Speaker Recognition System", Minh N. Do, Audio Visual Communications Laboratory, Swiss Federal Institute of Technology, Lausanne, Switzerland.
- L. Rabiner, and B. Juan, Fundamentals of speech recognition, Prentice Hall PTR, New Yersey, ISBN 0-13- 015157-2.
- F. Dellaert, T. Polzin and A. Waibel, "Recognizing emotion in speech", IEEE International Conference on Emotion and Signal Processing, pp. 1970-1973, 2004.