State mixture modelling applied to speech recognition
1999, Pattern Recognition Letters
https://doi.org/10.1016/S0167-8655(99)00113-0
Abstract
In state mixture modelling (SMM), the temporal structure of the observation sequences is represented by the joint probability distribution of the states, in which mixtures of states are considered. The technique is applied in an iterative scheme via maximum likelihood estimation. A fuzzy estimation approach is also introduced to work together with the SMM. The new approach not only reduces the computation from 2T·N^T (direct HMM calculation) and N^2·T (forward-backward algorithm) to only 2NT calculations, but also achieves better recognition results.
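To give a feel for the size of the saving, the following is a minimal sketch that tabulates operation counts for a few model sizes. It assumes the usual textbook counts for HMM evaluation (about 2T·N^T for direct enumeration of state sequences and about N^2·T for the forward procedure) together with the 2NT figure quoted in the abstract, where N is the number of states and T the number of observation frames. The function names are hypothetical and the counts are order-of-magnitude estimates, not measurements from the paper.

```python
def direct_cost(n_states: int, n_frames: int) -> int:
    """Approximate cost of enumerating every state sequence: 2T * N^T."""
    return 2 * n_frames * n_states ** n_frames


def forward_cost(n_states: int, n_frames: int) -> int:
    """Approximate cost of the forward procedure: N^2 * T."""
    return n_states ** 2 * n_frames


def smm_cost(n_states: int, n_frames: int) -> int:
    """Cost figure quoted in the abstract for SMM: 2NT."""
    return 2 * n_states * n_frames


if __name__ == "__main__":
    # Illustrative (N, T) pairs chosen for this sketch, not taken from the paper.
    for n, t in [(5, 10), (5, 100), (10, 100)]:
        print(f"N={n:3d} T={t:4d} "
              f"direct={direct_cost(n, t):.3e} "
              f"forward={forward_cost(n, t)} "
              f"smm={smm_cost(n, t)}")
```

Under these assumed counts, the ratio of the forward-procedure cost to the SMM cost is N/2, so the saving over the forward-backward algorithm grows linearly with the number of states and only appears for N > 2.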