On the Inference of Stochastic Regular Grammars
1978, Information and Control
https://doi.org/10.1016/S0019-9958(78)90106-7Abstract
The relevance of grammatical inference techniques to the semiautomatic construction from empirical data, of a model of human decision making, is outlined. A grammatical inference problem is presented in which the least complex stochastic regular grammar is sought which describes a given set of strings. An upper bound on the complexity of the best grammar for a given data set is found, and some properties of the grammars which are less complex than the bound are proved. The technique of splitting grammars is used to organize a search of these grammars. An initial grammar is defined, and it is established that any best grammar is obtainable by repeated splitting of the initial grammar. The performance of a program based on these results is" described.
References (19)
- AMAREL, S. (1971), Representations and modelling in problems of program formation, in "Machine Intelligence 6" (Michie and Meltzer, Eds.), pp. 411-466, Univ. of Edinburgh Press.
- BIERMANN, A. W., AND FELDMAN, J. A. (1973), A survey of results in grammatical inference, in "Frontiers of Pattern Recognition" (M. S. Watanabe, Ed.), pp. 31-54, Academic Press, New York.
- BUCHANAN, B. G., FEIGENBAUM, E. A., AND SRIDHARAN, N. S. (1972), Heuristic theory formation: data interpretation and rule formation, in "Machine Intelligence 7" (Meltzer and Michie, Eds.), pp. 267-292, Univ. of Edinburgh Press.
- DAVIS, R., BUCHANAN, B., AND SHORTLIFFE, E. (1977), Production rules as a representation for a knowledge-based consultation program, Artificial Intelligence 8, 15.
- DAVIS, R., AND KING, J. (1977), An overview of production systems, in "Machine Intelligence 8: Machine Representations of Knowledge" (Elcock and Michie, Eds.), pp. 300-331, Wiley, New York.
- ELLIS, C. (1969), "Probabilistic Languages and Automata," Report No. 355, Dept. of Computer Sci., Univ. Illinois, Urbana.
- FELDMAN, J. A., AND SHIELDS, P. C. (1977), Total complexity and the inference of best programs, Math. Syst. Theor., 10, 181.
- FU, K. S., AND BOOTH, W. L. (1975), Grammatical inference: introduction and survey, parts I & II, IEEE Trans. Systems, Man, and Cybernetics SMC-5, 95.
- GAINES, B. R. (1977), System identification, approximation, and complexity, Internat. J. General Systems 3, 145.
- GIMPEL, J. F. (1972), "SITBOL Version 1.0," Report $4D30, Bell Telephone Lab., Holmdel, N. J.
- GOLD, M. (1967), Language identification in the limit, Inform. Contr. 10, 447.
- HOPCROFT, J. E., AND ULLMAN, J. D. (1969), "Formal Languages and Their Relation to Automata," Addison-Wesley, Reading, Mass.
- HORNING, J. (1969), "A Study of Grammatical Inference," Report CS 139, Dept. Computer Sci., Stanford Univ.
- SHORTL~ErE, E. (1976), "Computer Based Medical Consultations: MYCIN," American Elsevier, New York.
- WALKER, A. (1977), "On the Induction of a Decision-Making System from a Data Base," Technical Report 80, Dept. Computer Sci., Rutgers University.
- WATANABE, S. (1969), "Knowing and Guessing," Wiley, New York.
- WATERMAN, D. A. (1977), Exemplary programming in RtTA, h* "Pattern-Directed Inference Systems" (Waterman and I~ayes-Roth, Eds.), Academic Press, New York.
- WEISS, S., KULIKOWSKI, C. A., AND SAFIR, A. (1978), Glaucoma consultation by computer, Comput. Biol. Med. 8, 25.
- WHARTON, R. M. (1977), Grammar enumeration and inference, Inform. Contr. 33, 253.