Fuzzy Optimality Equations for Perceptive MDPs
Abstract
This paper is a sequel to Kurano et al [9], [10], in which the fuzzy perceptive models for optimal stopping or discounted Markov decision process are proposed and the methods of computing the corresponding fuzzy perceptive values are given. Here, we deal with the average case for Markov decisin processes with fuzzy perceptive transition matrices and characterize the optimal average expected reward, called the average perceptive value, by a fuzzy optimality equation. Also, we give a numerical example.
References (18)
- Blackwell,D., Discrete dynamic programming, Ann. Math. Statist., 33, (1962), 719-726.
- Dantzig,G.B., Folkman,J. and Shapiro,N., On the continuity of the minimum set of a continuous func- tion, J. Math. Anal. Appl., 17, (1967), 519-548.
- Derman,C., Finite State Markovian Decision Pro- cesses, Academic Press, New York, (1970).
- Dubois,D. and Prade,H., Fuzzy Sets and Systems : Theory and Applications, Academic Press, (1980).
- Howard,R., Dynamic Programming and Markov Process, MIT Press, Cambridge, MA, (1960).
- Kurano,M., Song,J., Hosaka,M. and Huang,Y., Con- trolled Markov set-chains with discounting, J. Appl. Prob., 35, (1998), 293-302.
- Kurano,M., Yasuda,M. Nakagami,J. and Yoshida,Y., Ordering of fuzzy sets -A brief survey and new results, J. Operations Research Society of Japan, 43, (2000), 138-148.
- Kurano,M., Yasuda,M. Nakagami,J. and Yoshida,Y., A fuzzy treatment of uncertain Markov decision process, RIMS Kokyuroku, Kyoto University, 1132, (2000), 221-229.
- Kurano,M., Yasuda,M. Nakagami,J. and Yoshida,Y., A fuzzy stopping problem with the concept of perception, Fuzzy Optimization and Decision Making, 3, (2004), 367-374.
- Kurano,M., Yasuda,M. Nakagami,J. and Yoshida,Y., Fuzzy perceptive values for MDPs with discounting, (Preprint).
- Mine,H. and Osaki,S., Markov Decision Process, El- sevier, Amsterdam, (1970).
- Nummelin,E., General irreducible Markov chains and non-negative operators, Cambridge University Press, (1984).
- Puterman,M.L., Markov Decision Process: Discrete Stochastic Dynamic Programming, John Wiley & Sons, INC, (1994).
- Schweizer,D.T., Perturbation theory and finite Markov chains, J. Applied Probab., 5, (2068), 401- 413.
- Solan,E., Continuity of the value of competitive Markov decision processes, J. Theoretical Probabil- ity, 16, (2004), 831-845.
- Yoshida,Y. and Kerre,E.E., A fuzzy ordering on multi-dimensional fuzzy sets induced from convex cones, Fuzzy Sets and Systems, 130, (2002), 343- 355.
- Zadeh,L.A., Fuzzy sets, Inform. and Control, 8, (1965), 338-353.
- Zadeh,L.A., Toward a perception-based theory of probabilistic reasoning with imprecise probabilities, J. of Statistical Planning and Inference, 105, (2002), 233-264.