Fuzzy optimality relation for perceptive MDPs—the average case
2007, Fuzzy Sets and Systems
https://doi.org/10.1016/J.FSS.2007.04.006Abstract
This paper is a sequel to Kurano et al [9], [10], in which the fuzzy perceptive models for optimal stopping or discounted Markov decision process is given. We proposed a method of computing the corresponding fuzzy perceptive values. Here, we deal with the average case for Markov decision processes with fuzzy perceptive transition matrices and characterize the optimal average expected reward, called the average perceptive value, by a fuzzy optimality relation. Also, we give a numerical example.
References (18)
- Blackwell,D., Discrete dynamic programming, Ann. Math. Statist., 33, (1962), 719-726.
- Dantzig,G.B., Folkman,J. and Shapiro,N., On the continuity of the minimum set of a continuous function, J. Math. Anal. Appl., 17, (1967), 519-548.
- Derman,C., Finite State Markovian Decision Processes, Academic Press, New York, (1970).
- Dubois,D. and Prade,H., Fuzzy Sets and Systems : Theory and Applications, Academic Press, (1980).
- Howard,R., Dynamic Programming and Markov Process, MIT Press, Cambridge, MA, (1960).
- Kurano,M., Song,J., Hosaka,M. and Huang,Y., Controlled Markov set-chains discounting, J. Appl. Prob., 35, (1998), 293-302.
- Kurano,M., Yasuda,M. Nakagami,J. and Yoshida,Y., Ordering of fuzzy sets - A brief survey and new results, J. Operations Research Society of Japan, 43, (2000), 138-148.
- Kurano,M., Yasuda,M. Nakagami,J. and Yoshida,Y., A fuzzy treatment of uncertain Markov decision process, RIMS Kokyuroku, Kyoto University, 1132, (2000), 221-229.
- Kurano,M., Yasuda,M. Nakagami,J. and Yoshida,Y., A fuzzy stopping problem with the concept of perception, Fuzzy Optimization and Decision Making, 3, (2004), 367-374.
- Kurano,M., Yasuda,M. Nakagami,J. and Yoshida,Y., Fuzzy perceptive values for MDPs with discounting, in: V.Torra, Y,Narukawa and S.Miyamoto eds., MDAI 2005, LNAI 3558, Springer, (2005), 283-293.
- Mine,H. and Osaki,S., Markov Decision Process, Elsevier, Amsterdam, (1970).
- Nummelin,E., General irreducible Markov chains and non-negative operators, Cambridge University Press, (1984).
- Puterman,M.L., Markov Decision Process: Discrete Stochastic Dynamic Programming, John Wiley & Sons, INC, (1994).
- Schweizer,D.T., Perturbation theory and finite Markov chains, J. Applied Probab., 5, (2068), 401-413.
- Solan,E., Continuity of the value of competitive Markov decision processes, J. Theoretical Probability, 16, (2004), 831-845.
- Yoshida,Y. and Kerre,E.E., A fuzzy ordering on multi-dimensional fuzzy sets induced from convex cones, Fuzzy Sets and Systems, 130, (2002), 343-355.
- Zadeh,L.A., Fuzzy sets, Inform. and Control, 8, (1965), 338-353.
- Zadeh,L.A., Toward a perception-based theory of probabilistic reasoning with imprecise probabilities, J. of Statistical Planning and Inference, 105, (2002), 233-264.