Cavazos-Cadena, Rolando - In: Mathematical Methods of Operations Research 57 (2003) 2, pp. 263-285
This work concerns discrete-time Markov decision processes with finite state space and bounded costs per stage. The decision maker ranks random costs via the expectation of the utility function associated with a constant risk sensitivity coefficient, and the performance of a control policy is...
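
As a rough illustration of ranking random costs through expected utility with a constant risk sensitivity coefficient, the sketch below assumes the standard exponential utility, under which a random cost X with coefficient lam has certainty equivalent (1/lam) * log E[exp(lam * X)], reducing to the plain expectation when lam is 0. The abstract does not spell out this form, so the formula, the function name `certainty_equivalent`, and its parameters are assumptions made for this example only, not the author's notation.

```python
import math


def certainty_equivalent(costs, probs, lam):
    """Certainty equivalent of a finite random cost (hypothetical helper).

    Assumes exponential utility: (1/lam) * log E[exp(lam * X)] for lam != 0,
    and the ordinary expectation E[X] in the risk-neutral case lam == 0.
    """
    if lam == 0.0:
        # Risk-neutral case: plain expected cost.
        return sum(p * c for c, p in zip(costs, probs))
    # Shift by the largest exponent so that exp() cannot overflow.
    shift = max(lam * c for c in costs)
    total = sum(p * math.exp(lam * c - shift) for c, p in zip(costs, probs))
    return (shift + math.log(total)) / lam


if __name__ == "__main__":
    costs, probs = [0.0, 10.0], [0.5, 0.5]
    for lam in (-0.5, 0.0, 0.5):
        print(lam, certainty_equivalent(costs, probs, lam))
```

Under this assumed sign convention, a positive coefficient weights large costs more heavily, so the certainty equivalent lies above the expected cost, while a negative coefficient places it below.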