Extreme Risk Averse Policy for Goal-Directed Risk-Sensitive Markov Decision Process

Main Article Content

Valdinei Freire Karina Valdivia Delgado

Abstract

The Goal-Directed Risk-Sensitive Markov Decision Process allows arbitrary risk attitudes for the probabilistic planning problem to reach a goal state. In this problem, the risk attitude is modeled by an expected exponential utility and a risk factor λ. However, the problem is not well defined for every λ, posing the problem of defining the maximum (extreme) value for this factor. In this paper, we propose an algorithm to find this -extreme risk factor and the corresponding optimal policy.

Article Details

How to Cite
FREIRE, Valdinei; VALDIVIA DELGADO, Karina. Extreme Risk Averse Policy for Goal-Directed Risk-Sensitive Markov Decision Process. BRACIS, [S.l.], july 2017. Available at: <http://143.54.25.88/index.php/bracis/article/view/86>. Date accessed: 19 sep. 2024. doi: https://doi.org/10.1235/bracis.vi.86.
Section
Artigos