Search Results for author: Zhengqi Wu

Found 1 papers, 0 papers with code

Risk-sensitive Markov Decision Process and Learning under General Utility Functions

no code implementations22 Nov 2023 Zhengqi Wu, Renyuan Xu

In this paper, we consider a scenario where the decision-maker seeks to optimize a general utility function of the cumulative reward in the framework of a Markov decision process (MDP).

Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.