no code implementations • 6 Dec 2023 • Tuoyuan Cheng, Kan Chen
We consider similarity and optimality measures for value models and employ probability-matching ("blending") and a greedy algorithm ("switching") for policy models.
no code implementations • 15 Sep 2022 • Kan Chen, Tuoyuan Cheng
In this paper, we propose a tail risk measure based on the most probable maximum size of risk events (MPMR) that can occur over a length of time.