no code implementations • 20 Aug 2021 • Luo Ji, Qin Qi, Bingqing Han, Hongxia Yang
In RL-LTV, the critic studies the historical trajectories of items and predicts the future LTV of a fresh item, while the actor suggests a score-based policy that maximizes the expected future LTV.
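
The critic/actor split described above can be illustrated with a toy sketch. It is not the authors' implementation: the linear critic, the Monte Carlo regression on discounted returns, the greedy "score equals predicted LTV" actor, and all names (`discounted_returns`, `LinearCritic`, `actor_scores`) are assumptions made purely for illustration.

```python
from typing import Dict, List, Sequence, Tuple

# One step of an item's history: (item features, observed reward).
Step = Tuple[Sequence[float], float]


def discounted_returns(rewards: Sequence[float], gamma: float) -> List[float]:
    """Return the discounted sum of future rewards at every step."""
    out, g = [], 0.0
    for r in reversed(rewards):
        g = r + gamma * g
        out.append(g)
    return out[::-1]


class LinearCritic:
    """Hypothetical critic: a linear model regressed on discounted returns."""

    def __init__(self, dim: int, lr: float = 0.05) -> None:
        self.w = [0.0] * dim
        self.b = 0.0
        self.lr = lr

    def predict(self, x: Sequence[float]) -> float:
        return sum(wi * xi for wi, xi in zip(self.w, x)) + self.b

    def fit(self, trajectories: List[List[Step]],
            gamma: float = 0.9, epochs: int = 500) -> None:
        # Plain SGD on squared error between prediction and observed return.
        for _ in range(epochs):
            for traj in trajectories:
                returns = discounted_returns([r for _, r in traj], gamma)
                for (x, _), g in zip(traj, returns):
                    err = self.predict(x) - g
                    self.w = [wi - self.lr * err * xi
                              for wi, xi in zip(self.w, x)]
                    self.b -= self.lr * err


def actor_scores(critic: LinearCritic,
                 fresh_items: Dict[str, Sequence[float]]) -> Dict[str, float]:
    """Hypothetical greedy actor: score each fresh item by its predicted LTV."""
    return {name: critic.predict(x) for name, x in fresh_items.items()}


# Toy data (made up): items with feature 1.0 earn reward, items with 0.0 do not.
history = [
    [([1.0], 1.0), ([1.0], 1.0), ([1.0], 1.0)],
    [([0.0], 0.0), ([0.0], 0.0), ([0.0], 0.0)],
]
critic = LinearCritic(dim=1)
critic.fit(history, gamma=0.5)
scores = actor_scores(critic, {"hot": [1.0], "cold": [0.0]})
ranking = sorted(scores, key=scores.get, reverse=True)
```

In this sketch the actor simply ranks fresh items by the critic's prediction. A learned actor, as the abstract describes, would instead be trained to output scores that maximize the expected LTV.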