no code implementations • 4 Oct 2023 • Zishun Yu, Yunzhe Tao, Liyu Chen, Tao Sun, Hongxia Yang
Despite policy-based RL methods dominating the literature on RL for program synthesis, the nature of program synthesis tasks hints at a natural alignment with value-based methods.
no code implementations • 18 May 2022 • Ian A. Kash, Lev Reyzin, Zishun Yu
Reinforcement learning generalizes multi-armed bandit problems with additional difficulties of a longer planning horizon and unknown transition kernel.
1 code implementation • 12 May 2022 • Hongwei Jin, Zishun Yu, Xinhua Zhang
Comparing structured data from possibly different metric-measure spaces is a fundamental task in machine learning, with applications in, e. g., graph classification.