Search Results for author: Zishun Yu

Found 3 papers, 1 paper with code

$\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis

no code implementations • 4 Oct 2023 • Zishun Yu, Yunzhe Tao, Liyu Chen, Tao Sun, Hongxia Yang

Despite policy-based RL methods dominating the literature on RL for program synthesis, the nature of program synthesis tasks hints at a natural alignment with value-based methods.

Code Generation, Program Synthesis +2

Slowly Changing Adversarial Bandit Algorithms are Efficient for Discounted MDPs

no code implementations • 18 May 2022 • Ian A. Kash, Lev Reyzin, Zishun Yu

Reinforcement learning generalizes multi-armed bandit problems with additional difficulties of a longer planning horizon and unknown transition kernel.

Multi-Armed Bandits, reinforcement-learning +1

Orthogonal Gromov-Wasserstein Discrepancy with Efficient Lower Bound

1 code implementation • 12 May 2022 • Hongwei Jin, Zishun Yu, Xinhua Zhang

Comparing structured data from possibly different metric-measure spaces is a fundamental task in machine learning, with applications in, e.g., graph classification.

Graph Classification
