no code implementations • 25 May 2023 • Ashim Gupta, Carter Wood Blum, Temma Choji, Yingjie Fei, Shalin Shah, Alakananda Vempala, Vivek Srikumar
For example, on sentiment classification using the SST-2 dataset, our method improves the adversarial accuracy over the best existing defense approach by more than 4% with a smaller decrease in task accuracy (0. 5% vs 2. 5%).
no code implementations • 7 Mar 2022 • Yingjie Fei, Ruitu Xu
In this paper, we study gap-dependent regret guarantees for risk-sensitive reinforcement learning based on the entropic risk measure.
no code implementations • NeurIPS 2021 • Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang
The exponential Bellman equation inspires us to develop a novel analysis of Bellman backup procedures in risk-sensitive RL algorithms, and further motivates the design of a novel exploration mechanism.
no code implementations • 1 Jan 2021 • Yingjie Fei, Zhuoran Yang, Zhaoran Wang
We study risk-sensitive reinforcement learning with the entropic risk measure and function approximation.
no code implementations • NeurIPS 2020 • Yingjie Fei, Zhuoran Yang, Zhaoran Wang, Qiaomin Xie
We consider reinforcement learning (RL) in episodic MDPs with adversarial full-information reward feedback and unknown fixed transition kernels.
no code implementations • NeurIPS 2020 • Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang, Qiaomin Xie
We study risk-sensitive reinforcement learning in episodic Markov decision processes with unknown transition kernels, where the goal is to optimize the total reward under the risk measure of exponential utility.
no code implementations • 21 Apr 2019 • Yingjie Fei, Yudong Chen
We study the statistical performance of semidefinite programming (SDP) relaxations for clustering under random graph models.
no code implementations • 17 Mar 2018 • Yingjie Fei, Yudong Chen
The error of the integer program, and hence that of the SDP, are further shown to decay exponentially in the signal-to-noise ratio.
no code implementations • 23 May 2017 • Yingjie Fei, Yudong Chen
In this paper we consider the cluster estimation problem under the Stochastic Block Model.