no code implementations • 27 Feb 2024 • Zijian Guo, Weichao Zhou, Wenchao Li
Offline safe reinforcement learning (RL) aims to train a constraint satisfaction policy from a fixed dataset.
1 code implementation • 12 Oct 2023 • Qingliang Fan, Zijian Guo, Ziwei Mei, Cun-Hui Zhang
In this paper, we propose new estimation and inference procedures for nonparametric treatment effect functions with endogeneity and potentially high-dimensional covariates.
no code implementations • 12 Sep 2023 • Xin Xiong, Zijian Guo, Tianxi Cai
Many existing transfer learning methods rely on leveraging information from source data that closely resembles the target data.
no code implementations • 5 Sep 2023 • Zhenyu Wang, Peter Bühlmann, Zijian Guo
Classical machine learning methods may lead to poor prediction performance when the target distribution differs from the source populations.
3 code implementations • 15 Jun 2023 • Zuxin Liu, Zijian Guo, Haohong Lin, Yihang Yao, Jiacheng Zhu, Zhepeng Cen, Hanjiang Hu, Wenhao Yu, Tingnan Zhang, Jie Tan, Ding Zhao
This paper presents a comprehensive benchmarking suite tailored to offline safe reinforcement learning (RL) challenges, aiming to foster progress in the development and evaluation of safe learning algorithms in both the training and deployment phases.
no code implementations • 2 Apr 2023 • David Carl, Corinne Emmenegger, Peter Bühlmann, Zijian Guo
TSCI implements a two-stage algorithm.
no code implementations • 9 Mar 2023 • Yucheng Xu, Li Nanbo, Arushi Goel, Zijian Guo, Zonghai Yao, Hamidreza Kasaei, Mohammadreze Kasaei, Zhibin Li
Videos depict the change of complex dynamical systems over time in the form of discrete image sequences.
1 code implementation • 14 Feb 2023 • Zuxin Liu, Zijian Guo, Yihang Yao, Zhepeng Cen, Wenhao Yu, Tingnan Zhang, Ding Zhao
Safe reinforcement learning (RL) trains a constraint satisfaction policy by interacting with the environment.
1 code implementation • 29 May 2022 • Zuxin Liu, Zijian Guo, Zhepeng Cen, huan zhang, Jie Tan, Bo Li, Ding Zhao
One interesting and counter-intuitive finding is that the maximum reward attack is strong, as it can both induce unsafe behaviors and make the attack stealthy by maintaining the reward.
1 code implementation • 30 Apr 2022 • Qingliang Fan, Zijian Guo, Ziwei Mei
The theoretical power based on the maximum norm is shown to be higher than that in the modified Cragg-Donald test (Koles\'{a}r, 2018), which is the only existing test allowing for large-dimensional covariates.
1 code implementation • 24 Mar 2022 • Zijian Guo, Mengchu Zheng, Peter Bühlmann
The success of TSCI requires the instrumental variable's effect on treatment to differ from its violation form.
no code implementations • 4 May 2021 • Jue Hou, Zijian Guo, Tianxi Cai
Risk modeling with EHR data is challenging due to a lack of direct observations on the disease outcome, and the high dimensionality of the candidate predictors.
no code implementations • 15 Nov 2020 • Zijian Guo
Integrative analysis of data from multiple sources is critical to making generalizable discoveries.
1 code implementation • 8 Apr 2020 • Zijian Guo, Domagoj Ćevid, Peter Bühlmann
Inferring causal relationships or related associations from observational data can be invalidated by the existence of hidden confounding.
Methodology Statistics Theory Statistics Theory