1 code implementation • 19 Apr 2024 • Xiao-Yin Liu, Guotao Li, Xiao-Hu Zhou, Xu Liang, Zeng-Guang Hou
The developed multi-source UDA theory provides a guarantee on the generalization error for the target subject.
Intent Detection • Multi-Source Unsupervised Domain Adaptation
1 code implementation • 7 Dec 2023 • Xiao-Yin Liu, Xiao-Hu Zhou, Guotao Li, Hao Li, Mei-Jiang Gui, Tian-Yu Xiang, De-Xing Huang, Zeng-Guang Hou
This method trades off performance and robustness by introducing a robust Bellman operator into the algorithm.
1 code implementation • 26 Oct 2023 • Hao Li, Xiao-Hu Zhou, Xiao-Liang Xie, Shi-Qi Liu, Zhen-Qiu Feng, Xiao-Yin Liu, Mei-Jiang Gui, Tian-Yu Xiang, De-Xing Huang, Bo-Xian Yao, Zeng-Guang Hou
Offline reinforcement learning (RL) aims to optimize a policy using previously collected data, without online interaction.
no code implementations • 16 Sep 2023 • Xiao-Yin Liu, Xiao-Hu Zhou, Xiao-Liang Xie, Shi-Qi Liu, Zhen-Qiu Feng, Hao Li, Mei-Jiang Gui, Tian-Yu Xiang, De-Xing Huang, Zeng-Guang Hou
However, uncertainty estimation is unreliable and leads to poor performance in certain scenarios, and previous methods ignore differences among the model data, which makes them overly conservative.