no code implementations • 11 Mar 2024 • Chi-Yang Hsu, Kyle Cox, Jiawei Xu, Zhen Tan, Tianhua Zhai, Mengzhou Hu, Dexter Pratt, Tianlong Chen, Ziniu Hu, Ying Ding
We present the Thought Graph as a novel framework to support complex reasoning and use gene set analysis as an example to uncover semantic relationships between biological processes.
1 code implementation • 5 Mar 2024 • Zhen Yao, Jiawei Xu, Shuhang Hou, Mooi Choo Chuah
LCSD consists of 102 well-illuminated crack images and 41 low-light crack images.
1 code implementation • 5 Feb 2024 • Yingru Li, Jiawei Xu, Lei Han, Zhi-Quan Luo
To solve complex tasks under resource constraints, reinforcement learning (RL) agents need to be simple, efficient, and scalable, addressing (1) large state spaces and (2) the continuous accumulation of interaction data.
2 code implementations • 19 Oct 2023 • Rui Yang, Han Zhong, Jiawei Xu, Amy Zhang, Chongjie Zhang, Lei Han, Tong Zhang
Offline reinforcement learning (RL) presents a promising approach for learning reinforced policies from offline datasets without the need for costly or unsafe interactions with the environment.
no code implementations • 17 Apr 2023 • Jiawei Xu, Zongqing Lu, Qingmin Liao
Lack of texture often causes ambiguity in matching, and handling this issue is an important challenge in optical flow estimation.
no code implementations • 13 Jun 2022 • Jiawei Xu, Cheng Zhou, Yizheng Zhang, Baoxiang Wang, Lei Han
Integrating the two algorithms results in the complete Relative Policy-Transition Optimization (RPTO) algorithm, in which the policy interacts with the two environments simultaneously, such that data collections from two environments, policy and transition updates are completed in one closed loop to form a principled learning framework for policy transfer.
no code implementations • 8 May 2022 • Jiawei Xu, Wenxuan Fu, Haoyu Bu, Zhi Wang, Lingyun Ying
We demonstrate the effectiveness of our methods and the low training cost requirement of SeqNet in our experiments.
1 code implementation • 29 Mar 2022 • Zhishe Wang, Wenyu Shao, Yanlin Chen, Jiawei Xu, Xiaoqin Zhang
The existing generative adversarial fusion methods generally concatenate source images and extract local features through convolution operation, without considering their global characteristics, which tends to produce an unbalanced result and is biased towards the infrared image or visible image.
no code implementations • 29 Sep 2021 • Shuxing Li, Jiawei Xu, Chun Yuan, Peng Sun, Zhuobin Zheng, Zhengyou Zhang, Lei Han
We provide comprehensive analysis and experiments to elaborate the effect of each component in affecting the agent performance, and demonstrate that the proposed and adopted techniques are important to achieve superior performance in general end-to-end FPS games.
no code implementations • 29 Sep 2021 • Jiawei Xu, Shuxing Li, Chun Yuan, Zhengyou Zhang, Lei Han
In this paper, inspired by Bootstrapped DQN, we use multiple heads in DDPG and take advantage of the diversity and uncertainty among multiple heads to improve the data efficiency with relabeled goals.
1 code implementation • 25 Nov 2020 • Peng Sun, Jiechao Xiong, Lei Han, Xinghai Sun, Shuxing Li, Jiawei Xu, Meng Fang, Zhengyou Zhang
This poses non-trivial difficulties for researchers or engineers and prevents the application of MARL to a broader range of real-world problems.