Search Results for author: Jiawei Xu

Found 11 papers, 5 papers with code

Thought Graph: Generating Thought Process for Biological Reasoning

no code implementations • 11 Mar 2024 • Chi-Yang Hsu, Kyle Cox, Jiawei Xu, Zhen Tan, Tianhua Zhai, Mengzhou Hu, Dexter Pratt, Tianlong Chen, Ziniu Hu, Ying Ding

We present the Thought Graph as a novel framework to support complex reasoning and use gene set analysis as an example to uncover semantic relationships between biological processes.

Paper
Add Code

CrackNex: a Few-shot Low-light Crack Segmentation Model Based on Retinex Theory for UAV Inspections

1 code implementation • 5 Mar 2024 • Zhen Yao, Jiawei Xu, Shuhang Hou, Mooi Choo Chuah

LCSD consists of 102 well-illuminated crack images and 41 low-light crack images.

Crack Segmentation Segmentation

Paper
Code

HyperAgent: A Simple, Scalable, Efficient and Provable Reinforcement Learning Framework for Complex Environments

1 code implementation • 5 Feb 2024 • Yingru Li, Jiawei Xu, Lei Han, Zhi-Quan Luo

To solve complex tasks under resource constraints, reinforcement learning (RL) agents need to be simple, efficient, and scalable, addressing (1) large state spaces and (2) the continuous accumulation of interaction data.

LEMMA Reinforcement Learning (RL)

Paper
Code

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

2 code implementations • 19 Oct 2023 • Rui Yang, Han Zhong, Jiawei Xu, Amy Zhang, Chongjie Zhang, Lei Han, Tong Zhang

Offline reinforcement learning (RL) presents a promising approach for learning reinforced policies from offline datasets without the need for costly or unsafe interactions with the environment.

Offline RL Q-Learning +2

Paper
Code

LLA-FLOW: A Lightweight Local Aggregation on Cost Volume for Optical Flow Estimation

no code implementations • 17 Apr 2023 • Jiawei Xu, Zongqing Lu, Qingmin Liao

Lack of texture often causes ambiguity in matching, and handling this issue is an important challenge in optical flow estimation.

Optical Flow Estimation

Paper
Add Code

Relative Policy-Transition Optimization for Fast Policy Transfer

no code implementations • 13 Jun 2022 • Jiawei Xu, Cheng Zhou, Yizheng Zhang, Baoxiang Wang, Lei Han

Integrating the two algorithms results in the complete Relative Policy-Transition Optimization (RPTO) algorithm, in which the policy interacts with the two environments simultaneously, such that data collections from two environments, policy and transition updates are completed in one closed loop to form a principled learning framework for policy transfer.

Continuous Control LEMMA +1

Paper
Add Code

SeqNet: An Efficient Neural Network for Automatic Malware Detection

no code implementations • 8 May 2022 • Jiawei Xu, Wenxuan Fu, Haoyu Bu, Zhi Wang, Lingyun Ying

We demonstrate the effectiveness of our methods and the low training cost requirement of SeqNet in our experiments.

Efficient Neural Network Feature Engineering +2

Paper
Add Code

Infrared and Visible Image Fusion via Interactive Compensatory Attention Adversarial Learning

1 code implementation • 29 Mar 2022 • Zhishe Wang, Wenyu Shao, Yanlin Chen, Jiawei Xu, Xiaoqin Zhang

The existing generative adversarial fusion methods generally concatenate source images and extract local features through convolution operation, without considering their global characteristics, which tends to produce an unbalanced result and is biased towards the infrared image or visible image.

Decoder Infrared And Visible Image Fusion

Paper
Code

Superior Performance with Diversified Strategic Control in FPS Games Using General Reinforcement Learning

no code implementations • 29 Sep 2021 • Shuxing Li, Jiawei Xu, Chun Yuan, Peng Sun, Zhuobin Zheng, Zhengyou Zhang, Lei Han

We provide comprehensive analysis and experiments to elaborate the effect of each component in affecting the agent performance, and demonstrate that the proposed and adopted techniques are important to achieve superior performance in general end-to-end FPS games.

FPS Games General Reinforcement Learning +2

Paper
Add Code

Bootstrapped Hindsight Experience replay with Counterintuitive Prioritization

no code implementations • 29 Sep 2021 • Jiawei Xu, Shuxing Li, Chun Yuan, Zhengyou Zhang, Lei Han

In this paper, inspired by Bootstrapped DQN, we use multiple heads in DDPG and take advantage of the diversity and uncertainty among multiple heads to improve the data efficiency with relabeled goals.

Q-Learning

Paper
Add Code

TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning

1 code implementation • 25 Nov 2020 • Peng Sun, Jiechao Xiong, Lei Han, Xinghai Sun, Shuxing Li, Jiawei Xu, Meng Fang, Zhengyou Zhang

This poses non-trivial difficulties for researchers or engineers and prevents the application of MARL to a broader range of real-world problems.

Dota 2 Multi-agent Reinforcement Learning +4

131

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.