Search Results for author: Xishan Zhang

Found 15 papers, 4 papers with code

Pushing the Limits of Machine Design: Automated CPU Design with AI

1 code implementation · 21 Jun 2023 · Shuyao Cheng, Pengwei Jin, Qi Guo, Zidong Du, Rui Zhang, Yunhao Tian, Xing Hu, Yongwei Zhao, Yifan Hao, Xiangtao Guan, Husheng Han, Zhengyue Zhao, Ximing Liu, Ling Li, Xishan Zhang, Yuejie Chu, Weilong Mao, Tianshi Chen, Yunji Chen

By efficiently exploring a search space of unprecedented size 10^{10^{540}}, which to the best of our knowledge is the largest of any machine-designed object, and thus pushing the limits of machine design, our approach generates an industrial-scale RISC-V CPU within only 5 hours.

Online Prototype Alignment for Few-shot Policy Transfer

1 code implementation · 12 Jun 2023 · Qi Yi, Rui Zhang, Shaohui Peng, Jiaming Guo, Yunkai Gao, Kaizhao Yuan, Ruizhi Chen, Siming Lan, Xing Hu, Zidong Du, Xishan Zhang, Qi Guo, Yunji Chen

Domain adaptation in reinforcement learning (RL) mainly deals with changes in observation when transferring a policy to a new environment.

Domain Adaptation · Reinforcement Learning (RL)

Ultra-low Precision Multiplication-free Training for Deep Neural Networks

no code implementations · 28 Feb 2023 · Chang Liu, Rui Zhang, Xishan Zhang, Yifan Hao, Zidong Du, Xing Hu, Ling Li, Qi Guo

Energy-efficient approaches either reduce the precision of multiplications or replace them with cheaper operations such as addition or bitwise shifts, in order to cut the energy consumption of FP32 multiplications.

Quantization
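
For readers unfamiliar with the bitwise-shift idea mentioned in the excerpt above, the sketch below shows how a multiplication can be replaced by a shift once one operand is rounded to a power of two. This is a generic illustration only, not the training method proposed in the paper; the function name shift_multiply is hypothetical.

```python
import numpy as np

def shift_multiply(x, w):
    """Approximate x * w without a general multiplication: round each weight
    to its nearest signed power of two, so the product reduces to a sign flip
    plus a binary shift (expressed here via np.ldexp for readability).
    Generic sketch of shift-based arithmetic, not the paper's training scheme."""
    sign = np.sign(w)
    e = np.round(np.log2(np.abs(w) + 1e-12)).astype(int)  # shift amount per weight
    return sign * np.ldexp(x, e)                           # x * 2**e, i.e. x << e

x = np.array([3.0, -1.5, 4.0])
w = np.array([0.26, 0.9, -2.1])      # roughly 2^-2, 2^0, -2^1
print(shift_multiply(x, w))          # approximate product
print(x * w)                         # exact product, for comparison
```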

Causality-driven Hierarchical Structure Discovery for Reinforcement Learning

no code implementations · 13 Oct 2022 · Shaohui Peng, Xing Hu, Rui Zhang, Ke Tang, Jiaming Guo, Qi Yi, Ruizhi Chen, Xishan Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen

To address this issue, we propose CDHRL, a causality-driven hierarchical reinforcement learning framework that leverages causality-driven discovery instead of randomness-driven exploration to effectively build high-quality hierarchical structures in complicated environments.

Hierarchical Reinforcement Learning · reinforcement-learning +1

Object-Category Aware Reinforcement Learning

no code implementations · 13 Oct 2022 · Qi Yi, Rui Zhang, Shaohui Peng, Jiaming Guo, Xing Hu, Zidong Du, Xishan Zhang, Qi Guo, Yunji Chen

Object-oriented reinforcement learning (OORL) is a promising way to improve the sample efficiency and generalization ability over standard RL.

Feature Engineering · Object +3

Neural Program Synthesis with Query

no code implementations · ICLR 2022 · Di Huang, Rui Zhang, Xing Hu, Xishan Zhang, Pengwei Jin, Nan Li, Zidong Du, Qi Guo, Yunji Chen

In this work, we propose a query-based framework that trains a query neural network to generate informative input-output examples automatically and interactively from a large query space.

Program Synthesis

Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment

1 code implementation · 26 Jul 2021 · Jiaming Guo, Rui Zhang, Xishan Zhang, Shaohui Peng, Qi Yi, Zidong Du, Xing Hu, Qi Guo, Yunji Chen

In this paper, we propose to replace the state value function with a novel hindsight value function, which leverages the information from the future to reduce the variance of the gradient estimate for stochastic dynamic environments.

Policy Gradient Methods
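
For context on where a value function enters this picture, the sketch below shows the standard variance-reduction pattern in policy gradients: a baseline V(s) is subtracted from the return-to-go before weighting the log-probability gradients. The paper's hindsight value function, which additionally conditions on future information, is not reproduced here; this is only the generic baseline-subtraction setup it builds on.

```python
import numpy as np

def reinforce_gradient_weights(rewards, values, gamma=0.99):
    """Per-step advantage weights for a REINFORCE-style policy gradient.
    Subtracting a baseline value function from the return reduces the
    variance of the gradient estimate without changing its expectation.
    'values' would come from a learned V(s); the paper replaces it with a
    hindsight value function (not shown here)."""
    T = len(rewards)
    returns = np.zeros(T)
    running = 0.0
    for t in reversed(range(T)):              # discounted return-to-go
        running = rewards[t] + gamma * running
        returns[t] = running
    advantages = returns - np.asarray(values)  # baseline-subtracted weights
    return advantages                          # weights for the log pi(a_t|s_t) gradients

rewards = [0.0, 0.0, 1.0]
values  = [0.3, 0.5, 0.9]                      # toy baseline predictions
print(reinforce_gradient_weights(rewards, values))
```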

Domain-Specific Suppression for Adaptive Object Detection

no code implementations · CVPR 2021 · Yu Wang, Rui Zhang, Shuo Zhang, Miao Li, Yangyang Xia, Xishan Zhang, Shaoli Liu

The directions of weights and gradients can be divided into domain-specific and domain-invariant parts, and the goal of domain adaptation is to concentrate on the domain-invariant direction while eliminating the disturbance from the domain-specific one.

Domain Adaptation · Object +2
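
Purely as an illustration of the decomposition described in the excerpt, the sketch below splits a gradient into a component along a given direction and the orthogonal remainder, then damps the former. The direction d and the function name are hypothetical; how the domain-specific direction is actually identified is the paper's contribution and is not shown.

```python
import numpy as np

def suppress_along_direction(grad, d, alpha=0.0):
    """Split a gradient into the component along a (given, hypothetical)
    domain-specific direction d and the component orthogonal to it, then
    keep only a fraction alpha of the domain-specific part. Generic sketch
    of the decomposition described above; finding d is not shown."""
    d = d / np.linalg.norm(d)
    specific = np.dot(grad, d) * d          # projection onto d
    invariant = grad - specific             # orthogonal remainder
    return invariant + alpha * specific

g = np.array([1.0, 2.0, -0.5])
d = np.array([0.0, 1.0, 0.0])               # toy domain-specific direction
print(suppress_along_direction(g, d))       # [1.0, 0.0, -0.5]
```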

Fixed-Point Back-Propagation Training

no code implementations · CVPR 2020 · Xishan Zhang, Shaoli Liu, Rui Zhang, Chang Liu, Di Huang, Shiyi Zhou, Jiaming Guo, Qi Guo, Zidong Du, Tian Zhi, Yunji Chen

The recently emerged quantization technique (i.e., using low bit-width fixed-point data instead of high bit-width floating-point data) has been applied to the inference of deep neural networks for fast and efficient execution.

Image Classification · Machine Translation +4
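
As background on the low bit-width fixed-point data mentioned above, the following is a minimal, generic sketch of symmetric per-tensor quantization to signed integers; it is not the back-propagation training scheme proposed in the paper, and the function names are illustrative.

```python
import numpy as np

def quantize_symmetric(x, bits=8):
    """Map a float tensor to signed fixed-point integers with one per-tensor
    scale (symmetric quantization). Generic illustration of low bit-width
    fixed-point data, not the paper's back-propagation training scheme."""
    qmax = 2 ** (bits - 1) - 1                     # e.g. 127 for 8 bits
    scale = float(np.max(np.abs(x))) / qmax
    if scale == 0.0:                               # all-zero tensor
        scale = 1.0
    q = np.clip(np.round(x / scale), -qmax, qmax).astype(np.int32)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

x = np.array([0.8, -1.2, 0.05, 2.4], dtype=np.float32)
q, s = quantize_symmetric(x)
print(q)                   # fixed-point codes
print(dequantize(q, s))    # close to x, up to rounding error
```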

DWM: A Decomposable Winograd Method for Convolution Acceleration

no code implementations · 3 Feb 2020 · Di Huang, Xishan Zhang, Rui Zhang, Tian Zhi, Deyuan He, Jiaming Guo, Chang Liu, Qi Guo, Zidong Du, Shaoli Liu, Tianshi Chen, Yunji Chen

In this paper, we propose a novel Decomposable Winograd Method (DWM), which breaks through the limitation of the original Winograd minimal filtering algorithm and extends it to wide and general convolutions.
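
For reference, the minimal filtering algorithm that DWM generalizes is shown below in its simplest 1-D form, F(2,3), which produces two outputs of a 3-tap filter with four elementwise multiplications instead of six. These are the standard Winograd transform matrices; the decomposition step that handles large or strided kernels is the paper's contribution and is not reproduced here.

```python
import numpy as np

# Standard Winograd F(2,3) transform matrices.
BT = np.array([[1,  0, -1,  0],
               [0,  1,  1,  0],
               [0, -1,  1,  0],
               [0,  1,  0, -1]], dtype=float)
G  = np.array([[1.0,  0.0, 0.0],
               [0.5,  0.5, 0.5],
               [0.5, -0.5, 0.5],
               [0.0,  0.0, 1.0]])
AT = np.array([[1, 1,  1,  0],
               [0, 1, -1, -1]], dtype=float)

def winograd_f23(d, g):
    """Two outputs of the 3-tap correlation y_i = sum_k d[i+k] * g[k],
    using 4 elementwise multiplications instead of 6."""
    return AT @ ((G @ g) * (BT @ d))

d = np.array([1.0, 2.0, 3.0, 4.0])     # input tile of length 4
g = np.array([0.5, -1.0, 2.0])         # 3-tap filter
print(winograd_f23(d, g))              # [4.5, 6.0]
direct = [sum(d[i + k] * g[k] for k in range(3)) for i in range(2)]
print(direct)                          # matches the Winograd result
```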

Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description

no code implementations · CVPR 2017 · Xishan Zhang, Ke Gao, Yongdong Zhang, Dongming Zhang, Jintao Li, Qi Tian

This paper contributes: 1) the first in-depth study of the weakness inherent in data-driven static fusion methods for video captioning.

Video Captioning · Video Description
