Search Results for author: Yongchang Hao

Found 6 papers, 4 papers with code

Ginger: An Efficient Curvature Approximation with Linear Complexity for General Neural Networks

no code implementations5 Feb 2024 Yongchang Hao, Yanshuai Cao, Lili Mou

The major reason is due to the quadratic memory and cubic time complexity to compute the inverse of the matrix.

Flora: Low-Rank Adapters Are Secretly Gradient Compressors

1 code implementation5 Feb 2024 Yongchang Hao, Yanshuai Cao, Lili Mou

Despite large neural networks demonstrating remarkable abilities to complete different tasks, they require excessive memory usage to store the optimization states for training.

Teacher Forcing Recovers Reward Functions for Text Generation

1 code implementation17 Oct 2022 Yongchang Hao, Yuxin Liu, Lili Mou

We additionally propose a simple modification to stabilize the RL training on non-parallel datasets with our induced reward function.

regression reinforcement-learning +2

An Equal-Size Hard EM Algorithm for Diverse Dialogue Generation

2 code implementations29 Sep 2022 Yuqiao Wen, Yongchang Hao, Yanshuai Cao, Lili Mou

Open-domain dialogue systems aim to interact with humans through natural language texts in an open-ended fashion.

Dialogue Generation

Understanding and Improving Sequence-to-Sequence Pretraining for Neural Machine Translation

no code implementations ACL 2022 Wenxuan Wang, Wenxiang Jiao, Yongchang Hao, Xing Wang, Shuming Shi, Zhaopeng Tu, Michael Lyu

In this paper, we present a substantial step in better understanding the SOTA sequence-to-sequence (Seq2Seq) pretraining for neural machine translation~(NMT).

Machine Translation NMT +1

Multi-Task Learning with Shared Encoder for Non-Autoregressive Machine Translation

1 code implementation NAACL 2021 Yongchang Hao, Shilin He, Wenxiang Jiao, Zhaopeng Tu, Michael Lyu, Xing Wang

In addition, experimental results demonstrate that our Multi-Task NAT is complementary to knowledge distillation, the standard knowledge transfer method for NAT.

Knowledge Distillation Machine Translation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.