Search Results for author: Liqun Yang

Found 9 papers, 3 papers with code

GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator

1 code implementation20 Dec 2022 Jian Yang, Shuming Ma, Li Dong, Shaohan Huang, Haoyang Huang, Yuwei Yin, Dongdong Zhang, Liqun Yang, Furu Wei, Zhoujun Li

Inspired by the idea of Generative Adversarial Networks (GANs), we propose a GAN-style model for encoder-decoder pre-training by introducing an auxiliary discriminator, unifying the ability of language understanding and generation in a single model.

Denoising Sentence +1

Knowledge Distillation based Contextual Relevance Matching for E-commerce Product Search

no code implementations4 Oct 2022 Ziyang Liu, Chaokun Wang, Hao Feng, Lingfei Wu, Liqun Yang

In this paper, we design an efficient knowledge distillation framework for e-commerce relevance matching to integrate the respective advantages of Transformer-style models and classical relevance matching models.

Knowledge Distillation

GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation

1 code implementation29 Jul 2022 Jian Yang, Yuwei Yin, Liqun Yang, Shuming Ma, Haoyang Huang, Dongdong Zhang, Furu Wei, Zhoujun Li

Transformer structure, stacked by a sequence of encoder and decoder network layers, achieves significant development in neural machine translation.

Machine Translation Translation

The distance between the weights of the neural network is meaningful

no code implementations31 Jan 2021 Liqun Yang, Yijun Yang, Yao Wang, Zhenyu Yang, Wei Zeng

In the application of neural networks, we need to select a suitable model based on the problem complexity and the dataset scale.

Cannot find the paper you are looking for? You can Submit a new open access paper.