1 code implementation • 21 Mar 2024 • Qiushi Sun, Zhirui Chen, Fangzhi Xu, Kanzhi Cheng, Chang Ma, Zhangyue Yin, Jianing Wang, Chengcheng Han, Renyu Zhu, Shuai Yuan, Qipeng Guo, Xipeng Qiu, Pengcheng Yin, XiaoLi Li, Fei Yuan, Lingpeng Kong, Xiang Li, Zhiyong Wu
Building on our examination of the developmental trajectories, we further investigate the emerging synergies between code intelligence and broader machine intelligence, uncovering new cross-domain opportunities and illustrating the substantial influence of code intelligence across various domains.
no code implementations • 29 Feb 2024 • Qidan Zhu, Jing Li, Fei Yuan, Quan Gan
Changes in facial expression, head movement, body movement, and gesture are remarkable cues in sign language recognition. However, most current continuous sign language recognition (CSLR) methods focus on static images in video sequences at the frame-level feature extraction stage, while ignoring the dynamic changes across images.
no code implementations • 5 Feb 2024 • Fei Yuan, Chang Ma, Shuai Yuan, Qiushi Sun, Lei LI
We further theoretically prove that KS-Lottery can find certified winning tickets in the embedding layer; fine-tuning on the found parameters is guaranteed to perform as well as full fine-tuning.
1 code implementation • 15 Jan 2024 • Wenhao Zhu, ShuJian Huang, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch
A typical solution is to translate instruction data into all languages of interest, and then train on the resulting multilingual data, which is called translate-training.
no code implementations • 15 Nov 2023 • Fei Yuan, Shuai Yuan, Zhiyong Wu, Lei LI
Large Language Models (LLMs), trained predominantly on extensive English data, often exhibit limitations when applied to other languages.
no code implementations • 15 Nov 2023 • Fangzhi Xu, Zhiyong Wu, Qiushi Sun, Siyu Ren, Fei Yuan, Shuai Yuan, Qika Lin, Yu Qiao, Jun Liu
Although Large Language Models (LLMs) demonstrate remarkable ability in processing and generating human-like text, they have limitations when it comes to comprehending and expressing world knowledge that extends beyond the boundaries of natural language (e.g., chemical molecular formulas).
2 code implementations • 9 Aug 2023 • Wenhao Zhu, Yunzhe Lv, Qingxiu Dong, Fei Yuan, Jingjing Xu, ShuJian Huang, Lingpeng Kong, Jiajun Chen, Lei LI
We start by targeting individual languages, performing cross-lingual instruction-tuning (CoIT) on LLaMA, i.e., tuning it with translation task data and cross-lingual general task data to obtain cross-lingual models (x-LLaMAs), and we formulate underlying scaling laws to investigate the advantages of using scalable translation data.
no code implementations • 24 May 2023 • Huang Bojun, Fei Yuan
In this perspective, training of the neural network corresponds to a utility learning process.
no code implementations • 22 May 2023 • Bohong Wu, Fei Yuan, Hai Zhao, Lei LI, Jingjing Xu
Considering that encoder-based models have the advantages of efficient generation and self-correction, this paper explores methods to endow multilingual understanding models with generation abilities, yielding a unified model.
no code implementations • 13 Mar 2023 • Qidan Zhu, Jing Li, Fei Yuan, Quan Gan
It is then used to combine cross-resolution knowledge distillation and traditional knowledge distillation methods to form a CSLR model based on cross-resolution knowledge distillation (CRKD).
1 code implementation • 20 Dec 2022 • Fei Yuan, Yinquan Lu, Wenhao Zhu, Lingpeng Kong, Lei LI, Yu Qiao, Jingjing Xu
To address the needs of learning representations for all languages in a unified space, we propose a novel efficient training recipe, upon which we build an effective detachable model, Lego-MT.
no code implementations • 7 Nov 2022 • Qidan Zhu, Jing Li, Fei Yuan, Quan Gan
The ultimate goal of continuous sign language recognition (CSLR) is to facilitate communication between hearing-impaired and hearing people, which requires a certain degree of real-time performance and deployability from the model.
no code implementations • 3 Jul 2022 • Qidan Zhu, Jing Li, Fei Yuan, Quan Gan
The sparse frame-level features are fused with the features obtained from the two designed branches to reconstruct a dense frame-level feature sequence, and connectionist temporal classification (CTC) loss is used for training and optimization after the temporal feature extraction part.
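For readers unfamiliar with the CTC objective mentioned in this abstract, the sketch below implements the standard CTC forward algorithm in NumPy (a minimal textbook version, not the paper's implementation): it sums the probability of every frame-level alignment that collapses to the target label sequence, and returns the negative log-likelihood.

```python
import numpy as np

def ctc_loss(probs, target, blank=0):
    """Negative log-likelihood of `target` under the CTC alignment model.
    probs: (T, C) array of per-frame label probabilities; target: label ids."""
    # Extend the target with blanks: [b, l1, b, l2, ..., b]
    ext = [blank]
    for label in target:
        ext += [label, blank]
    S, T = len(ext), probs.shape[0]
    alpha = np.zeros((T, S))  # alpha[t, s]: prob. of reaching ext[s] at frame t
    alpha[0, 0] = probs[0, blank]
    alpha[0, 1] = probs[0, ext[1]]
    for t in range(1, T):
        for s in range(S):
            a = alpha[t - 1, s]
            if s > 0:
                a += alpha[t - 1, s - 1]
            # Skipping a blank is allowed only between two distinct labels
            if s > 1 and ext[s] != blank and ext[s] != ext[s - 2]:
                a += alpha[t - 1, s - 2]
            alpha[t, s] = a * probs[t, ext[s]]
    # Valid paths end on the last label or the trailing blank
    return -np.log(alpha[T - 1, S - 1] + alpha[T - 1, S - 2])
```

For example, with two frames of uniform probabilities over {blank, 1}, the paths "11", "1-", and "-1" all collapse to the target [1], so the total path probability is 0.75. Production systems typically compute this in log-space (e.g., `torch.nn.CTCLoss`) for numerical stability.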
no code implementations • 8 Apr 2022 • Qidan Zhu, Jing Li, Fei Yuan, Quan Gan
The time-wise feature extraction part performs temporal feature learning by first extracting temporal receptive-field features at different scales with the proposed multi-scale temporal block (MST-block) to improve temporal modeling capability, and then further encoding these multi-scale temporal features with a Transformer module to obtain more accurate temporal features.
no code implementations • 13 Mar 2021 • Fei Yuan, Longtu Zhang, Huang Bojun, Yaobo Liang
In most machine learning tasks, we evaluate a model $M$ on a given data population $S$ by measuring a population-level metric $F(S;M)$.
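To make the notion of a population-level metric $F(S;M)$ concrete, the hedged sketch below (illustrative only, not from the paper) contrasts accuracy, which decomposes into a mean of per-sample scores, with F1, which depends on the population as a whole and does not decompose that way.

```python
import numpy as np

def accuracy(preds, labels):
    # A population-level metric F(S; M) that happens to decompose
    # into the mean of per-sample 0/1 scores.
    return np.mean(preds == labels)

def f1(preds, labels, positive=1):
    # A population-level metric that is NOT a mean of per-sample scores:
    # it is a ratio of population-level counts.
    tp = np.sum((preds == positive) & (labels == positive))
    fp = np.sum((preds == positive) & (labels != positive))
    fn = np.sum((preds != positive) & (labels == positive))
    return 2 * tp / (2 * tp + fp + fn)

preds = np.array([1, 0, 1, 1])
labels = np.array([1, 1, 1, 0])
print(accuracy(preds, labels))  # mean of per-sample indicators
print(f1(preds, labels))        # computed from population-level counts
```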
no code implementations • 11 Dec 2020 • Fei Yuan, Linjun Shou, Jian Pei, Wutao Lin, Ming Gong, Yan Fu, Daxin Jiang
When multiple teacher models are available for distillation, state-of-the-art methods assign a fixed weight to each teacher model throughout the entire distillation process.
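The fixed-weight baseline that this abstract contrasts with can be sketched as follows (an illustrative toy, not the paper's method): each teacher's logits are softened with a temperature, and the student's soft target is a static weighted average of the teachers' distributions, with the same weights for every training example.

```python
import numpy as np

def softmax(z, temperature=1.0):
    z = np.asarray(z, dtype=float) / temperature
    z = z - z.max(axis=-1, keepdims=True)  # for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def fixed_weight_target(teacher_logits, weights, temperature=2.0):
    """Soft target for the student: a static weighted mixture of the
    teachers' temperature-softened output distributions."""
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()  # normalize so the mixture is a distribution
    probs = np.stack([softmax(l, temperature) for l in teacher_logits])
    return np.tensordot(w, probs, axes=1)
```

With two teachers whose logits mirror each other and equal weights, the mixture is uniform; adaptive schemes instead re-weight teachers per example, e.g., by each teacher's confidence or agreement with the gold label.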
no code implementations • ACL 2020 • Fei Yuan, Linjun Shou, Xuanyu Bai, Ming Gong, Yaobo Liang, Nan Duan, Yan Fu, Daxin Jiang
Multilingual pre-trained models can leverage training data from a rich source language (such as English) to improve performance on low-resource languages.