Search Results for author: Tong Xiao

Reinforcement learning with human feedback for aligning large language models (LLMs) trains a reward model typically using ranking loss with comparison pairs. However, the training procedure suffers from an inherent problem: the uncontrolled scaling of reward scores during reinforcement learning due to the lack of constraints while training the reward model. This paper proposes a Prior Constraints-based Reward Model (namely PCRM) training method to mitigate this problem.

reinforcement-learning

Paper
Code

RankPrompt: Step-by-Step Comparisons Make Language Models Better Reasoners

no code implementations • 19 Mar 2024 • Chi Hu, Yuan Ge, Xiangnan Ma, Hang Cao, Qiang Li, Yonghua Yang, Tong Xiao, Jingbo Zhu

Our experiments across 11 arithmetic and commonsense reasoning tasks show that RankPrompt significantly enhances the reasoning performance of ChatGPT and GPT-4, with improvements of up to 13%.

Paper
Add Code

Large Language Models are Parallel Multilingual Learners

1 code implementation • 14 Mar 2024 • Yongyu Mu, Peinan Feng, Zhiquan Cao, Yuzhang Wu, Bei Li, Chenglong Wang, Tong Xiao, Kai Song, Tongran Liu, Chunliang Zhang, Jingbo Zhu

In this study, we reveal an in-context learning (ICL) capability of multilingual large language models (LLMs): by translating the input to several languages, we provide Parallel Input in Multiple Languages (PiM) to LLMs, which significantly enhances their comprehension abilities.

In-Context Learning

Paper
Code

Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation

1 code implementation • 28 Feb 2024 • Yuan Ge, Yilun Liu, Chi Hu, Weibin Meng, Shimin Tao, Xiaofeng Zhao, Hongxia Ma, Li Zhang, Hao Yang, Tong Xiao

The second step involves preserving dataset diversity through a clustering process. In our experiment, CaR selected a subset containing only 1. 96% of Alpaca's IT data, yet the underlying AlpaCaR model trained on this subset outperforms Alpaca by an average of 32. 1% in GPT-4 evaluations.

Clustering

Paper
Code

Effect of target signals and delays on spatially selective active noise control for open-fitting hearables

no code implementations • 15 Jan 2024 • Tong Xiao, Simon Doclo

Spatially selective active noise control (ANC) hearables are designed to reduce unwanted noise from certain directions while preserving desired sounds from other directions.

Paper
Add Code

Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis

no code implementations • 20 Dec 2023 • Bichen Wu, Ching-Yao Chuang, Xiaoyan Wang, Yichen Jia, Kapil Krishnakumar, Tong Xiao, Feng Liang, Licheng Yu, Peter Vajda

In this paper, we introduce Fairy, a minimalist yet robust adaptation of image-editing diffusion models, enhancing them for video editing applications.

Data Augmentation Video Editing +1

Paper
Add Code

Soft Alignment of Modality Space for End-to-end Speech Translation

no code implementations • 18 Dec 2023 • Yuhao Zhang, Kaiqi Kou, Bei Li, Chen Xu, Chunliang Zhang, Tong Xiao, Jingbo Zhu

End-to-end Speech Translation (ST) aims to convert speech into target text within a unified model.

Cross-Lingual Transfer Translation

Paper
Add Code

Introduction to Transformers: an NLP Perspective

1 code implementation • 29 Nov 2023 • Tong Xiao, Jingbo Zhu

Transformers have dominated empirical machine learning models of natural language processing.

Paper
Code

Rethinking and Improving Multi-task Learning for End-to-end Speech Translation

1 code implementation • 7 Nov 2023 • Yuhao Zhang, Chen Xu, Bei Li, Hao Chen, Tong Xiao, Chunliang Zhang, Jingbo Zhu

Significant improvements in end-to-end speech translation (ST) have been achieved through the application of multi-task learning.

Multi-Task Learning

Paper
Code

Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs

1 code implementation • 26 Oct 2023 • Yuxin Zuo, Bei Li, Chuanhao Lv, Tong Zheng, Tong Xiao, Jingbo Zhu

This paper presents an in-depth study of multimodal machine translation (MMT), examining the prevailing understanding that MMT systems exhibit decreased sensitivity to visual information when text inputs are complete.

Attribute Multimodal Machine Translation +2

Paper
Code

PartialFormer: Modeling Part Instead of Whole

1 code implementation • 23 Oct 2023 • Tong Zheng, Bei Li, Huiwen Bao, Weiqiao Shan, Tong Xiao, Jingbo Zhu

The design choices in Transformer feed-forward neural networks have resulted in significant computational and parameter overhead.

Ranked #23 on Machine Translation on WMT2014 English-German

Abstractive Text Summarization Machine Translation +1

Paper
Code

Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition

1 code implementation • 21 Sep 2023 • Chen Xu, Xiaoqian Liu, Erfeng He, Yuhao Zhang, Qianqian Dong, Tong Xiao, Jingbo Zhu, Dapeng Man, Wu Yang

In this study, we present synchronous bilingual Connectionist Temporal Classification (CTC), an innovative framework that leverages dual CTC to bridge the gaps of both modality and language in the speech translation (ST) task.

speech-recognition Speech Recognition +1

Paper
Code

Learning Evaluation Models from Large Language Models for Sequence Generation

no code implementations • 8 Aug 2023 • Chenglong Wang, Hang Zhou, Kaiyan Chang, Tongran Liu, Chunliang Zhang, Quan Du, Tong Xiao, Jingbo Zhu

Large language models achieve state-of-the-art performance on sequence generation evaluation, but typically have a large number of parameters.

Machine Translation Style Transfer +1

Paper
Add Code

ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation

3 code implementations • 4 Aug 2023 • Chenglong Wang, Hang Zhou, Yimin Hu, Yifu Huo, Bei Li, Tongran Liu, Tong Xiao, Jingbo Zhu

Applying Reinforcement Learning (RL) to sequence generation models enables the direct optimization of long-term rewards (\textit{e. g.,} BLEU and human feedback), but typically requires large-scale sampling over a space of action sequences.

Abstractive Text Summarization Language Modelling +5

16,646

Paper
Code

Towards Robust Aspect-based Sentiment Analysis through Non-counterfactual Augmentations

no code implementations • 24 Jun 2023 • Xinyu Liu, Yan Ding, Kaikai An, Chunyang Xiao, Pranava Madhyastha, Tong Xiao, Jingbo Zhu

While state-of-the-art NLP models have demonstrated excellent performance for aspect based sentiment analysis (ABSA), substantial evidence has been presented on their lack of robustness.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Paper
Add Code

Recent Advances in Direct Speech-to-text Translation

no code implementations • 20 Jun 2023 • Chen Xu, Rong Ye, Qianqian Dong, Chengqi Zhao, Tom Ko, Mingxuan Wang, Tong Xiao, Jingbo Zhu

Recently, speech-to-text translation has attracted more and more attention and many studies have emerged rapidly.

Data Augmentation Knowledge Distillation +2

Paper
Add Code

Understanding Parameter Sharing in Transformers

no code implementations • 15 Jun 2023 • Ye Lin, Mingxuan Wang, Zhexi Zhang, Xiaohui Wang, Tong Xiao, Jingbo Zhu

Inspired by this, we tune the training hyperparameters related to model convergence in a targeted manner.

Machine Translation

Paper
Add Code

Modality Adaption or Regularization? A Case Study on End-to-End Speech Translation

1 code implementation • 13 Jun 2023 • Yuchen Han, Chen Xu, Tong Xiao, Jingbo Zhu

Pre-training and fine-tuning is a paradigm for alleviating the data scarcity problem in end-to-end speech translation (E2E ST).

Paper
Code

MobileNMT: Enabling Translation in 15MB and 30ms

1 code implementation • 7 Jun 2023 • Ye Lin, Xiaohui Wang, Zhexi Zhang, Mingxuan Wang, Tong Xiao, Jingbo Zhu

With the co-design of model and engine, compared with the existing system, we speed up 47. 0x and save 99. 5% of memory with only 11. 6% loss of BLEU.

Model Compression NMT +2

Paper
Code

Deliberate then Generate: Enhanced Prompting Framework for Text Generation

no code implementations • 31 May 2023 • Bei Li, Rui Wang, Junliang Guo, Kaitao Song, Xu Tan, Hany Hassan, Arul Menezes, Tong Xiao, Jiang Bian, Jingbo Zhu

Large language models (LLMs) have shown remarkable success across a wide range of natural language generation tasks, where proper prompt designs make great impacts.

Text Generation

Paper
Add Code

CTC-based Non-autoregressive Speech Translation

1 code implementation • 27 May 2023 • Chen Xu, Xiaoqian Liu, Xiaowen Liu, Qingxuan Sun, Yuhao Zhang, Murun Yang, Qianqian Dong, Tom Ko, Mingxuan Wang, Tong Xiao, Anxiang Ma, Jingbo Zhu

Combining end-to-end speech translation (ST) and non-autoregressive (NAR) generation is promising in language and speech processing for their advantages of less error propagation and low latency.

Translation

Paper
Code

Bridging the Granularity Gap for Acoustic Modeling

1 code implementation • 27 May 2023 • Chen Xu, Yuhao Zhang, Chengbo Jiao, Xiaoqian Liu, Chi Hu, Xin Zeng, Tong Xiao, Anxiang Ma, Huizhen Wang, Jingbo Zhu

While Transformer has become the de-facto standard for speech, modeling upon the fine-grained frame-level features remains an open challenge of capturing long-distance dependencies and distributing the attention weights.

speech-recognition Speech Recognition

Paper
Code

Augmenting Large Language Model Translators via Translation Memories

no code implementations • 27 May 2023 • Yongyu Mu, Abudurexiti Reheman, Zhiquan Cao, Yuchun Fan, Bei Li, Yinqiao Li, Tong Xiao, Chunliang Zhang, Jingbo Zhu

Using translation memories (TMs) as prompts is a promising approach to in-context learning of machine translation models.

In-Context Learning Language Modelling +4

Paper
Add Code

TranSFormer: Slow-Fast Transformer for Machine Translation

no code implementations • 26 May 2023 • Bei Li, Yi Jing, Xu Tan, Zhen Xing, Tong Xiao, Jingbo Zhu

Learning multiscale Transformer models has been evidenced as a viable approach to augmenting machine translation systems.

Machine Translation Translation

Paper
Add Code

Multi-Path Transformer is Better: A Case Study on Neural Machine Translation

no code implementations • 10 May 2023 • Ye Lin, Shuhan Zhou, Yanyang Li, Anxiang Ma, Tong Xiao, Jingbo Zhu

For years the model performance in machine learning obeyed a power-law relationship with the model size.

Machine Translation

Paper
Add Code

Character, Word, or Both? Revisiting the Segmentation Granularity for Chinese Pre-trained Language Models

1 code implementation • 20 Mar 2023 • Xinnian Liang, Zefan Zhou, Hui Huang, Shuangzhi Wu, Tong Xiao, Muyun Yang, Zhoujun Li, Chao Bian

We conduct extensive experiments on various Chinese NLP tasks to evaluate existing PLMs as well as the proposed MigBERT.

Paper
Code

A Novel Approach for Auto-Formulation of Optimization Problems

no code implementations • 9 Feb 2023 • Yuting Ning, Jiayu Liu, Longhu Qin, Tong Xiao, Shangzi Xue, Zhenya Huang, Qi Liu, Enhong Chen, Jinze Wu

In the Natural Language for Optimization (NL4Opt) NeurIPS 2022 competition, competitors focus on improving the accessibility and usability of optimization solvers, with the aim of subtask 1: recognizing the semantic entities that correspond to the components of the optimization problem; subtask 2: generating formulations for the optimization problem.

Ensemble Learning named-entity-recognition +2

Paper
Add Code

Improved Knowledge Distillation for Pre-trained Language Models via Knowledge Selection

no code implementations • 1 Feb 2023 • Chenglong Wang, Yi Lu, Yongyu Mu, Yimin Hu, Tong Xiao, Jingbo Zhu

Knowledge distillation addresses the problem of transferring knowledge from a teacher model to a student model.

Knowledge Distillation

Paper
Add Code

Prompting Neural Machine Translation with Translation Memories

no code implementations • 13 Jan 2023 • Abudurexiti Reheman, Tao Zhou, Yingfeng Luo, Di Yang, Tong Xiao, Jingbo Zhu

Improving machine translation (MT) systems with translation memories (TMs) is of great interest to practitioners in the MT community.

Machine Translation NMT +1

Paper
Add Code

EIT: Enhanced Interactive Transformer

1 code implementation • 20 Dec 2022 • Tong Zheng, Bei Li, Huiwen Bao, Tong Xiao, Jingbo Zhu

In this paper, we propose a novel architecture, the Enhanced Interactive Transformer (EIT), to address the issue of head degradation in self-attention mechanisms.

Abstractive Text Summarization Language Modelling +2

Paper
Code

Improving End-to-end Speech Translation by Leveraging Auxiliary Speech and Text Data

no code implementations • 4 Dec 2022 • Yuhao Zhang, Chen Xu, Bojie Hu, Chunliang Zhang, Tong Xiao, Jingbo Zhu

We present a method for introducing a text encoder into pre-trained end-to-end speech translation systems.

Denoising Translation

Paper
Add Code

Spatially Selective Active Noise Control Systems

no code implementations • 22 Aug 2022 • Tong Xiao, Buye Xu, Chuming Zhao

In this work, we propose a multi-channel ANC system that only reduces sound from undesired directions, and the system truly preserves the desired sound instead of reproducing it.

Paper
Add Code

Learning Multiscale Transformer Models for Sequence Generation

1 code implementation • 19 Jun 2022 • Bei Li, Tong Zheng, Yi Jing, Chengbo Jiao, Tong Xiao, Jingbo Zhu

In this work, we define those scales in different linguistic units, including sub-words, words and phrases.

Paper
Code

On Vision Features in Multimodal Machine Translation

2 code implementations • ACL 2022 • Bei Li, Chuanhao Lv, Zefan Zhou, Tao Zhou, Tong Xiao, Anxiang Ma, Jingbo Zhu

Previous work on multimodal machine translation (MMT) has focused on the way of incorporating vision features into translation but little attention is on the quality of vision models.

Image Captioning Multimodal Machine Translation +3

Paper
Code

ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generation

1 code implementation • ACL 2022 • Bei Li, Quan Du, Tao Zhou, Yi Jing, Shuhan Zhou, Xin Zeng, Tong Xiao, Jingbo Zhu, Xuebo Liu, Min Zhang

Inspired by this, we design a new architecture, {\it ODE Transformer}, which is analogous to the Runge-Kutta method that is well motivated in ODE.

Abstractive Text Summarization Machine Translation +1

Paper
Code

The NiuTrans Machine Translation Systems for WMT21

no code implementations • WMT (EMNLP) 2021 • Shuhan Zhou, Tao Zhou, Binghao Wei, Yingfeng Luo, Yongyu Mu, Zefan Zhou, Chenglong Wang, Xuanjun Zhou, Chuanhao Lv, Yi Jing, Laohu Wang, Jingnan Zhang, Canan Huang, Zhongxiang Yan, Chi Hu, Bei Li, Tong Xiao, Jingbo Zhu

This paper describes NiuTrans neural machine translation systems of the WMT 2021 news translation tasks.

Knowledge Distillation Machine Translation +1

Paper
Add Code

The NiuTrans System for the WMT21 Efficiency Task

1 code implementation • 16 Sep 2021 • Chenglong Wang, Chi Hu, Yongyu Mu, Zhongxiang Yan, Siming Wu, Minyi Hu, Hang Cao, Bei Li, Ye Lin, Tong Xiao, Jingbo Zhu

This paper describes the NiuTrans system for the WMT21 translation efficiency task (http://statmt. org/wmt21/efficiency-task. html).

Knowledge Distillation Translation

129

Paper
Code

The NiuTrans System for WNGT 2020 Efficiency Task

2 code implementations • WS 2020 • Chi Hu, Bei Li, Ye Lin, Yinqiao Li, Yanyang Li, Chenglong Wang, Tong Xiao, Jingbo Zhu

This paper describes the submissions of the NiuTrans Team to the WNGT 2020 Efficiency Shared Task.

Knowledge Distillation Machine Translation +2

377

Paper
Code

RankNAS: Efficient Neural Architecture Search by Pairwise Ranking

no code implementations • EMNLP 2021 • Chi Hu, Chenglong Wang, Xiangnan Ma, Xia Meng, Yinqiao Li, Tong Xiao, Jingbo Zhu, Changliang Li

This paper addresses the efficiency challenge of Neural Architecture Search (NAS) by formulating the task as a ranking problem.

Language Modelling Machine Translation +2

Paper
Add Code

Bag of Tricks for Optimizing Transformer Efficiency

1 code implementation • Findings (EMNLP) 2021 • Ye Lin, Yanyang Li, Tong Xiao, Jingbo Zhu

Improving Transformer efficiency has become increasingly attractive recently.

Quantization Translation

Paper
Code

The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task

no code implementations • ACL (IWSLT) 2021 • Chen Xu, Xiaoqian Liu, Xiaowen Liu, Laohu Wang, Canan Huang, Tong Xiao, Jingbo Zhu

This paper describes the submission of the NiuTrans end-to-end speech translation system for the IWSLT 2021 offline task, which translates from the English audio to German text directly without intermediate transcription.

Position Translation

Paper
Add Code

Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders

no code implementations • ACL 2021 • Chen Xu, Bojie Hu, Yanyang Li, Yuhao Zhang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu

To our knowledge, we are the first to develop an end-to-end ST system that achieves comparable or even better BLEU performance than the cascaded ST counterpart when large-scale ASR and MT data is available.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

ODE Transformer: An Ordinary Differential Equation-Inspired Model for Neural Machine Translation

no code implementations • 6 Apr 2021 • Bei Li, Quan Du, Tao Zhou, Shuhan Zhou, Xin Zeng, Tong Xiao, Jingbo Zhu

We show that a residual block of layers in Transformer can be described as a higher-order solution to ODEs.

Machine Translation Translation

Paper
Add Code

Non-Autoregressive Translation by Learning Target Categorical Codes

1 code implementation • NAACL 2021 • Yu Bao, ShuJian Huang, Tong Xiao, Dongqi Wang, Xinyu Dai, Jiajun Chen

Non-autoregressive Transformer is a promising text generation model.

Ranked #7 on Machine Translation on WMT2014 German-English

Attribute Machine Translation +2

Paper
Code

An Efficient Transformer Decoder with Compressed Sub-layers

no code implementations • 3 Jan 2021 • Yanyang Li, Ye Lin, Tong Xiao, Jingbo Zhu

The large attention-based encoder-decoder network (Transformer) has become prevailing recently due to its effectiveness.

Machine Translation Translation

Paper
Add Code

Learning Light-Weight Translation Models from Deep Transformer

1 code implementation • 27 Dec 2020 • Bei Li, Ziyang Wang, Hui Liu, Quan Du, Tong Xiao, Chunliang Zhang, Jingbo Zhu

We proposed a novel group-permutation based knowledge distillation approach to compressing the deep Transformer model into a shallow model.

Knowledge Distillation Machine Translation +2

Paper
Code

Dynamic Curriculum Learning for Low-Resource Neural Machine Translation

no code implementations • COLING 2020 • Chen Xu, Bojie Hu, Yufan Jiang, Kai Feng, Zeyang Wang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu

This eases training by highlighting easy samples that the current model has enough competence to learn.

Low-Resource Neural Machine Translation NMT +1

Paper
Add Code

A Simple and Effective Approach to Robust Unsupervised Bilingual Dictionary Induction

no code implementations • COLING 2020 • Yanyang Li, Yingfeng Luo, Ye Lin, Quan Du, Huizhen Wang, ShuJian Huang, Tong Xiao, Jingbo Zhu

Our experiments show that this simple method does not hamper the performance of similar language pairs and achieves an accuracy of 13. 64~55. 53% between English and four distant languages, i. e., Chinese, Japanese, Vietnamese and Thai.

Dimensionality Reduction Self-Learning

Paper
Add Code

Layer-Wise Multi-View Learning for Neural Machine Translation

no code implementations • COLING 2020 • Qiang Wang, Changliang Li, Yue Zhang, Tong Xiao, Jingbo Zhu

In this way, in addition to the topmost encoder layer (referred to as the primary view), we also incorporate an intermediate encoder layer as the auxiliary view.

Machine Translation MULTI-VIEW LEARNING +2

Paper
Add Code

Training Flexible Depth Model by Multi-Task Learning for Neural Machine Translation

no code implementations • Findings of the Association for Computational Linguistics 2020 • Qiang Wang, Tong Xiao, Jingbo Zhu

The standard neural machine translation model can only decode with the same depth configuration as training.

Machine Translation Multi-Task Learning +1

Paper
Add Code

Shallow-to-Deep Training for Neural Machine Translation

1 code implementation • EMNLP 2020 • Bei Li, Ziyang Wang, Hui Liu, Yufan Jiang, Quan Du, Tong Xiao, Huizhen Wang, Jingbo Zhu

We find that stacking layers is helpful in improving the representation ability of NMT models and adjacent layers perform similarly.

Machine Translation NMT +2

Paper
Code

Weight Distillation: Transferring the Knowledge in Neural Network Parameters

no code implementations • ACL 2021 • Ye Lin, Yanyang Li, Ziyang Wang, Bei Li, Quan Du, Tong Xiao, Jingbo Zhu

Inspired by this, we investigate methods of model acceleration and compression in another line of research.

Knowledge Distillation Machine Translation +1

Paper
Add Code

Towards Fully 8-bit Integer Inference for the Transformer Model

no code implementations • 17 Sep 2020 • Ye Lin, Yanyang Li, Tengbo Liu, Tong Xiao, Tongran Liu, Jingbo Zhu

8-bit integer inference, as a promising direction in reducing both the latency and storage of deep neural networks, has made great progress recently.

Language Modelling Quantization +1

Paper
Add Code

Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild

no code implementations • ECCV 2020 • Alexander Grabner, Yaming Wang, Peizhao Zhang, Peihong Guo, Tong Xiao, Peter Vajda, Peter M. Roth, Vincent Lepetit

We present a novel 3D pose refinement approach based on differentiable rendering for objects of arbitrary categories in the wild.

Paper
Add Code

MOOCCube: A Large-scale Data Repository for NLP Applications in MOOCs

no code implementations • ACL 2020 • Jifan Yu, Gan Luo, Tong Xiao, Qingyang Zhong, Yuquan Wang, Wenzheng Feng, Junyi Luo, Chenyu Wang, Lei Hou, Juanzi Li, Zhiyuan Liu, Jie Tang

The prosperity of Massive Open Online Courses (MOOCs) provides fodder for many NLP and AI research for education applications, e. g., course concept extraction, prerequisite relation discovery, etc.

Paper
Add Code

Towards Differentially Private Text Representations

no code implementations • 25 Jun 2020 • Lingjuan Lyu, Yitong Li, Xuanli He, Tong Xiao

Most deep learning frameworks require users to pool their local data or model updates to a trusted server to train or maintain a global model.

Paper
Add Code

Attention: to Better Stand on the Shoulders of Giants

no code implementations • 27 May 2020 • Sha Yuan, Zhou Shao, Yu Zhang, Xingxing Wei, Tong Xiao, Yifan Wang, Jie Tang

In the progress of science, the previously discovered knowledge principally inspires new scientific ideas, and citation is a reasonably good reflection of this cumulative nature of scientific research.

Paper
Add Code

Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation

1 code implementation • ACL 2020 • Bei Li, Hui Liu, Ziyang Wang, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu, Changliang Li

In encoder-decoder neural models, multiple encoders are in general used to represent the contextual information in addition to the individual sentence.

Machine Translation NMT +2

Paper
Code

Learning Architectures from an Extended Search Space for Language Modeling

no code implementations • ACL 2020 • Yinqiao Li, Chi Hu, Yuhao Zhang, Nuo Xu, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu, Changliang Li

Neural architecture search (NAS) has advanced significantly in recent years but most NAS systems restrict search to learning architectures of a recurrent or convolutional cell.

Chunking Language Modelling +4

Paper
Add Code

Neural Machine Translation with Joint Representation

1 code implementation • 16 Feb 2020 • Yanyang Li, Qiang Wang, Tong Xiao, Tongran Liu, Jingbo Zhu

Though early successes of Statistical Machine Translation (SMT) systems are attributed in part to the explicit modelling of the interaction between any two source and target units, e. g., alignment, the recent Neural Machine Translation (NMT) systems resort to the attention which partially encodes the interaction for efficiency.

Machine Translation NMT +1

Paper
Code

Multi-layer Representation Fusion for Neural Machine Translation

1 code implementation • COLING 2018 • Qiang Wang, Fuxue Li, Tong Xiao, Yanyang Li, Yinqiao Li, Jingbo Zhu

In this paper, we propose a multi-layer representation fusion (MLRF) approach to fusing stacked layers.

Machine Translation Sentence +1

Paper
Code

Improved Differentiable Architecture Search for Language Modeling and Named Entity Recognition

1 code implementation • IJCNLP 2019 • Yufan Jiang, Chi Hu, Tong Xiao, Chunliang Zhang, Jingbo Zhu

In this paper, we study differentiable neural architecture search (NAS) methods for natural language processing.

Ranked #1 on Language Modelling on PTB Diagnostic ECG Database

Language Modelling named-entity-recognition +3

Paper
Code

Ultra-broadband local active noise control with remote acoustic sensing

no code implementations • 8 Sep 2019 • Tong Xiao, Xiaojun Qiu, Benjamin Halkon

One enduring challenge for controlling high frequency sound in local active noise control (ANC) systems is to obtain the acoustic signal at the specific location to be controlled.

Paper
Add Code

The NiuTrans Machine Translation Systems for WMT19

no code implementations • WS 2019 • Bei Li, Yinqiao Li, Chen Xu, Ye Lin, Jiqiang Liu, Hui Liu, Ziyang Wang, Yuhao Zhang, Nuo Xu, Zeyang Wang, Kai Feng, Hexuan Chen, Tengbo Liu, Yanyang Li, Qiang Wang, Tong Xiao, Jingbo Zhu

We participated in 13 translation directions, including 11 supervised tasks, namely EN↔{ZH, DE, RU, KK, LT}, GU→EN and the unsupervised DE↔CS sub-track.

Knowledge Distillation Machine Translation +2

Paper
Add Code

Sharing Attention Weights for Fast Transformer

no code implementations • 26 Jun 2019 • Tong Xiao, Yinqiao Li, Jingbo Zhu, Zhengtao Yu, Tongran Liu

This is even 16 times faster than the baseline with no use of the attention cache.

Machine Translation Translation

Paper
Add Code

Shared-Private Bilingual Word Embeddings for Neural Machine Translation

no code implementations • ACL 2019 • Xuebo Liu, Derek F. Wong, Yang Liu, Lidia S. Chao, Tong Xiao, Jingbo Zhu

For similar source and target words, their embeddings tend to share a part of the features and they cooperatively learn these common representation units.

Machine Translation NMT +3

Paper
Add Code

Learning Deep Transformer Models for Machine Translation

2 code implementations • ACL 2019 • Qiang Wang, Bei Li, Tong Xiao, Jingbo Zhu, Changliang Li, Derek F. Wong, Lidia S. Chao

Transformer is the state-of-the-art model in recent machine translation evaluations.

Machine Translation Translation

113

Paper
Code

Modeling and Predicting Citation Count via Recurrent Neural Network with Long Short-Term Memory

no code implementations • 6 Nov 2018 • Sha Yuan, Jie Tang, Yu Zhang, Yifan Wang, Tong Xiao

The rapid evolution of scientific research has been creating a huge volume of publications every year.

Digital Libraries Physics and Society

Paper
Add Code

The NiuTrans Machine Translation System for WMT18

no code implementations • WS 2018 • Qiang Wang, Bei Li, Jiqiang Liu, Bojian Jiang, Zheyang Zhang, Yinqiao Li, Ye Lin, Tong Xiao, Jingbo Zhu

This paper describes the submission of the NiuTrans neural machine translation system for the WMT 2018 Chinese ↔ English news translation tasks.

Machine Translation Translation

Paper
Add Code

End-to-End Deep Kronecker-Product Matching for Person Re-identification

1 code implementation • CVPR 2018 • Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, Xiaogang Wang

Person re-identification aims to robustly measure similarities between person images.

Person Re-Identification

103

Paper
Code

Deep Group-shuffling Random Walk for Person Re-identification

1 code implementation • CVPR 2018 • Yantao Shen, Hongsheng Li, Tong Xiao, Shuai Yi, Dapeng Chen, Xiaogang Wang

Person re-identification aims at finding a person of interest in an image gallery by comparing the probe image of this person with all the gallery images.

Person Re-Identification Retrieval

103

Paper
Code

A Simple and Effective Approach to Coverage-Aware Neural Machine Translation

no code implementations • ACL 2018 • Yanyang Li, Tong Xiao, Yinqiao Li, Qiang Wang, Changming Xu, Jingbo Zhu

We offer a simple and effective method to seek a better balance between model confidence and length preference for Neural Machine Translation (NMT).

Machine Translation NMT +1

Paper
Add Code

Video Person Re-Identification With Competitive Snippet-Similarity Aggregation and Co-Attentive Snippet Embedding

no code implementations • CVPR 2018 • Dapeng Chen, Hongsheng Li, Tong Xiao, Shuai Yi, Xiaogang Wang

The attention weights are obtained based on a query feature, which is learned from the whole probe snippet by an LSTM network, making the resulting embeddings less affected by noisy frames.

Ranked #4 on Person Re-Identification on PRID2011

Video-Based Person Re-Identification

Paper
Add Code

Implicit Syntactic Features for Target-dependent Sentiment Analysis

no code implementations • IJCNLP 2017 • Yuze Gao, Yue Zhang, Tong Xiao

Targeted sentiment analysis investigates the sentiment polarities on given target mentions from input texts.

Representation Learning Sentence +1

Paper
Add Code

Learning Deep Neural Networks for Vehicle Re-ID with Visual-spatio-temporal Path Proposals

no code implementations • ICCV 2017 • Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, Xiaogang Wang

Vehicle re-identification is an important problem and has many applications in video surveillance and intelligent transportation.

Person Re-Identification Vehicle Re-Identification

Paper
Add Code

Identity-Aware Textual-Visual Matching with Latent Co-attention

no code implementations • ICCV 2017 • Shuang Li, Tong Xiao, Hongsheng Li, Wei Yang, Xiaogang Wang

The stage-2 CNN-LSTM network refines the matching results with a latent co-attention mechanism.

Sentence Text based Person Retrieval

Paper
Add Code

Towards Bidirectional Hierarchical Representations for Attention-Based Neural Machine Translation

no code implementations • EMNLP 2017 • Baosong Yang, Derek F. Wong, Tong Xiao, Lidia S. Chao, Jingbo Zhu

This paper proposes a hierarchical attentional neural translation model which focuses on enhancing source-side hierarchical representations by covering both local and global semantic information using a bidirectional tree-based encoder.

Machine Translation Translation

Paper
Add Code

Object Detection in Videos with Tubelet Proposal Networks

1 code implementation • CVPR 2017 • Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, Junjie Yan, Xihui Liu, Xiaogang Wang

Object detection in videos has drawn increasing attention recently with the introduction of the large-scale ImageNet VID dataset.

Object object-detection +2

Paper
Code

Person Search with Natural Language Description

1 code implementation • CVPR 2017 • Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, Xiaogang Wang

Searching persons in large-scale image databases with the query of natural language description has important applications in video surveillance.

Attribute Person Search +1

142

Paper
Code

Crafting GBD-Net for Object Detection

1 code implementation • 8 Oct 2016 • Xingyu Zeng, Wanli Ouyang, Junjie Yan, Hongsheng Li, Tong Xiao, Kun Wang, Yu Liu, Yucong Zhou, Bin Yang, Zhe Wang, Hui Zhou, Xiaogang Wang

The effectiveness of GBD-Net is shown through experiments on three object detection datasets, ImageNet, Pascal VOC2007 and Microsoft COCO.

Object object-detection +1

182

Paper
Code

Learning Deep Feature Representations with Domain Guided Dropout for Person Re-identification

1 code implementation • CVPR 2016 • Tong Xiao, Hongsheng Li, Wanli Ouyang, Xiaogang Wang

Learning generic and robust feature representations with data from multiple domains for the same problem is of great value, especially for the problems that have multiple datasets but none of them are large enough to provide abundant data variations.

Person Re-Identification

232

Paper
Code

T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos

1 code implementation • 9 Apr 2016 • Kai Kang, Hongsheng Li, Junjie Yan, Xingyu Zeng, Bin Yang, Tong Xiao, Cong Zhang, Zhe Wang, Ruohui Wang, Xiaogang Wang, Wanli Ouyang

Temporal and contextual information of videos are not fully investigated and utilized.

Novel Object Detection Object +3

369

Paper
Code

Joint Detection and Identification Feature Learning for Person Search

2 code implementations • CVPR 2017 • Tong Xiao, Shuang Li, Bochao Wang, Liang Lin, Xiaogang Wang

Existing person re-identification benchmarks and methods mainly focus on matching cropped pedestrian images between queries and candidates.

Ranked #9 on Person Re-Identification on CUHK03

Pedestrian Detection Person Re-Identification +1

732

Paper
Code

Convolutional neural networks with low-rank regularization

2 code implementations • 19 Nov 2015 • Cheng Tai, Tong Xiao, Yi Zhang, Xiaogang Wang, Weinan E

Recently, tensor decompositions have been used for speeding up CNNs.

Data Augmentation Tensor Decomposition

Paper
Code

NiuParser: A Chinese Syntactic and Semantic Parsing Toolkit

no code implementations • IJCNLP 2015 • Jingbo Zhu, Muhua Zhu, Qiang Wang, Tong Xiao

Chinese Word Segmentation Chunking +5

Paper
Add Code

Learning From Massive Noisy Labeled Data for Image Classification

no code implementations • CVPR 2015 • Tong Xiao, Tian Xia, Yi Yang, Chang Huang, Xiaogang Wang

To demonstrate the effectiveness of our approach, we collect a large-scale real-world clothing classification dataset with both noisy and clean labels.

Classification General Classification +1

Paper
Add Code

Effective Incorporation of Source Syntax into Hierarchical Phrase-based Translation

no code implementations • COLING 2014 • Tong Xiao, Adri{\`a} de Gispert, Jingbo Zhu, Bill Byrne

Machine Translation Translation

Paper
Add Code

A Hybrid Approach to Skeleton-based Translation

no code implementations • ACL 2014 • Tong Xiao, Jingbo Zhu, Chunliang Zhang

Language Modelling Machine Translation +1

Paper
Add Code

DeepReID: Deep Filter Pairing Neural Network for Person Re-Identification

no code implementations • CVPR 2014 • Wei Li, Rui Zhao, Tong Xiao, Xiaogang Wang

In this paper, we propose a novel filter pairing neural network (FPNN) to jointly handle misalignment, photometric and geometric transforms, occlusions and background clutter.

Person Re-Identification