no code implementations • 7 Nov 2023 • Manas Mohanty, Tanya Roosta, Peyman Passban
Deep neural networks (DNNs) have significantly improved NLP tasks, but training and maintaining such networks can be costly.
no code implementations • NAACL 2022 • Peyman Passban, Tanya Roosta, Rahul Gupta, Ankit Chadha, Clement Chung
Training mixed-domain translation models is a complex task that demands tailored architectures and costly data preparation techniques.
no code implementations • COLING 2022 • Joyce Zheng, Mehdi Rezagholizadeh, Peyman Passban
To solve this problem, position embeddings are defined exclusively for each time step to enrich word information.
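Only the opening sentence of the abstract is shown here; as a rough illustration of the general idea, a learned per-time-step position embedding can be added to each token embedding. A minimal sketch, not necessarily the paper's exact formulation (all names are hypothetical):

```python
import torch
import torch.nn as nn

class PositionalEmbedding(nn.Module):
    """Learned embedding per time step, added to the token embedding."""
    def __init__(self, vocab_size, max_len, d_model):
        super().__init__()
        self.tok = nn.Embedding(vocab_size, d_model)
        self.pos = nn.Embedding(max_len, d_model)  # one vector per position

    def forward(self, token_ids):                  # token_ids: (batch, seq_len)
        positions = torch.arange(token_ids.size(1), device=token_ids.device)
        return self.tok(token_ids) + self.pos(positions)  # broadcast over batch
```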
no code implementations • 12 Dec 2021 • Tanya Roosta, Peyman Passban, Ankit Chadha
These new components are placed between the original layers.
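The sentence above only hints at what these components look like; one common way to insert new trainable components between existing layers is an adapter-style residual bottleneck. A sketch under that assumption, not the paper's confirmed design (all names hypothetical):

```python
import torch.nn as nn

class Adapter(nn.Module):
    """A small bottleneck module that can be slotted between existing layers."""
    def __init__(self, d_model, d_bottleneck=64):
        super().__init__()
        self.down = nn.Linear(d_model, d_bottleneck)
        self.up = nn.Linear(d_bottleneck, d_model)
        self.act = nn.ReLU()

    def forward(self, x):
        return x + self.up(self.act(self.down(x)))  # residual around bottleneck
```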
1 code implementation • Findings (ACL) 2021 • Ehsan Kamalloo, Mehdi Rezagholizadeh, Peyman Passban, Ali Ghodsi
We exploit a semi-supervised approach based on knowledge distillation (KD) to train a model on augmented data.
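A minimal sketch of what one semi-supervised KD step could look like, with a frozen teacher soft-labeling augmented inputs for the student (hypothetical function and parameter names; not the paper's exact recipe):

```python
import torch
import torch.nn.functional as F

def pseudo_label_step(teacher, student, augmented_batch, optimizer, T=2.0):
    """One semi-supervised KD step: the teacher soft-labels unlabeled
    augmented inputs and the student matches those distributions."""
    with torch.no_grad():
        soft_targets = F.softmax(teacher(augmented_batch) / T, dim=-1)
    log_probs = F.log_softmax(student(augmented_batch) / T, dim=-1)
    loss = F.kl_div(log_probs, soft_targets, reduction="batchmean") * (T * T)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```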
no code implementations • 18 Apr 2021 • Krtin Kumar, Peyman Passban, Mehdi Rezagholizadeh, Yiu Sing Lau, Qun Liu
Embedding matrices are key components in neural natural language processing (NLP) models, responsible for providing numerical representations of input tokens. (In this paper, words and subwords are referred to as "tokens", and the term "embedding" refers only to embeddings of inputs.)
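As a concrete illustration of what an embedding matrix is, a lookup is simply row selection in a vocabulary-by-dimension matrix (a toy example with made-up numbers, not tied to the paper's method):

```python
import numpy as np

vocab = {"the": 0, "cat": 1, "sat": 2}    # toy token-to-index map
d_model = 4
E = np.random.randn(len(vocab), d_model)  # embedding matrix: one row per token

token_ids = [vocab[t] for t in "the cat sat".split()]
vectors = E[token_ids]                    # lookup = row selection, shape (3, 4)
```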
no code implementations • 17 Apr 2021 • Kira A. Selby, Yinong Wang, Ruizhe Wang, Peyman Passban, Ahmad Rashid, Mehdi Rezagholizadeh, Pascal Poupart
Despite recent monumental advances in the field, many Natural Language Processing (NLP) models still struggle to perform adequately on noisy domains.
no code implementations • Findings (EMNLP) 2021 • Peyman Passban, Puneeth S. M. Saladi, Qun Liu
There is a large body of work in the NMT literature analyzing how conventional models behave in the presence of noise, but Transformers are relatively understudied in this context.
no code implementations • 27 Dec 2020 • Peyman Passban, Yimeng Wu, Mehdi Rezagholizadeh, Qun Liu
Knowledge distillation is a training and compression strategy in which two neural networks, namely a teacher and a student, are coupled together during training.
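A minimal sketch of the standard objective that couples the two networks, combining cross-entropy on gold labels with a temperature-softened teacher-matching term (the classic Hinton-style loss, shown for orientation; the paper's exact variant may differ):

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Cross-entropy on gold labels plus KL divergence between
    temperature-softened teacher and student distributions."""
    ce = F.cross_entropy(student_logits, labels)
    kl = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale gradients softened by the temperature
    return alpha * ce + (1 - alpha) * kl
```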
2 code implementations • EMNLP 2020 • Yimeng Wu, Peyman Passban, Mehdi Rezagholizadeh, Qun Liu
As computing power grows, neural machine translation (NMT) models grow accordingly and become better.
no code implementations • COLING 2018 • Peyman Passban, Andy Way, Qun Liu
A morphologically complex word (MCW) is a hierarchical constituent with meaning-preserving subunits, so word-based models that rely on surface forms might not be powerful enough to translate such structures.
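For instance, a Turkish word such as "evlerimizden" ("from our houses") decomposes into meaning-preserving subunits that the surface form hides. A toy illustration (the example word is mine, not from the paper):

```python
# A morphologically complex word (MCW) and its subunits:
# ev (house) + ler (plural) + imiz (our) + den (from) = "from our houses".
word = "evlerimizden"
morphemes = ["ev", "ler", "imiz", "den"]
assert "".join(morphemes) == word  # the surface form hides this structure
```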
no code implementations • NAACL 2018 • Peyman Passban, Qun Liu, Andy Way
Recently, neural machine translation (NMT) has emerged as a powerful alternative to conventional statistical approaches.
no code implementations • 17 Apr 2018 • Alberto Poncelas, Dimitar Shterionov, Andy Way, Gideon Maillette de Buy Wenniger, Peyman Passban
A prerequisite for training corpus-based machine translation (MT) systems -- either Statistical MT (SMT) or Neural MT (NMT) -- is the availability of high-quality parallel data.
no code implementations • COLING 2016 • Peyman Passban, Qun Liu, Andy Way
Phrase-based statistical machine translation (PBSMT) engines by default provide four probability scores in phrase tables, which are considered the main set of bilingual features.
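For orientation, a Moses-style phrase-table entry carries those four scores in its third field (inverse and direct phrase translation probabilities plus the corresponding lexical weightings). A toy parsing sketch with made-up numbers:

```python
# Moses-style phrase-table entry: source ||| target ||| four default scores.
line = "das Haus ||| the house ||| 0.8 0.6 0.7 0.5"
src, tgt, scores = (field.strip() for field in line.split("|||"))
phi_fe, lex_fe, phi_ef, lex_ef = map(float, scores.split())
```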