Search Results for author: Anna Sun

Found 8 papers, 4 papers with code

Efficient Monotonic Multihead Attention

no code implementations7 Dec 2023 Xutai Ma, Anna Sun, Siqi Ouyang, Hirofumi Inaguma, Paden Tomasello

We introduce the Efficient Monotonic Multihead Attention (EMMA), a state-of-the-art simultaneous translation model with numerically-stable and unbiased monotonic alignment estimation.

Simultaneous Speech-to-Text Translation Translation

Towards MoE Deployment: Mitigating Inefficiencies in Mixture-of-Expert (MoE) Inference

no code implementations10 Mar 2023 Haiyang Huang, Newsha Ardalani, Anna Sun, Liu Ke, Hsien-Hsin S. Lee, Anjali Sridhar, Shruti Bhosale, Carole-Jean Wu, Benjamin Lee

We propose three optimization techniques to mitigate sources of inefficiencies, namely (1) Dynamic gating, (2) Expert Buffering, and (3) Expert load balancing.

Language Modelling Machine Translation

Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages

no code implementations7 Feb 2023 Simeng Sun, Maha Elbayad, Anna Sun, James Cross

With multilingual machine translation (MMT) models continuing to grow in size and number of supported languages, it is natural to reuse and upgrade existing models to save computation as data becomes available in more languages.

Machine Translation Translation

Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation

no code implementations15 Dec 2022 Maha Elbayad, Anna Sun, Shruti Bhosale

Sparsely gated Mixture of Experts (MoE) models have been shown to be a compute-efficient method to scale model capacity for multilingual machine translation.

Machine Translation Translation

Playing Codenames with Language Graphs and Word Embeddings

1 code implementation12 May 2021 Divya Koyyalagunta, Anna Sun, Rachel Lea Draelos, Cynthia Rudin

Although board games and video games have been studied for decades in artificial intelligence research, challenging word games remain relatively unexplored.

Board Games Common Sense Reasoning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.