no code implementations • 29 Feb 2024 • Tsz Kin Lam, Alexandra Birch, Barry Haddow
In this paper, we leverage SSL models by pretraining smaller models on their Discrete Speech Units (DSUs).
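Discrete Speech Units are typically obtained by quantizing the continuous features of a self-supervised speech model against a set of cluster centroids (e.g. from k-means). The sketch below is an illustrative assumption of that quantization step, not the paper's code; the centroids would normally be learned over SSL features.

```python
import numpy as np

def discretize(features, centroids):
    """Map each frame's SSL feature vector to the index of its nearest
    centroid, yielding a Discrete Speech Unit (DSU) sequence.

    features:  (num_frames, dim) array of continuous SSL features
    centroids: (num_units, dim) array, assumed learned via k-means
    """
    # Pairwise Euclidean distances between every frame and every centroid.
    dists = np.linalg.norm(features[:, None, :] - centroids[None, :, :], axis=-1)
    # Each frame becomes the id of its closest centroid (its discrete unit).
    return dists.argmin(axis=1)
```

A smaller model can then be pretrained on these unit sequences instead of raw audio.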
no code implementations • 1 Feb 2024 • Giulio Zhou, Tsz Kin Lam, Alexandra Birch, Barry Haddow
While there has been a growing interest in developing direct speech translation systems to avoid propagating errors and losing non-verbal content, prior work in direct S2TT has struggled to conclusively establish the advantages of integrating the acoustic signal directly into the translation process.
no code implementations • 27 Oct 2022 • Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler
Data augmentation is a technique to generate new training data based on existing data.
Automatic Speech Recognition (ASR) +2
no code implementations • 24 Oct 2022 • Tsz Kin Lam, Eva Hasler, Felix Hieber
Customer feedback can be an important signal for improving commercial machine translation systems.
no code implementations • ACL 2022 • Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler
End-to-end speech translation relies on data that pair source-language speech inputs with corresponding translations into a target language.
1 code implementation • 3 Apr 2021 • Tsz Kin Lam, Mayumi Ohta, Shigehiko Schamoni, Stefan Riezler
Our method, called Aligned Data Augmentation (ADA) for ASR, replaces transcribed tokens and the speech representations in an aligned manner to generate previously unseen training pairs.
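The swapping idea behind ADA can be sketched as exchanging a token together with its aligned speech frames between two utterances. The helper below is a hypothetical illustration under that reading, not the authors' implementation; names and the pairing format are assumptions.

```python
import random

def aligned_swap(pair_a, pair_b, seed=0):
    """Illustrative sketch of aligned token/speech swapping (ADA-style).

    Each pair is (tokens, segments), where segments[i] holds the speech
    frames aligned to tokens[i]. One aligned unit from pair_b replaces a
    randomly chosen unit in pair_a, producing an unseen training pair.
    """
    tokens_a, segs_a = pair_a
    tokens_b, segs_b = pair_b
    rng = random.Random(seed)
    i = rng.randrange(len(tokens_a))   # position to replace in utterance A
    j = rng.randrange(len(tokens_b))   # aligned unit to take from utterance B
    # Replace the token AND its aligned speech segment together, so the
    # transcript and the audio stay consistent.
    new_tokens = tokens_a[:i] + [tokens_b[j]] + tokens_a[i + 1:]
    new_segs = segs_a[:i] + [segs_b[j]] + segs_a[i + 1:]
    return new_tokens, new_segs
```

Because token and frames move as one aligned unit, the augmented pair remains a plausible (speech, transcript) example.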
Automatic Speech Recognition (ASR) +3
no code implementations • 21 Oct 2020 • Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler
Direct speech translation describes a scenario where only speech inputs and corresponding translations are available.
Automatic Speech Recognition (ASR) +3
no code implementations • WS 2019 • Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler
We propose an interactive-predictive neural machine translation framework for easier model personalization using reinforcement and imitation learning.
1 code implementation • 3 May 2018 • Tsz Kin Lam, Julia Kreutzer, Stefan Riezler
We present an approach to interactive-predictive neural machine translation that attempts to reduce human effort from three directions: Firstly, instead of requiring humans to select, correct, or delete segments, we employ the idea of learning from human reinforcements in the form of judgments on the quality of partial translations.