Search Results for author: Dominik Macháček

Found 14 papers, 3 papers with code

Turning Whisper into Real-Time Transcription System

1 code implementation27 Jul 2023 Dominik Macháček, Raj Dabre, Ondřej Bojar

Whisper is one of the recent state-of-the-art multilingual speech recognition and translation models, however, it is not designed for real time transcription.

speech-recognition Speech Recognition +1

Robustness of Multi-Source MT to Transcription Errors

no code implementations26 May 2023 Dominik Macháček, Peter Polák, Ondřej Bojar, Raj Dabre

Automatic speech translation is sensitive to speech recognition errors, but in a multilingual scenario, the same content may be available in various languages via simultaneous interpreting, dubbing or subtitling.

Machine Translation speech-recognition +2

MT Metrics Correlate with Human Ratings of Simultaneous Speech Translation

1 code implementation16 Nov 2022 Dominik Macháček, Ondřej Bojar, Raj Dabre

There have been several meta-evaluation studies on the correlation between human ratings and offline machine translation (MT) evaluation metrics such as BLEU, chrF2, BertScore and COMET.

Machine Translation Translation

Comprehension of Subtitles from Re-Translating Simultaneous Speech Translation

no code implementations4 Mar 2022 Dávid Javorský, Dominik Macháček, Ondřej Bojar

Our results show that the subtitling layout or flicker have a little effect on comprehension, in contrast to machine translation itself and individual competence.

Machine Translation Translation

The Reality of Multi-Lingual Machine Translation

no code implementations25 Feb 2022 Tom Kocmi, Dominik Macháček, Ondřej Bojar

Machine translation is for us a prime example of deep learning applications where human skills and learning capabilities are taken as a benchmark that many try to match and surpass.

Cross-Lingual Transfer Machine Translation +2

Lost in Interpreting: Speech Translation from Source or Interpreter?

no code implementations17 Jun 2021 Dominik Macháček, Matúš Žilinec, Ondřej Bojar

Interpreters facilitate multi-lingual meetings but the affordable set of languages is often smaller than what is needed.

Machine Translation Translation

Presenting Simultaneous Translation in Limited Space

no code implementations18 Sep 2020 Dominik Macháček, Ondřej Bojar

Furthermore, we propose a way how to estimate the overall usability of the combination of automatic translation and subtitling by measuring the quality, latency, and stability on a test set, and propose an improved measure for translation latency.

Translation

CUNI Systems for the Unsupervised News Translation Task in WMT 2019

no code implementations29 Jul 2019 Ivana Kvapilíková, Dominik Macháček, Ondřej Bojar

In this paper we describe the CUNI translation system used for the unsupervised news shared task of the ACL 2019 Fourth Conference on Machine Translation (WMT19).

Machine Translation Translation

Morphological and Language-Agnostic Word Segmentation for NMT

1 code implementation14 Jun 2018 Dominik Macháček, Jonáš Vidra, Ondřej Bojar

The state of the art of handling rich morphology in neural machine translation (NMT) is to break word forms into subword units, so that the overall vocabulary size of these units fits the practical limits given by the NMT model and GPU memory capacity.

Machine Translation NMT +1

Cannot find the paper you are looking for? You can Submit a new open access paper.