Search Results for author: Henry Li Xinyuan

Found 2 papers, 1 paper with code

HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation

1 code implementation • 20 Jun 2023 • Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur

We introduce HK-LegiCoST, a new three-way parallel corpus of Cantonese-English translations, containing 600+ hours of Cantonese audio, its standard traditional Chinese transcript, and English translation, segmented and aligned at the sentence level.

Cross-corpus Sentence +3

Learning a Formality-Aware Japanese Sentence Representation

no code implementations • 17 Jan 2023 • Henry Li Xinyuan, Ray Lee, Jerry Chen, Kelly Marchisio

On the other hand, downstream tasks such as translation would benefit from working with a sentence representation that preserves formality in addition to semantics, so as to generate sentences with the appropriate level of social formality -- the difference between speaking to a friend versus speaking with a supervisor.

Decoder Sentence
