2 code implementations • 11 Oct 2022 • Chinh Ngo, Trieu H. Trinh, Long Phan, Hieu Tran, Tai Dang, Hieu Nguyen, Minh Nguyen, Minh-Thang Luong
We introduce MTet, the largest publicly available parallel corpus for English-Vietnamese translation.
Ranked #1 on Machine Translation on IWSLT2015 English-Vietnamese (using extra training data)
1 code implementation • Blog 2022 • Chinh Ngo, Hieu Tran, Long Phan, Trieu H. Trinh, Hieu Nguyen, Minh Nguyen, Minh-Thang Luong
We are excited to introduce a new larger and better quality Machine Translation dataset, MTet, which stands for Multi-domain Translation for English and VieTnamese.
1 code implementation • 20 Apr 2021 • Chinh Ngo, Trieu Trinh
We collect data from open sources on the Internet, and classify them into different categories, each labeled with a specific language style 3.
Ranked #2 on Machine Translation on IWSLT2015 English-Vietnamese (using extra training data)