no code implementations • WMT (EMNLP) 2021 • Meng Zhang, Minghao Wu, Pengfei Li, Liangyou Li, Qun Liu
This paper describes the NoahNMT system submitted to the WMT 2021 shared task of Very Low Resource Supervised Machine Translation.
no code implementations • 21 Feb 2024 • Chenyang Lyu, Minghao Wu, Alham Fikri Aji
Large Language Models (LLMs) have demonstrated remarkable capabilities across various applications, fundamentally reshaping the landscape of natural language processing (NLP) research.
no code implementations • 27 Jan 2024 • Minghao Wu, YuFei Wang, George Foster, Lizhen Qu, Gholamreza Haffari
Document-level neural machine translation (DocNMT) aims to generate translations that are both coherent and cohesive, in contrast to its sentence-level counterpart.
no code implementations • 12 Jan 2024 • Minghao Wu, Thuy-Trang Vu, Lizhen Qu, George Foster, Gholamreza Haffari
Large language models (LLMs) have made significant strides in various natural language processing (NLP) tasks.
1 code implementation • 17 Dec 2023 • Renxi Wang, Haonan Li, Minghao Wu, Yuxia Wang, Xudong Han, Chiyu Zhang, Timothy Baldwin
Instruction tuning significantly enhances the performance of large language models (LLMs) across various tasks.
no code implementations • 25 Nov 2023 • Zhanyu Wang, Longyue Wang, Zhen Zhao, Minghao Wu, Chenyang Lyu, Huayang Li, Deng Cai, Luping Zhou, Shuming Shi, Zhaopeng Tu
While the recent advances in Multimodal Large Language Models (MLLMs) constitute a significant leap forward in the field, these models are predominantly confined to the realm of input-side multimodal comprehension, lacking the capacity for multimodal content generation.
no code implementations • 6 Jul 2023 • Minghao Wu, Alham Fikri Aji
This study investigates the behavior of crowd-sourced and expert annotators, as well as LLMs, when comparing outputs from different models.
1 code implementation • 15 Jun 2023 • Chenyang Lyu, Minghao Wu, Longyue Wang, Xinting Huang, Bingshuai Liu, Zefeng Du, Shuming Shi, Zhaopeng Tu
Although instruction-tuned large language models (LLMs) have exhibited remarkable capabilities across various NLP tasks, their effectiveness on other data modalities beyond text has not been fully studied.
1 code implementation • 24 May 2023 • Haonan Li, Fajri Koto, Minghao Wu, Alham Fikri Aji, Timothy Baldwin
Research on multilingual instruction tuning has been limited due to the scarcity of high-quality instruction-response datasets across different languages.
no code implementations • 2 May 2023 • Chenyang Lyu, Zefeng Du, Jitao Xu, Yitao Duan, Minghao Wu, Teresa Lynn, Alham Fikri Aji, Derek F. Wong, Siyou Liu, Longyue Wang
We conclude by emphasizing the critical role of LLMs in guiding the future evolution of MT and offer a roadmap for future exploration in the sector.
1 code implementation • 27 Apr 2023 • Minghao Wu, Abdul Waheed, Chiyu Zhang, Muhammad Abdul-Mageed, Alham Fikri Aji
The results demonstrate that our proposed LaMini-LM models are comparable to competitive baselines, while being much smaller in size.
Ranked #15 on Word Sense Disambiguation on Words in Context
no code implementations • 16 Feb 2023 • Minghao Wu, George Foster, Lizhen Qu, Gholamreza Haffari
Existing work in document-level neural machine translation commonly concatenates several consecutive sentences as a pseudo-document, and then learns inter-sentential dependencies.
1 code implementation • ACL 2022 • Pengfei Li, Liangyou Li, Meng Zhang, Minghao Wu, Qun Liu
To the best of our knowledge, this is the first work to pre-train a unified model for fine-tuning on both NMT tasks.
no code implementations • EMNLP 2021 • Minghao Wu, Yitong Li, Meng Zhang, Liangyou Li, Gholamreza Haffari, Qun Liu
In this work, we propose MultiUAT, an approach for multi-corpus machine translation that dynamically adjusts training data usage based on the model's uncertainty on a small set of trusted clean data.
1 code implementation • EMNLP 2018 • Minghao Wu, Fei Liu, Trevor Cohn
Conventional wisdom holds that hand-crafted features are redundant for deep learning models, as such models already learn adequate representations of text automatically from corpora.
Ranked #42 on Named Entity Recognition (NER) on CoNLL 2003 (English)