Search Results for author: Yuwei Yin

Found 16 papers, 10 papers with code

Improving Multilingual Neural Machine Translation with Auxiliary Source Languages

1 code implementation • Findings (EMNLP) 2021 • Weijia Xu, Yuwei Yin, Shuming Ma, Dongdong Zhang, Haoyang Huang

Multilingual neural machine translation models typically handle one source language at a time.

Paper
Code

m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt

no code implementations • 26 Mar 2024 • Jian Yang, Hongcheng Guo, Yuwei Yin, Jiaqi Bai, Bing Wang, Jiaheng Liu, Xinnian Liang, Linzheng Cahi, Liqun Yang, Zhoujun Li

Our method aims to minimize the representation distance of different languages by regarding the image as a central language.

Machine Translation Translation

Paper
Add Code

Red Teaming Visual Language Models

no code implementations • 23 Jan 2024 • Mukai Li, Lei LI, Yuwei Yin, Masood Ahmed, Zhenguang Liu, Qi Liu

Additionally, we simply apply red teaming alignment to LLaVA-v1. 5 with Supervised Fine-tuning (SFT) using RTVLM, and this bolsters the models' performance with 10% in RTVLM test set, 13% in MM-Hal, and without noticeable decline in MM-Bench, overpassing other LLaVA-based models with regular alignment data.

Fairness

Paper
Add Code

MT4CrossOIE: Multi-stage Tuning for Cross-lingual Open Information Extraction

2 code implementations • 12 Aug 2023 • Tongliang Li, Zixiang Wang, Linzheng Chai, Jian Yang, Jiaqi Bai, Yuwei Yin, Jiaheng Liu, Hongcheng Guo, Liqun Yang, Hebboul Zine el-abidine, Zhoujun Li

Cross-lingual open information extraction aims to extract structured information from raw text across multiple languages.

Cross-Lingual Transfer Language Modelling +2

Paper
Code

FinPT: Financial Risk Prediction with Profile Tuning on Pretrained Foundation Models

1 code implementation • 22 Jul 2023 • Yuwei Yin, Yazheng Yang, Jian Yang, Qi Liu

To tackle these issues, we propose FinPT and FinBench: the former is a novel approach for financial risk prediction that conduct Profile Tuning on large pretrained foundation models, and the latter is a set of high-quality datasets on financial risks such as default, fraud, and churn.

Paper
Code

M$^3$IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning

no code implementations • 7 Jun 2023 • Lei LI, Yuwei Yin, Shicheng Li, Liang Chen, Peiyi Wang, Shuhuai Ren, Mukai Li, Yazheng Yang, Jingjing Xu, Xu sun, Lingpeng Kong, Qi Liu

To tackle this challenge and promote research in the vision-language field, we introduce the Multi-Modal, Multilingual Instruction Tuning (M$^3$IT) dataset, designed to optimize VLM alignment with human instructions.

World Knowledge

Paper
Add Code

TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models

1 code implementation • 18 Apr 2023 • Yuwei Yin, Jean Kaddour, Xiang Zhang, Yixin Nie, Zhenguang Liu, Lingpeng Kong, Qi Liu

In addition, generative data augmentation (GDA) has been shown to produce more diverse and flexible data.

Data Augmentation domain classification +1

Paper
Code

HanoiT: Enhancing Context-aware Translation via Selective Context

no code implementations • 17 Jan 2023 • Jian Yang, Yuwei Yin, Shuming Ma, Liqun Yang, Hongcheng Guo, Haoyang Huang, Dongdong Zhang, Yutao Zeng, Zhoujun Li, Furu Wei

Context-aware neural machine translation aims to use the document-level context to improve translation quality.

Document Level Machine Translation Machine Translation +2

Paper
Add Code

GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator

1 code implementation • 20 Dec 2022 • Jian Yang, Shuming Ma, Li Dong, Shaohan Huang, Haoyang Huang, Yuwei Yin, Dongdong Zhang, Liqun Yang, Furu Wei, Zhoujun Li

Inspired by the idea of Generative Adversarial Networks (GANs), we propose a GAN-style model for encoder-decoder pre-training by introducing an auxiliary discriminator, unifying the ability of language understanding and generation in a single model.

Denoising Sentence +1

Paper
Code

CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation

1 code implementation • 13 Oct 2022 • Jian Yang, Shaohan Huang, Shuming Ma, Yuwei Yin, Li Dong, Dongdong Zhang, Hongcheng Guo, Zhoujun Li, Furu Wei

Specifically, the target sequence is first translated into the source language and then tagged by a source NER model.

Cross-Lingual NER Machine Translation +5

Paper
Code

GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation

1 code implementation • 29 Jul 2022 • Jian Yang, Yuwei Yin, Liqun Yang, Shuming Ma, Haoyang Huang, Dongdong Zhang, Furu Wei, Zhoujun Li

Transformer structure, stacked by a sequence of encoder and decoder network layers, achieves significant development in neural machine translation.

Machine Translation Translation

Paper
Code

UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation

1 code implementation • 11 Jul 2022 • Jian Yang, Yuwei Yin, Shuming Ma, Dongdong Zhang, Shuangzhi Wu, Hongcheng Guo, Zhoujun Li, Furu Wei

Most translation tasks among languages belong to the zero-resource translation problem where parallel corpora are unavailable.

Machine Translation NMT +1

Paper
Code

HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation

1 code implementation • 11 Jul 2022 • Jian Yang, Yuwei Yin, Shuming Ma, Dongdong Zhang, Zhoujun Li, Furu Wei

Nonetheless, multilingual training is plagued by language interference degeneration in shared parameters because of the negative interference among different translation directions, especially on high-resource languages.

Machine Translation Translation

Paper
Code

Exploring Entity Interactions for Few-Shot Relation Learning (Student Abstract)

no code implementations • 4 May 2022 • Yi Liang, Shuai Zhao, Bo Cheng, Yuwei Yin, Hao Yang

Few-shot relation learning refers to infer facts for relations with a limited number of observed triples.

Metric Learning Relation

Paper
Add Code

Multilingual Agreement for Multilingual Neural Machine Translation

no code implementations • ACL 2021 • Jian Yang, Yuwei Yin, Shuming Ma, Haoyang Huang, Dongdong Zhang, Zhoujun Li, Furu Wei

Although multilingual neural machine translation (MNMT) enables multiple language translations, the training process is based on independent multilingual objectives.

Machine Translation Translation

Paper
Add Code

Toward Tweet Entity Linking with Heterogeneous Information Networks

1 code implementation • IEEE Transactions on Knowledge and Data Engineering 2021 • Wei Shen, Yuwei Yin, Yang Yang, Jiawei Han, Jianyong Wang, Xiaojie Yuan

The task of linking an entity mention in a tweet with its corresponding entity in a heterogeneous information network is of great importance, for the purpose of enriching heterogeneous information networks with the abundant and fresh knowledge embedded in tweets.

Entity Linking Metric Learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.