Machine Translation

2148 papers with code • 81 benchmarks • 76 datasets

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Libraries

Use these libraries to find Machine Translation models and implementations
24 papers
1,206
15 papers
29,201
14 papers
124,593
See all 14 libraries.

Latest papers with no code

Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets

no code yet • 9 Apr 2024

Recent research in the field of multimodal machine translation (MMT) has indicated that the visual modality is either dispensable or offers only marginal advantages.

Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods

no code yet • 8 Apr 2024

In various real-world applications such as machine translation, sentiment analysis, and question answering, a pivotal role is played by NLP models, facilitating efficient communication and decision-making processes in domains ranging from healthcare to finance.

Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation

no code yet • 5 Apr 2024

Parameter-efficient fine-tuning (PEFT) methods are increasingly vital in adapting large-scale pre-trained language models for diverse tasks, offering a balance between adaptability and computational efficiency.

Towards Automated Movie Trailer Generation

no code yet • 4 Apr 2024

Movie trailers are an essential tool for promoting films and attracting audiences.

Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic Comparison

no code yet • 3 Apr 2024

Retrieval-Augmented Neural Machine Translation (RAMT) architectures retrieve examples from memory to guide the generation process.

MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness

no code yet • 3 Apr 2024

This paper presents our system developed for the SemEval-2024 Task 1: Semantic Textual Relatedness (STR), on Track C: Cross-lingual.

Backdoor Attack on Multilingual Machine Translation

no code yet • 3 Apr 2024

Our aim is to bring attention to these vulnerabilities within MNMT systems with the hope of encouraging the community to address security concerns in machine translation, especially in the context of low-resource languages.

Optical Text Recognition in Nepali and Bengali: A Transformer-based Approach

no code yet • 3 Apr 2024

Efforts on the research and development of OCR systems for Low-Resource Languages are relatively new.

An Incomplete Loop: Deductive, Inductive, and Abductive Learning in Large Language Models

no code yet • 3 Apr 2024

Modern language models (LMs) can learn to perform new tasks in different ways: in instruction following, the target task is described explicitly in natural language; in few-shot prompting, the task is specified implicitly with a small number of examples; in instruction inference, LMs are presented with in-context examples and are then prompted to generate a natural language task description before making predictions.

HyperCLOVA X Technical Report

no code yet • 2 Apr 2024

We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding.