Word Translation

35 papers with code • 0 benchmarks • 3 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Word Translation models and implementations

Most implemented papers

Word Translation Without Parallel Data

facebookresearch/MUSE ICLR 2018

We finally describe experiments on the English-Esperanto low-resource language pair, on which there only exists a limited amount of parallel data, to show the potential impact of our method in fully unsupervised machine translation.

Non-Adversarial Unsupervised Word Translation

facebookresearch/MUSE EMNLP 2018

We present a novel method that first aligns the second moment of the word distributions of the two languages and then iteratively refines the alignment.

Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion

facebookresearch/fastText EMNLP 2018

Continuous word representations learned separately on distinct languages can be aligned so that their words become comparable in a common space.

Unsupervised Multilingual Word Embeddings

ccsasuke/umwe EMNLP 2018

Multilingual Word Embeddings (MWEs) represent words from multiple languages in a single distributional vector space.

Cross-Lingual Adaptation using Structural Correspondence Learning

pprett/bolt 4 Aug 2010

From these correspondences a cross-lingual representation is created that enables the transfer of classification knowledge from the source to the target language.

Learning Multilingual Word Embeddings in Latent Metric Space: A Geometric Approach

anoopkunchukuttan/geomm TACL 2019

Our approach decouples learning the transformation from the source language to the target language into (a) learning rotations for language-specific embeddings to align them to a common space, and (b) learning a similarity metric in the common space to model similarities between the embeddings.

Robust Cross-lingual Embeddings from Parallel Sentences

epfml/Bi-Sent2Vec 28 Dec 2019

Recent advances in cross-lingual word embeddings have primarily relied on mapping-based methods, which project pretrained word embeddings from different languages into a shared space through a linear transformation.

LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons

BatsResearch/LexC-Gen 21 Feb 2024

We show that conditioning on bilingual lexicons is the key component of LexC-Gen. LexC-Gen is also practical -- it only needs a single GPU to generate data at scale.

A Resource-Light Method for Cross-Lingual Semantic Textual Similarity

gg42554/cl-sts 19 Jan 2018

In contrast, we propose an unsupervised and a very resource-light approach for measuring semantic similarity between texts in different languages.

Unsupervised Clinical Language Translation

ckbjimmy/p2c 4 Feb 2019

As patients' access to their doctors' clinical notes becomes common, translating professional, clinical jargon to layperson-understandable language is essential to improve patient-clinician communication.