Word Embeddings

484 papers with code · Methodology

Word embedding is the collective name for a set of language modeling and feature learning techniques in natural language processing (NLP) where words or phrases from the vocabulary are mapped to vectors of real numbers.

( Image credit: Dynamic Word Embedding for Evolving Semantic Discovery )

Benchmarks

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

Adversarial Training Methods for Semi-Supervised Text Classification

25 May 2016tensorflow/models

Adversarial training provides a means of regularizing supervised learning algorithms while virtual adversarial training is able to extend supervised learning algorithms to the semi-supervised setting.

ADVERSARIAL TRAINING SENTIMENT ANALYSIS TEXT CLASSIFICATION WORD EMBEDDINGS

FastText.zip: Compressing text classification models

12 Dec 2016facebookresearch/fastText

We consider the problem of producing compact architectures for text classification, such that the full model fits in a limited amount of memory.

QUANTIZATION TEXT CLASSIFICATION WORD EMBEDDINGS

Enriching Word Vectors with Subword Information

TACL 2017 facebookresearch/fastText

A vector representation is associated to each character $n$-gram; words being represented as the sum of these representations.

WORD EMBEDDINGS

Contextual String Embeddings for Sequence Labeling

COLING 2018 zalandoresearch/flair

Recent advances in language modeling using recurrent neural networks have made it viable to model language as distributions over characters.

CHUNKING LANGUAGE MODELLING NAMED ENTITY RECOGNITION PART-OF-SPEECH TAGGING WORD EMBEDDINGS

Named Entity Recognition with Bidirectional LSTM-CNNs

TACL 2016 zalandoresearch/flair

Named entity recognition is a challenging task that has traditionally required large amounts of knowledge in the form of feature engineering and lexicons to achieve high performance.

ENTITY LINKING FEATURE ENGINEERING NAMED ENTITY RECOGNITION WORD EMBEDDINGS

StarSpace: Embed All The Things!

12 Sep 2017facebookresearch/ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

TEXT CLASSIFICATION WORD EMBEDDINGS

Application of a Hybrid Bi-LSTM-CRF model to the task of Russian Named Entity Recognition

27 Sep 2017deepmipt/DeepPavlov

Named Entity Recognition (NER) is one of the most common tasks of the natural language processing.

NAMED ENTITY RECOGNITION WORD EMBEDDINGS

Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec

6 May 2016cemoody/lda2vec

Distributed dense word vectors have been shown to be effective at capturing token-level semantic and syntactic regularities in language, while topic models can form interpretable representations over documents.

TOPIC MODELS WORD EMBEDDINGS

Unsupervised Alignment of Embeddings with Wasserstein Procrustes

29 May 2018facebookresearch/MUSE

A library for Multilingual Unsupervised or Supervised word Embeddings

WORD EMBEDDINGS