Search Results for author: Karl Stratos

A conventional approach to entity linking is to first find mentions in a given document and then infer their underlying entities in the knowledge base.

Ranked #4 on Entity Linking on AIDA-CoNLL

Benchmarking Entity Linking +4

Paper
Code

Understanding Hard Negatives in Noise Contrastive Estimation

1 code implementation • NAACL 2021 • Wenzheng Zhang, Karl Stratos

The choice of negative examples is important in noise contrastive estimation.

Entity Linking Retrieval +1

Paper
Code

Fast and Effective Biomedical Entity Linking Using a Dual Encoder

1 code implementation • EACL (Louhi) 2021 • Rajarshi Bhowmik, Karl Stratos, Gerard de Melo

Additionally, we modify our dual encoder model for end-to-end biomedical entity linking that performs both mention span detection and entity disambiguation and out-performs two recently proposed models.

Entity Disambiguation Entity Linking

Paper
Code

Data-to-text Generation by Splicing Together Nearest Neighbors

1 code implementation • EMNLP 2021 • Sam Wiseman, Arturs Backurs, Karl Stratos

We propose to tackle data-to-text generation tasks by directly splicing together retrieved segments of text from "neighbor" source-target pairs.

Conditional Text Generation Data-to-Text Generation

Paper
Code

Corrected CBOW Performs as well as Skip-gram

1 code implementation • EMNLP (insights) 2021 • Ozan İrsoy, Adrian Benton, Karl Stratos

Mikolov et al. (2013a) observed that continuous bag-of-words (CBOW) word embeddings tend to underperform Skip-gram (SG) embeddings, and this finding has been reported in subsequent works.

Word Embeddings

261

Paper
Code

Unsupervised Label Refinement Improves Dataless Text Classification

1 code implementation • Findings (ACL) 2021 • Zewei Chu, Karl Stratos, Kevin Gimpel

This reliance causes dataless classifiers to be highly sensitive to the choice of label descriptions and hinders the broader application of dataless classification in practice.

Ranked #3 on Zero-Shot Text Classification on AG News

Clustering General Classification +2

Paper
Code

Mining Knowledge for Natural Language Inference from Wikipedia Categories

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Mingda Chen, Zewei Chu, Karl Stratos, Kevin Gimpel

Accurate lexical entailment (LE) and natural language inference (NLI) often require large quantities of costly annotations.

Lexical Entailment Natural Language Inference

Paper
Code

NatCat: Weakly Supervised Text Classification with Naturally Annotated Resources

1 code implementation • AKBC 2021 • Zewei Chu, Karl Stratos, Kevin Gimpel

We describe NatCat, a large-scale resource for text classification constructed from three data sources: Wikipedia, Stack Exchange, and Reddit.

General Classification Text Categorization +1

Paper
Code

Discrete Latent Variable Representations for Low-Resource Text Classification

1 code implementation • ACL 2020 • Shuning Jin, Sam Wiseman, Karl Stratos, Karen Livescu

While much work on deep latent variable models of text uses continuous latent variables, discrete latent variables are interesting because they are more interpretable and typically more space efficient.

General Classification Sentence +2

Paper
Code

Learning Discrete Structured Representations by Adversarially Maximizing Mutual Information

1 code implementation • ICML 2020 • Karl Stratos, Sam Wiseman

We propose learning discrete structured representations from unlabeled data by maximizing the mutual information between a structured latent variable and a target variable.

Paper
Code

EntEval: A Holistic Evaluation Benchmark for Entity Representations

2 code implementations • IJCNLP 2019 • Mingda Chen, Zewei Chu, Yang Chen, Karl Stratos, Kevin Gimpel

Rich entity representations are useful for a wide class of problems involving entities.

Entity Disambiguation Entity Typing

Paper
Code

Label-Agnostic Sequence Labeling by Copying Nearest Neighbors

1 code implementation • ACL 2019 • Sam Wiseman, Karl Stratos

Retrieve-and-edit based approaches to structured prediction, where structures associated with retrieved neighbors are edited to form new structures, have recently attracted increased interest.

Structured Prediction

Paper
Code

Formal Limitations on the Measurement of Mutual Information

2 code implementations • ICLR 2019 • David McAllester, Karl Stratos

Measuring mutual information from finite data is difficult.

Paper
Code

Compositional Morpheme Embeddings with Affixes as Functions and Stems as Arguments

no code implementations • WS 2018 • Daniel Edmiston, Karl Stratos

StAffNet, the name of our architecture, shows competitive performance with the state-of-the-art on this task.

Dependency Parsing Word Embeddings

Paper
Add Code

Mutual Information Maximization for Simple and Accurate Part-Of-Speech Induction

1 code implementation • NAACL 2019 • Karl Stratos

We address part-of-speech (POS) induction by maximizing the mutual information between the induced label and its context.

Clustering POS

Paper
Code

OneNet: Joint Domain, Intent, Slot Prediction for Spoken Language Understanding

no code implementations • 16 Jan 2018 • Young-Bum Kim, Sungjin Lee, Karl Stratos

In practice, most spoken language understanding systems process user input in a pipelined manner; first domain is predicted, then intent and semantic slots are inferred according to the semantic frames of the predicted domain.

Spoken Language Understanding

Paper
Add Code

Reconstruction of Word Embeddings from Sub-Word Parameters

no code implementations • WS 2017 • Karl Stratos

Pre-trained word embeddings improve the performance of a neural model at the cost of increasing the model size.

Part-Of-Speech Tagging Word Embeddings +1

Paper
Add Code

A Sub-Character Architecture for Korean Language Processing

1 code implementation • EMNLP 2017 • Karl Stratos

We introduce a novel sub-character architecture that exploits a unique compositional structure of the Korean language.

Dependency Parsing

Paper
Code

Adversarial Adaptation of Synthetic or Stale Data

no code implementations • ACL 2017 • Young-Bum Kim, Karl Stratos, Dongchan Kim

Both cause a distribution mismatch between training and evaluation, leading to a model that overfits the flawed training data and performs poorly on the test data.

Domain Adaptation Spoken Language Understanding

Paper
Add Code

Domain Attention with an Ensemble of Experts

no code implementations • ACL 2017 • Young-Bum Kim, Karl Stratos, Dongchan Kim

When given domain K + 1, our model uses a weighted combination of the K domain experts{'} feedback along with its own opinion to make predictions on the new domain.

Domain Adaptation Spoken Language Understanding

Paper
Add Code

Entity Identification as Multitasking

1 code implementation • WS 2017 • Karl Stratos

Standard approaches in entity identification hard-code boundary detection and type prediction into labels (e. g., John/B-PER Smith/I-PER) and then perform Viterbi.

Boundary Detection Type prediction +1

Paper
Code

Domainless Adaptation by Constrained Decoding on a Schema Lattice

no code implementations • COLING 2016 • Young-Bum Kim, Karl Stratos, Ruhi Sarikaya

In many applications such as personal digital assistants, there is a constant need for new domains to increase the system{'}s coverage of user queries.

Multi-Label Classification Spoken Language Understanding

Paper
Add Code

Frustratingly Easy Neural Domain Adaptation

no code implementations • COLING 2016 • Young-Bum Kim, Karl Stratos, Ruhi Sarikaya

Popular techniques for domain adaptation such as the feature augmentation method of Daum{\'e} III (2009) have mostly been considered for sparse binary-valued features, but not for dense real-valued features such as those used in neural networks.

Domain Adaptation