Search Results for author: Hinrich Schütze

Found 197 papers, 93 papers with code

Don’t Forget Cheap Training Signals Before Building Unsupervised Bilingual Word Embeddings

no code implementations LREC (BUCC) 2022 Silvia Severini, Viktor Hangya, Masoud Jalili Sabet, Alexander Fraser, Hinrich Schütze

The two approaches we find most effective are: 1) using identical words as seed lexicons (which unsupervised approaches incorrectly assume are not available for orthographically distinct language pairs) and 2) combining such lexicons with pairs extracted by matching romanized versions of words with an edit distance threshold.

Cross-Lingual Transfer Word Embeddings
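
As a rough illustration of the second signal above (matching romanized word forms under an edit distance threshold), here is a minimal Python sketch; the tiny word lists and the romanization table are hypothetical stand-ins for a real transliteration tool:

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance via a single-row dynamic program."""
    dp = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        prev, dp[0] = dp[0], i
        for j, cb in enumerate(b, 1):
            prev, dp[j] = dp[j], min(dp[j] + 1,          # deletion
                                     dp[j - 1] + 1,      # insertion
                                     prev + (ca != cb))  # substitution
    return dp[-1]

def extract_pairs(src_words, tgt_words, romanize, threshold):
    """Keep word pairs whose romanizations are within the threshold."""
    return [(s, t) for s in src_words for t in tgt_words
            if edit_distance(romanize(s), romanize(t)) <= threshold]

# Hypothetical toy data; `roman` stands in for a real romanizer.
roman = {"λεμόνι": "lemoni", "μουσική": "mousiki"}
pairs = extract_pairs(["λεμόνι", "μουσική"], ["lemon", "music"],
                      lambda w: roman.get(w, w), threshold=3)
print(pairs)  # [('λεμόνι', 'lemon'), ('μουσική', 'music')]
```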

Separating Hate Speech and Offensive Language Classes via Adversarial Debiasing

1 code implementation NAACL (WOAH) 2022 Shuzhou Yuan, Antonis Maronikolakis, Hinrich Schütze

Research to tackle hate speech plaguing online media has made strides in providing solutions, analyzing bias and curating data.

Wine is not v i n. On the Compatibility of Tokenizations across Languages

no code implementations Findings (EMNLP) 2021 Antonis Maronikolakis, Philipp Dufter, Hinrich Schütze

The size of the vocabulary is a central design choice in large pretrained language models, with respect to both performance and memory requirements.
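
The compatibility question is easy to probe empirically: a shared multilingual vocabulary segments translations of the same word very differently across languages. A small sketch using the Hugging Face tokenizer API (the checkpoint is just one common choice; the exact splits depend on its vocabulary):

```python
from transformers import AutoTokenizer

# Inspect how one shared vocabulary segments the "same" word per language.
tok = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")

for word in ["wine", "vin", "Wein", "vino", "şarap"]:
    pieces = tok.tokenize(word)
    print(f"{word!r:10} -> {pieces} ({len(pieces)} subwords)")
```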

Multidomain Pretrained Language Models for Green NLP

1 code implementation EACL (AdaptNLP) 2021 Antonis Maronikolakis, Hinrich Schütze

Thus, instead of training multiple models, we can train a single multidomain model saving on computational resources and training time.

Domain Adaptation

Few-Shot Text Generation with Natural Language Instructions

no code implementations EMNLP 2021 Timo Schick, Hinrich Schütze

Providing pretrained language models with simple task descriptions in natural language enables them to solve some tasks in a fully unsupervised fashion.

Headline Generation text-classification +1

MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank

no code implementations 15 Mar 2024 Verena Blaschke, Barbara Kovačić, Siyao Peng, Hinrich Schütze, Barbara Plank

Despite the success of the Universal Dependencies (UD) project exemplified by its impressive language breadth, there is still a lack in 'within-language breadth': most treebanks focus on standard languages.

POS POS Tagging

Hybrid Human-LLM Corpus Construction and LLM Evaluation for Rare Linguistic Phenomena

no code implementations 11 Mar 2024 Leonie Weissweiler, Abdullatif Köksal, Hinrich Schütze

Argument Structure Constructions (ASCs) are one of the most well-studied construction groups, providing a unique opportunity to demonstrate the usefulness of Construction Grammar (CxG).

Dependency Parsing Sentence

Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge in English-Centric Large Language Models

no code implementations 28 Feb 2024 Ercong Nie, Shuzhou Yuan, Bolei Ma, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schütze

Despite the predominance of English in their training data, English-centric Large Language Models (LLMs) like GPT-3 and LLaMA display a remarkable ability to perform multilingual tasks, raising questions about the depth and nature of their cross-lingual capabilities.

Part-Of-Speech Tagging Sentence

Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models

1 code implementation 26 Feb 2024 Paul Röttger, Valentin Hofmann, Valentina Pyatkin, Musashi Hinck, Hannah Rose Kirk, Hinrich Schütze, Dirk Hovy

Motivated by this discrepancy, we challenge the prevailing constrained evaluation paradigm for values and opinions in LLMs and explore more realistic unconstrained evaluations.

Multiple-choice

GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural Network

no code implementations 18 Feb 2024 Shuzhou Yuan, Ercong Nie, Michael Färber, Helmut Schmid, Hinrich Schütze

Large Language Models (LLMs) exhibit strong In-Context Learning (ICL) capabilities when prompts with demonstrations are applied to them.

In-Context Learning text-classification +1

ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks

1 code implementation 29 Jan 2024 Bolei Ma, Ercong Nie, Shuzhou Yuan, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schütze

However, most previous studies primarily focused on sentence-level classification tasks, and only a few considered token-level labeling tasks such as Named Entity Recognition (NER) and Part-of-Speech (POS) tagging.

Benchmarking In-Context Learning +8

HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy

no code implementations 26 Jan 2024 Yongkang Liu, Yiqun Zhang, Qian Li, Tong Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schütze

As LMs grow in size, fine-tuning the full parameters of LMs requires a prohibitively large amount of GPU memory.

TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models

1 code implementation 12 Jan 2024 Yihong Liu, Chunlan Ma, Haotian Ye, Hinrich Schütze

As a result, mPLMs present a script barrier: representations from different scripts are located in different subspaces, which is a strong indicator of why crosslingual transfer involving languages of different scripts shows sub-optimal performance.

Contrastive Learning Transliteration

MoSECroT: Model Stitching with Static Word Embeddings for Crosslingual Zero-shot Transfer

no code implementations 9 Jan 2024 Haotian Ye, Yihong Liu, Chunlan Ma, Hinrich Schütze

In this paper, we introduce MoSECroT (Model Stitching with Static Word Embeddings for Crosslingual Zero-shot Transfer), a novel and challenging task that is especially relevant to low-resource languages for which static word embeddings are available.

Word Embeddings

Multilingual Word Embeddings for Low-Resource Languages using Anchors and a Chain of Related Languages

no code implementations 21 Nov 2023 Viktor Hangya, Silvia Severini, Radoslav Ralev, Alexander Fraser, Hinrich Schütze

In this paper, we propose to build multilingual word embeddings (MWEs) via a novel language chain-based approach that incorporates intermediate related languages to bridge the gap between the distant source and target.

Bilingual Lexicon Induction Multilingual NLP +1

OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining

no code implementations 15 Nov 2023 Yihong Liu, Peiqin Lin, Mingyang Wang, Hinrich Schütze

Therefore, a more efficient method is to adapt existing pretrained language models (PLMs) to new languages via vocabulary extension and continued pretraining.

Language Modelling Multilingual Word Embeddings

GlotLID: Language Identification for Low-Resource Languages

3 code implementations 24 Oct 2023 Amir Hossein Kargaran, Ayyoob Imani, François Yvon, Hinrich Schütze

Several recent papers have published good solutions for language identification (LID) for about 300 high-resource and medium-resource languages.

Dialect Identification

GradSim: Gradient-Based Language Grouping for Effective Multilingual Training

1 code implementation 23 Oct 2023 Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze

However, not all languages positively influence each other and it is an open research question how to select the most suitable set of languages for multilingual training and avoid negative interference among languages whose characteristics or data distributions are not compatible.

Sentiment Analysis

Unleashing the Multilingual Encoder Potential: Boosting Zero-Shot Performance via Probability Calibration

1 code implementation 8 Oct 2023 Ercong Nie, Helmut Schmid, Hinrich Schütze

Pretrained multilingual encoder models can directly perform zero-shot multilingual tasks or linguistic probing by reformulating the input examples into cloze-style prompts.

Position
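
A minimal sketch of the calibration idea named in the title: score each verbalized label at the mask position, then divide out a prior estimated from a content-free input so the cloze prediction is less dominated by the model's word-frequency biases. The checkpoint and verbalizer words below are illustrative assumptions, not the paper's exact setup:

```python
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

name = "xlm-roberta-base"  # illustrative multilingual encoder
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForMaskedLM.from_pretrained(name).eval()
label_ids = [tok.convert_tokens_to_ids(w) for w in ["▁good", "▁bad"]]

def label_probs(text: str) -> torch.Tensor:
    enc = tok(f"{text} It was {tok.mask_token}.", return_tensors="pt")
    with torch.no_grad():
        logits = model(**enc).logits
    pos = (enc.input_ids == tok.mask_token_id).nonzero()[0, 1]
    return logits[0, pos, label_ids].softmax(-1)

prior = label_probs("")                 # content-free input -> model's bias
scores = label_probs("A lovely film.") / prior
print(scores / scores.sum())            # calibrated label distribution
```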

GlotScript: A Resource and Tool for Low Resource Writing System Identification

1 code implementation 23 Sep 2023 Amir Hossein Kargaran, François Yvon, Hinrich Schütze

We present GlotScript, an open resource and tool for low resource writing system identification.

Language Modelling

Cross-Lingual Constituency Parsing for Middle High German: A Delexicalized Approach

no code implementations 9 Aug 2023 Ercong Nie, Helmut Schmid, Hinrich Schütze

However, training an automatic syntactic analysis system for ancient languages solely relying on annotated parse data is a formidable task due to the inherent challenges in building treebanks for such languages.

Constituency Parsing Cross-Lingual Transfer

Is Prompt-Based Finetuning Always Better than Vanilla Finetuning? Insights from Cross-Lingual Language Understanding

1 code implementation 15 Jul 2023 Bolei Ma, Ercong Nie, Helmut Schmid, Hinrich Schütze

We conduct comprehensive experiments on diverse cross-lingual language understanding tasks (sentiment classification, paraphrase identification, and natural language inference) and empirically analyze the variation trends of prompt-based finetuning performance in cross-lingual transfer across different few-shot and full-data settings.

Natural Language Inference Natural Language Understanding +4

On the Copying Problem of Unsupervised NMT: A Training Schedule with a Language Discriminator Loss

1 code implementation 26 May 2023 Yihong Liu, Alexandra Chronopoulou, Hinrich Schütze, Alexander Fraser

By conducting extensive experiments on different language pairs, including similar and distant, high and low-resource languages, we find that our method alleviates the copying problem, thus improving the translation performance on low-resource languages.

Machine Translation NMT +2

Evaluate What You Can't Evaluate: Unassessable Quality for Generated Response

no code implementations 24 May 2023 Yongkang Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schütze

There are risks in using reference-free evaluators based on LLMs to evaluate the quality of dialogue responses.

Dialogue Generation

RET-LLM: Towards a General Read-Write Memory for Large Language Models

1 code implementation 23 May 2023 Ali Modarressi, Ayyoob Imani, Mohsen Fayyaz, Hinrich Schütze

Large language models (LLMs) have significantly advanced the field of natural language processing (NLP) through their extensive parameters and comprehensive data utilization.

Question Answering

mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models

1 code implementation 23 May 2023 Peiqin Lin, Chengzhi Hu, Zheyu Zhang, André F. T. Martins, Hinrich Schütze

Recent multilingual pretrained language models (mPLMs) have been shown to encode strong language-specific signals, which are not explicitly provided during pretraining.

Open-Ended Question Answering Zero-Shot Cross-Lingual Transfer

Language-Agnostic Bias Detection in Language Models with Bias Probing

no code implementations 22 May 2023 Abdullatif Köksal, Omer Faruk Yalcin, Ahmet Akbiyik, M. Tahir Kilavuz, Anna Korhonen, Hinrich Schütze

For nationality as a case study, we show that LABDet 'surfaces' nationality bias by training a classifier on top of a frozen PLM on non-nationality sentiment detection.

Bias Detection

A study of conceptual language similarity: comparison and evaluation

no code implementations 22 May 2023 Haotian Ye, Yihong Liu, Hinrich Schütze

An interesting line of research in natural language processing (NLP) aims to incorporate linguistic typology to bridge linguistic diversity and assist the research of low-resource languages.

Binary Classification

A Crosslingual Investigation of Conceptualization in 1335 Languages

3 code implementations 15 May 2023 Yihong Liu, Haotian Ye, Leonie Weissweiler, Philipp Wicke, Renhao Pei, Robert Zangenfeind, Hinrich Schütze

The resulting measure for the conceptual similarity of two languages is complementary to standard genealogical, typological, and surface similarity measures.

Taxi1500: A Multilingual Dataset for Text Classification in 1500 Languages

no code implementations 15 May 2023 Chunlan Ma, Ayyoob ImaniGooghari, Haotian Ye, Ehsaneddin Asgari, Hinrich Schütze

While natural language processing tools have been developed extensively for some of the world's languages, a significant portion of the world's over 7000 languages are still neglected.

text-classification Text Classification

NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis

no code implementations 28 Apr 2023 Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze

In this work, we propose to leverage language-adaptive and task-adaptive pretraining on African texts and study transfer learning with source language selection on top of an African language-centric pretrained language model.

Language Modelling Sentiment Analysis +1

Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages

5 code implementations 20 Apr 2023 Verena Blaschke, Hinrich Schütze, Barbara Plank

This can for instance be observed when finetuning PLMs on one language and evaluating them on data in a closely related language variety with no standardized orthography.

Cross-Lingual Transfer Part-Of-Speech Tagging +2

A Survey of Corpora for Germanic Low-Resource Languages and Dialects

2 code implementations 19 Apr 2023 Verena Blaschke, Hinrich Schütze, Barbara Plank

In this work, we instead focus on low-resource languages and in particular non-standardized low-resource languages.

Sociocultural knowledge is needed for selection of shots in hate speech detection tasks

no code implementations 4 Apr 2023 Antonis Maronikolakis, Abdullatif Köksal, Hinrich Schütze

We introduce HATELEXICON, a lexicon of slurs and targets of hate speech for the countries of Brazil, Germany, India and Kenya, to aid training and interpretability of models.

Few-Shot Learning Hate Speech Detection

Construction Grammar Provides Unique Insight into Neural Language Models

no code implementations 4 Feb 2023 Leonie Weissweiler, Taiqi He, Naoki Otani, David R. Mortensen, Lori Levin, Hinrich Schütze

Construction Grammar (CxG) has recently been used as the basis for probing studies that have investigated the performance of large pretrained language models (PLMs) with respect to the structure and meaning of constructions.

Position

Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages

1 code implementation 19 Dec 2022 Ercong Nie, Sheng Liang, Helmut Schmid, Hinrich Schütze

Multilingual Pretrained Language Models (MPLMs) have shown their strong multilinguality in recent empirical cross-lingual transfer studies.

Cross-Lingual Transfer Natural Language Inference +3

MEAL: Stable and Active Learning for Few-Shot Prompting

1 code implementation 15 Nov 2022 Abdullatif Köksal, Timo Schick, Hinrich Schütze

Few-shot classification has made great strides due to foundation models that, through priming and prompting, are highly effective few-shot learners.

Active Learning Few-Shot Learning +1

Graph-Based Multilingual Label Propagation for Low-Resource Part-of-Speech Tagging

1 code implementation 18 Oct 2022 Ayyoob Imani, Silvia Severini, Masoud Jalili Sabet, François Yvon, Hinrich Schütze

An established method for training a POS tagger in such a scenario is to create a labeled training set by transferring from high-resource languages.

Part-Of-Speech Tagging POS +3
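
The transfer step can be pictured with a toy version of graph-based label propagation: tags from labeled (high-resource) nodes are pushed along graph edges to unlabeled target-language nodes. A self-contained sketch in which the adjacency matrix and tag set are made up:

```python
import numpy as np

def propagate(A, labels, mask, iters=50):
    """A: (n, n) adjacency; labels: (n, k) one-hot rows where mask is True."""
    P = A / A.sum(axis=1, keepdims=True)    # row-normalized transitions
    Y = np.where(mask[:, None], labels, 1.0 / labels.shape[1])
    for _ in range(iters):
        Y = P @ Y                           # average neighbor tag distributions
        Y[mask] = labels[mask]              # clamp the known labels
    return Y.argmax(axis=1)

# Node 3 is an unlabeled target-language word whose only neighbor is node 1.
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 1],
              [0, 1, 0, 0],
              [0, 1, 0, 0]], float)
labels = np.eye(2)[[0, 1, 0, 0]]            # last row is a placeholder
mask = np.array([True, True, True, False])
print(propagate(A, labels, mask))           # -> [0 1 0 1]
```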

Federated Continual Learning for Text Classification via Selective Inter-client Transfer

1 code implementation 12 Oct 2022 Yatin Chaudhary, Pranav Rai, Matthias Schubert, Hinrich Schütze, Pankaj Gupta

The objective of Federated Continual Learning (FCL) is to improve deep learning models over life time at each client by (relevant and efficient) knowledge transfer without sharing data.

Continual Learning Federated Learning +3

Modeling Content-Emotion Duality via Disentanglement for Empathetic Conversation

1 code implementation 26 Sep 2022 Peiqin Lin, Jiashuo Wang, Hinrich Schütze, Wenjie Li

To solve the task, it is essential to model the content-emotion duality of a dialogue, which is composed of the content view (i.e., what personal experiences are described) and the emotion view (i.e., the feelings of the speaker on these experiences).

Disentanglement Empathetic Response Generation +1

Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions

no code implementations 28 Jul 2022 Yanai Elazar, Nora Kassner, Shauli Ravfogel, Amir Feder, Abhilasha Ravichander, Marius Mosbach, Yonatan Belinkov, Hinrich Schütze, Yoav Goldberg

Our causal framework and our results demonstrate the importance of studying datasets and the benefits of causality for understanding NLP models.

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

3 code implementations 9 Jun 2022 Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew Dai, Andrew La, Andrew Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakaş, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartłomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, César Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodola, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, Francois Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocoń, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, Jose Hernandez-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras-Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Şenel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, Maria Jose Ramírez Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Orduna Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael Ivanitskiy, Michael Starritt, Michael Strube, Michał Swędrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T, Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha S. Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Miłkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima, Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, ZiRui Wang, Ziyi Wu

BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models.

Common Sense Reasoning Math +1

Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes

no code implementations NAACL (GeBNLP) 2022 Antonis Maronikolakis, Philip Baader, Hinrich Schütze

To tackle the rising phenomenon of hate speech, efforts have been made towards data curation and analysis.

Flow-Adapter Architecture for Unsupervised Machine Translation

no code implementations ACL 2022 Yihong Liu, Haris Jabbar, Hinrich Schütze

The primary novelties of our model are: (a) capturing language-specific sentence representations separately for each language using normalizing flows and (b) using a simple transformation of these latent representations for translating from one language to another.

NMT Sentence +2

CaMEL: Case Marker Extraction without Labels

1 code implementation ACL 2022 Leonie Weissweiler, Valentin Hofmann, Masoud Jalili Sabet, Hinrich Schütze

We introduce CaMEL (Case Marker Extraction without Labels), a novel and challenging task in computational morphology that is especially relevant for low-resource languages.

ECOLA: Enhanced Temporal Knowledge Embeddings with Contextualized Language Representations

no code implementations 17 Mar 2022 Zhen Han, Ruotong Liao, Jindong Gu, Yao Zhang, Zifeng Ding, Yujia Gu, Heinz Köppl, Hinrich Schütze, Volker Tresp

Since conventional knowledge embedding models cannot take full advantage of the abundant textual information, there have been extensive research efforts in enhancing knowledge embedding using texts.

Knowledge Graph Embedding Link Prediction +1

Geographic Adaptation of Pretrained Language Models

no code implementations 16 Mar 2022 Valentin Hofmann, Goran Glavaš, Nikola Ljubešić, Janet B. Pierrehumbert, Hinrich Schütze

While pretrained language models (PLMs) have been shown to possess a plethora of linguistic knowledge, the existing body of research has largely neglected extralinguistic knowledge, which is generally difficult to obtain by pretraining on text alone.

Language Identification Language Modelling +2

Semantic-Oriented Unlabeled Priming for Large-Scale Language Models

no code implementations 12 Feb 2022 Yanchen Liu, Timo Schick, Hinrich Schütze

Due to the high costs associated with finetuning large language models, various recent works propose to adapt them to specific tasks without any parameter updates through in-context learning.

In-Context Learning

Towards a Broad Coverage Named Entity Resource: A Data-Efficient Approach for Many Diverse Languages

no code implementations LREC 2022 Silvia Severini, Ayyoob Imani, Philipp Dufter, Hinrich Schütze

Prior work on extracting MNE datasets from parallel corpora required resources such as large monolingual corpora or word aligners that are unavailable or perform poorly for underresourced languages.

Bilingual Lexicon Induction Transliteration

BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief

no code implementations EMNLP 2021 Nora Kassner, Oyvind Tafjord, Hinrich Schütze, Peter Clark

We show that, in a controlled experimental setting, these two mechanisms result in more consistent beliefs in the overall system, improving both the accuracy and consistency of its answers over time.

Language Modelling World Knowledge

Active Learning for Argument Mining: A Practical Approach

no code implementations 28 Sep 2021 Nikolai Solmsdorf, Dietrich Trautmann, Hinrich Schütze

Despite considerable recent progress, the creation of well-balanced and diverse resources remains a time-consuming and costly challenge in Argument Mining.

Active Learning Argument Mining

Scene Graph Generation for Better Image Captioning?

no code implementations 23 Sep 2021 Maximilian Mozes, Martin Schmitt, Vladimir Golkov, Hinrich Schütze, Daniel Cremers

We investigate the incorporation of visual relationships into the task of supervised image caption generation by proposing a model that leverages detected objects and auto-generated visual relationships to describe images in natural language.

Graph Generation Image Captioning +1

BERT Cannot Align Characters

no code implementations EMNLP (insights) 2021 Antonis Maronikolakis, Philipp Dufter, Hinrich Schütze

We show that the closer two languages are, the better BERT can align them on the character level.

Locating Language-Specific Information in Contextualized Embeddings

1 code implementation 16 Sep 2021 Sheng Liang, Philipp Dufter, Hinrich Schütze

Multilingual pretrained language models (MPLMs) exhibit multilinguality and are well suited for transfer across languages.

Graph Algorithms for Multiparallel Word Alignment

1 code implementation EMNLP 2021 Ayyoob Imani, Masoud Jalili Sabet, Lütfi Kerem Şenel, Philipp Dufter, François Yvon, Hinrich Schütze

With the advent of end-to-end deep learning approaches in machine translation, interest in word alignments initially decreased; however, they have again become a focus of research more recently.

Link Prediction Machine Translation +3

Continuous Entailment Patterns for Lexical Inference in Context

1 code implementation EMNLP 2021 Martin Schmitt, Hinrich Schütze

If we allow for tokens outside the PLM's vocabulary, patterns can be adapted more flexibly to a PLM's idiosyncrasies.

Few-Shot NLI Lexical Entailment +1

Discrete and Soft Prompting for Multilingual Models

1 code implementation EMNLP 2021 Mengjie Zhao, Hinrich Schütze

It has been shown for English that discrete and soft prompting perform strongly in few-shot learning with pretrained language models (PLMs).

Few-Shot Learning Natural Language Inference
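
The 'soft' half of the comparison can be sketched in a few lines: instead of discrete pattern tokens, a small matrix of trainable vectors is prepended to the model's input embeddings. A minimal PyTorch sketch, not tied to any particular PLM:

```python
import torch
import torch.nn as nn

class SoftPrompt(nn.Module):
    """Prepends n trainable prompt vectors to a sequence of token embeddings."""
    def __init__(self, n_tokens: int, dim: int):
        super().__init__()
        self.prompt = nn.Parameter(torch.randn(n_tokens, dim) * 0.02)

    def forward(self, token_embeds: torch.Tensor) -> torch.Tensor:
        # token_embeds: (batch, seq_len, dim) from the PLM's embedding layer
        p = self.prompt.unsqueeze(0).expand(token_embeds.size(0), -1, -1)
        return torch.cat([p, token_embeds], dim=1)

soft = SoftPrompt(n_tokens=10, dim=768)
print(soft(torch.zeros(2, 16, 768)).shape)  # torch.Size([2, 26, 768])
```

In this setup only `prompt` receives gradients; the PLM stays frozen and its attention mask is extended by n_tokens positions.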

ParCourE: A Parallel Corpus Explorer for a Massively Multilingual Corpus

no code implementations ACL 2021 Ayyoob Imani, Masoud Jalili Sabet, Philipp Dufter, Michael Cysouw, Hinrich Schütze

With more than 7000 languages worldwide, multilingual natural language processing (NLP) is essential both from an academic and commercial perspective.

Multilingual NLP Transfer Learning

Modeling Ideological Salience and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity

1 code implementation Findings (NAACL) 2022 Valentin Hofmann, Xiaowen Dong, Janet B. Pierrehumbert, Hinrich Schütze

The increasing polarization of online political discourse calls for computational tools that automatically detect and monitor ideological divides in social media.

Multi-source Neural Topic Modeling in Multi-view Embedding Spaces

1 code implementation NAACL 2021 Pankaj Gupta, Yatin Chaudhary, Hinrich Schütze

Though word embeddings and topics are complementary representations, several past works have only used pretrained word embeddings in (neural) topic modeling to address data sparsity in short texts or small collections of documents.

Information Retrieval Retrieval +1

Generating Datasets with Pretrained Language Models

2 code implementations EMNLP 2021 Timo Schick, Hinrich Schütze

To obtain high-quality sentence embeddings from pretrained language models (PLMs), they must either be augmented with additional pretraining objectives or finetuned on a large set of labeled text pairs.

Semantic Textual Similarity Sentence +1

Static Embeddings as Efficient Knowledge Bases?

1 code implementation NAACL 2021 Philipp Dufter, Nora Kassner, Hinrich Schütze

Recent research investigates factual knowledge stored in large pretrained language models (PLMs).

Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP

3 code implementations 28 Feb 2021 Timo Schick, Sahana Udupa, Hinrich Schütze

In this paper, we first demonstrate a surprising finding: pretrained language models recognize, to a considerable degree, their undesirable biases and the toxicity of the content they produce.

Language Modelling

Language Models for Lexical Inference in Context

1 code implementation EACL 2021 Martin Schmitt, Hinrich Schütze

Lexical inference in context (LIiC) is the task of recognizing textual entailment between two very similar sentences, i.e., sentences that only differ in one expression.

Few-Shot NLI Natural Language Inference

Improving Scene Graph Classification by Exploiting Knowledge from Texts

no code implementations 9 Feb 2021 Sahand Sharifzadeh, Sina Moayed Baharlou, Martin Schmitt, Hinrich Schütze, Volker Tresp

We show that by fine-tuning the classification pipeline with the extracted knowledge from texts, we can achieve ~8x more accurate results in scene graph classification, ~3x in object classification, and ~1.5x in predicate classification, compared to the supervised baselines with only 1% of the annotated images.

General Classification Graph Classification +7

Does He Wink or Does He Nod? A Challenging Benchmark for Evaluating Word Understanding of Language Models

no code implementations 6 Feb 2021 Lutfi Kerem Senel, Hinrich Schütze

Recent progress in pretraining language models on large corpora has resulted in large performance gains on many NLP tasks.

Language Modelling

Measuring and Improving Consistency in Pretrained Language Models

1 code implementation 1 Feb 2021 Yanai Elazar, Nora Kassner, Shauli Ravfogel, Abhilasha Ravichander, Eduard Hovy, Hinrich Schütze, Yoav Goldberg

In this paper we study the question: Are Pretrained Language Models (PLMs) consistent with respect to factual knowledge?

A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters

no code implementations ACL 2021 Mengjie Zhao, Yi Zhu, Ehsan Shareghi, Ivan Vulić, Roi Reichart, Anna Korhonen, Hinrich Schütze

Few-shot crosslingual transfer has been shown to outperform its zero-shot counterpart with pretrained encoders like multilingual BERT.

Few-Shot Learning

Few-Shot Text Generation with Pattern-Exploiting Training

2 code implementations 22 Dec 2020 Timo Schick, Hinrich Schütze

Providing pretrained language models with simple task descriptions in natural language enables them to solve some tasks in a fully unsupervised fashion.

Headline Generation text-classification +2

Subword Sampling for Low Resource Word Alignment

no code implementations 21 Dec 2020 Ehsaneddin Asgari, Masoud Jalili Sabet, Philipp Dufter, Christopher Ringlstetter, Hinrich Schütze

This method's hypothesis is that the aggregation of different granularities of text for certain language pairs can help word-level alignment.

Bayesian Optimization Machine Translation +1

Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification

2 code implementations COLING 2020 Timo Schick, Helmut Schmid, Hinrich Schütze

A recent approach for few-shot text classification is to convert textual inputs to cloze questions that contain some form of task description, process them with a pretrained language model and map the predicted words to labels.

Few-Shot Text Classification General Classification +3
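
The cloze reformulation this snippet describes is easy to reproduce with a masked language model: a pattern wraps the input, and a verbalizer maps predicted words back to labels. The pattern, verbalizer, and checkpoint here are illustrative choices:

```python
from transformers import pipeline

fill = pipeline("fill-mask", model="bert-base-uncased")
verbalizer = {"great": "positive", "terrible": "negative"}

review = "The plot was thin but the acting saved it."
preds = fill(f"{review} All in all, it was [MASK].",
             targets=list(verbalizer))        # score only the label words
for p in preds:
    print(verbalizer[p["token_str"]], round(p["score"], 4))
```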

Dynamic Contextualized Word Embeddings

1 code implementation ACL 2021 Valentin Hofmann, Janet B. Pierrehumbert, Hinrich Schütze

Static word embeddings that represent words by a single vector cannot capture the variability of word meaning in different linguistic and extralinguistic contexts.

Language Modelling Word Embeddings

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

5 code implementations NAACL 2021 Timo Schick, Hinrich Schütze

When scaled to hundreds of billions of parameters, pretrained language models such as GPT-3 (Brown et al., 2020) achieve remarkable few-shot performance.

Natural Language Understanding

Automatic Domain Adaptation Outperforms Manual Domain Adaptation for Predicting Financial Outcomes

no code implementations ACL 2019 Marina Sedinkina, Nikolas Breitkopf, Hinrich Schütze

In our experiments, we demonstrate that the automatically adapted sentiment dictionary outperforms the previous state of the art in predicting the financial outcomes excess return and volatility.

Domain Adaptation

Neural Topic Modeling with Continual Lifelong Learning

1 code implementation ICML 2020 Pankaj Gupta, Yatin Chaudhary, Thomas Runkler, Hinrich Schütze

To address the problem, we propose a lifelong learning framework for neural topic modeling that can continuously process streams of document collections, accumulate topics and guide future topic modeling tasks by knowledge transfer from several sources to better deal with the sparse data.

Data Augmentation Information Retrieval +2

Explainable and Discourse Topic-aware Neural Language Understanding

1 code implementation ICML 2020 Yatin Chaudhary, Hinrich Schütze, Pankaj Gupta

Marrying topic models and language models exposes language understanding to a broader source of document-level context beyond sentences via topics.

Document Classification Language Modelling +5

Unsupervised Embedding-based Detection of Lexical Semantic Changes

no code implementations 16 May 2020 Ehsaneddin Asgari, Christoph Ringlstetter, Hinrich Schütze

This paper describes EmbLexChange, a system introduced by the "Life-Language" team for SemEval-2020 Task 1, on unsupervised detection of lexical-semantic changes.

Identifying Necessary Elements for BERT's Multilinguality

1 code implementation 1 May 2020 Philipp Dufter, Hinrich Schütze

We aim to identify architectural properties of BERT and linguistic properties of languages that are necessary for BERT to become multilingual.

Masking as an Efficient Alternative to Finetuning for Pretrained Language Models

no code implementations EMNLP 2020 Mengjie Zhao, Tao Lin, Fei Mi, Martin Jaggi, Hinrich Schütze

We present an efficient method of utilizing pretrained language models, where we learn selective binary masks for pretrained weights in lieu of modifying them through finetuning.
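
The core trick can be sketched with a straight-through estimator: keep the pretrained weights frozen and learn real-valued scores that are thresholded into a binary mask in the forward pass. This is a conceptual sketch of selective masking, not the paper's exact training recipe:

```python
import torch
import torch.nn as nn

class MaskedLinear(nn.Module):
    """Frozen linear layer whose weights are gated by a learned binary mask."""
    def __init__(self, linear: nn.Linear, init_score: float = 2.0):
        super().__init__()
        self.weight = linear.weight.detach()      # pretrained, frozen
        self.bias = linear.bias.detach()
        self.scores = nn.Parameter(torch.full_like(self.weight, init_score))

    def forward(self, x):
        soft = torch.sigmoid(self.scores)
        hard = (soft > 0.5).float()                # binary mask in the forward
        mask = hard + soft - soft.detach()         # straight-through gradient
        return nn.functional.linear(x, self.weight * mask, self.bias)

layer = MaskedLinear(nn.Linear(768, 768))
out = layer(torch.randn(4, 768))                   # only `scores` is trainable
```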

Quantifying the Contextualization of Word Representations with Semantic Class Probing

no code implementations Findings of the Association for Computational Linguistics 2020 Mengjie Zhao, Philipp Dufter, Yadollah Yaghoobzadeh, Hinrich Schütze

Pretrained language models have achieved a new state of the art on many NLP tasks, but there are still many open questions about how and why they work so well.

SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings

3 code implementations Findings of the Association for Computational Linguistics 2020 Masoud Jalili Sabet, Philipp Dufter, François Yvon, Hinrich Schütze

We find that alignments created from embeddings are superior for four and comparable for two language pairs compared to those produced by traditional statistical aligners, even with abundant parallel data; e.g., contextualized embeddings achieve a word alignment F1 for English-German that is 5 percentage points higher than eflomal, a high-quality statistical aligner, trained on 100k parallel sentences.

Machine Translation Multilingual Word Embeddings +3
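
One alignment heuristic from this line of work is easily sketched: embed both sentences, build a cosine similarity matrix, and keep token pairs that are mutual argmaxes. A toy NumPy version with random stand-in embeddings:

```python
import numpy as np

def argmax_align(E_src, E_tgt):
    """Align (i, j) when i and j are each other's most similar token."""
    S = E_src @ E_tgt.T
    S /= np.linalg.norm(E_src, axis=1, keepdims=True)
    S /= np.linalg.norm(E_tgt, axis=1)
    fwd, bwd = S.argmax(axis=1), S.argmax(axis=0)
    return [(i, j) for i, j in enumerate(fwd) if bwd[j] == i]

rng = np.random.default_rng(0)
E = rng.normal(size=(3, 8))            # stand-in contextualized embeddings
print(argmax_align(E, E[[2, 0, 1]]))   # permuted copy -> [(0, 1), (1, 2), (2, 0)]
```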

Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference

6 code implementations 21 Jan 2020 Timo Schick, Hinrich Schütze

Some NLP tasks can be solved in a fully unsupervised fashion by providing a pretrained language model with "task descriptions" in natural language (e.g., Radford et al., 2019).

Few-Shot Text Classification General Classification +3

Multipurpose Intelligent Process Automation via Conversational Assistant

no code implementations 7 Jan 2020 Alena Moiseeva, Dietrich Trautmann, Michael Heimann, Hinrich Schütze

Such intelligent agents can assist the user by answering specific questions and executing routine tasks that are ordinarily performed in a natural language (i.e., customer support).

Transfer Learning

Extending Machine Language Models toward Human-Level Language Understanding

no code implementations 12 Dec 2019 James L. McClelland, Felix Hill, Maja Rudolph, Jason Baldridge, Hinrich Schütze

We take language to be a part of a system for understanding and communicating about situations.

Morphological Segmentation Inside-Out

no code implementations EMNLP 2016 Ryan Cotterell, Arun Kumar, Hinrich Schütze

Morphological segmentation has traditionally been modeled with non-hierarchical models, which yield flat segmentations as output.

Morphological Analysis Segmentation

E-BERT: Efficient-Yet-Effective Entity Embeddings for BERT

1 code implementation Findings of the Association for Computational Linguistics 2020 Nina Poerner, Ulli Waltinger, Hinrich Schütze

We present a novel way of injecting factual knowledge about entities into the pretrained BERT model (Devlin et al., 2019): We align Wikipedia2Vec entity vectors (Yamada et al., 2016) with BERT's native wordpiece vector space and use the aligned entity vectors as if they were wordpiece vectors.

Entity Embeddings Entity Linking +3

Sentence Meta-Embeddings for Unsupervised Semantic Textual Similarity

no code implementations ACL 2020 Nina Poerner, Ulli Waltinger, Hinrich Schütze

We address the task of unsupervised Semantic Textual Similarity (STS) by ensembling diverse pre-trained sentence encoders into sentence meta-embeddings.

Dimensionality Reduction Semantic Textual Similarity +2

Negated and Misprimed Probes for Pretrained Language Models: Birds Can Talk, But Cannot Fly

2 code implementations ACL 2020 Nora Kassner, Hinrich Schütze

We find that PLMs do not distinguish between negated ("Birds cannot [MASK]") and non-negated ("Birds can [MASK]") cloze questions.

Language Modelling Negation +1
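
The probe format is simple enough to try directly with any masked language model (the checkpoint below is an arbitrary choice):

```python
from transformers import pipeline

fill = pipeline("fill-mask", model="bert-base-cased")

for prompt in ["Birds can [MASK].", "Birds cannot [MASK]."]:
    top = fill(prompt, top_k=3)
    print(prompt, "->", [p["token_str"] for p in top])
```

If the two prediction lists overlap heavily, the model is ignoring the negation, which is the paper's central finding.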

BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Model Performance

1 code implementation ACL 2020 Timo Schick, Hinrich Schütze

In this work, we transfer this idea to pretrained language models: We introduce BERTRAM, a powerful architecture based on BERT that is capable of inferring high-quality embeddings for rare words that are suitable as input representations for deep language models.

Language Modelling Word Embeddings

Type-aware Convolutional Neural Networks for Slot Filling

no code implementations 1 Oct 2019 Heike Adel, Hinrich Schütze

In particular, we explore different ways of integrating the named entity types of the relation arguments into a neural network for relation classification, including a joint training and a structured prediction approach.

coreference-resolution General Classification +6

Generating Multi-Sentence Abstractive Summaries of Interleaved Texts

no code implementations 25 Sep 2019 Sanjeev Kumar Karn, Francine Chen, Yan-Ying Chen, Ulli Waltinger, Hinrich Schütze

The interleaved posts are encoded hierarchically, i.e., word-to-word (words in a post) followed by post-to-post (posts in a channel).

Disentanglement Sentence

Multi-source Multi-view Transfer Learning in Neural Topic Modeling with Pretrained Topic and Word Embeddings

no code implementations 25 Sep 2019 Pankaj Gupta, Yatin Chaudhary, Hinrich Schütze

Though word embeddings and topics are complementary representations, several past works have only used pretrained word embeddings in (neural) topic modeling to address the data sparsity problem in short texts or small collections of documents.

Information Retrieval Retrieval +2

Multi-view and Multi-source Transfers in Neural Topic Modeling with Pretrained Topic and Word Embeddings

no code implementations 14 Sep 2019 Pankaj Gupta, Yatin Chaudhary, Hinrich Schütze

Though word embeddings and topics are complementary representations, several past works have only used pre-trained word embeddings in (neural) topic modeling to address the data sparsity problem in short texts or small collections of documents.

Information Retrieval Retrieval +2

Neural Architectures for Fine-Grained Propaganda Detection in News

no code implementations WS 2019 Pankaj Gupta, Khushbu Saxena, Usama Yaseen, Thomas Runkler, Hinrich Schütze

To address the tasks of sentence (SLC) and fragment level (FLC) propaganda detection, we explore different neural architectures (e.g., CNN, LSTM-CRF and BERT) and extract linguistic (e.g., part-of-speech, named entity, readability, sentiment, emotion, etc.)

Propaganda detection Sentence

A Hierarchical Decoder with Three-level Hierarchical Attention to Generate Abstractive Summaries of Interleaved Texts

no code implementations 5 Jun 2019 Sanjeev Kumar Karn, Francine Chen, Yan-Ying Chen, Ulli Waltinger, Hinrich Schütze

Interleaved texts, where posts belonging to different threads occur in one sequence, are a common occurrence, e.g., online chat conversations.

SherLIiC: A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference

1 code implementation ACL 2019 Martin Schmitt, Hinrich Schütze

We present SherLIiC, a testbed for lexical inference in context (LIiC), consisting of 3985 manually annotated inference rule candidates (InfCands), accompanied by (i) ~960k unlabeled InfCands, and (ii) ~190k typed textual relations between Freebase entities extracted from the large entity-linked corpus ClueWeb09.

Lexical Entailment Natural Language Inference

Analytical Methods for Interpretable Ultradense Word Embeddings

1 code implementation IJCNLP 2019 Philipp Dufter, Hinrich Schütze

In this work, we investigate three methods for making word spaces interpretable by rotation: Densifier (Rothe et al., 2016), linear SVMs and DensRay, a new method we propose.

Word Embeddings

Rare Words: A Major Problem for Contextualized Embeddings And How to Fix it by Attentive Mimicking

2 code implementations 14 Apr 2019 Timo Schick, Hinrich Schütze

Pretraining deep neural network architectures with a language modeling objective has brought large improvements for many natural language processing tasks.

Language Modelling

Attentive Mimicking: Better Word Embeddings by Attending to Informative Contexts

1 code implementation NAACL 2019 Timo Schick, Hinrich Schütze

Learning high-quality embeddings for rare words is a hard problem because of sparse context information.

Word Embeddings

Learning Semantic Representations for Novel Words: Leveraging Both Form and Context

1 code implementation 9 Nov 2018 Timo Schick, Hinrich Schütze

The general problem setting is that word embeddings are induced on an unlabeled training corpus and then a model is trained that embeds novel words into this induced embedding space.

Learning Semantic Representations Word Embeddings

Multi-Multi-View Learning: Multilingual and Multi-Representation Entity Typing

1 code implementation EMNLP 2018 Yadollah Yaghoobzadeh, Hinrich Schütze

For representation, we consider representations based on the context distribution of the entity (i.e., on its embedding), on the entity's name (i.e., on its surface form) and on its description in Wikipedia.

Entity Typing Multiview Learning +1

Neural Relation Extraction Within and Across Sentence Boundaries

1 code implementation 11 Oct 2018 Pankaj Gupta, Subburam Rajaram, Hinrich Schütze, Bernt Andrassy, Thomas Runkler

iDepNN models the shortest and augmented dependency paths via recurrent and recursive neural networks to extract relationships within (intra-) and across (inter-) sentence boundaries.

Relation Relation Extraction +1

textTOvec: Deep Contextualized Neural Autoregressive Topic Models of Language with Distributed Compositional Prior

1 code implementation ICLR 2019 Pankaj Gupta, Yatin Chaudhary, Florian Buettner, Hinrich Schütze

We address two challenges of probabilistic topic modelling in order to better estimate the probability of a word in a given context, i.e., P(word|context): (1) No Language Structure in Context: Probabilistic topic models ignore word order by summarizing a given context as a "bag-of-words" and consequently the semantics of words in the context is lost.

Information Extraction Information Retrieval +4
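
Challenge (1) is easy to see concretely: a bag-of-words context assigns identical representations to contexts with different word order. A two-line demonstration:

```python
from collections import Counter

# Two contexts with opposite meanings but the same bag of words.
a = "man bites dog".split()
b = "dog bites man".split()
print(Counter(a) == Counter(b))  # True: word order is invisible to the model
```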

Interpretable Textual Neuron Representations for NLP

2 code implementations WS 2018 Nina Poerner, Benjamin Roth, Hinrich Schütze

Input optimization methods, such as Google Deep Dream, create interpretable representations of neurons for computer vision DNNs.

Document Informed Neural Autoregressive Topic Models with Distributional Prior

1 code implementation 15 Sep 2018 Pankaj Gupta, Yatin Chaudhary, Florian Buettner, Hinrich Schütze

Here, we extend a neural autoregressive topic model to exploit the full context information around words in a document in a language modeling fashion.

Language Modelling Retrieval +1

Neural Semi-Markov Conditional Random Fields for Robust Character-Based Part-of-Speech Tagging

no code implementations NAACL 2019 Apostolos Kemos, Heike Adel, Hinrich Schütze

Character-level models of tokens have been shown to be effective at dealing with within-token noise and out-of-vocabulary words.

Part-Of-Speech Tagging

Document Informed Neural Autoregressive Topic Models

1 code implementation 11 Aug 2018 Pankaj Gupta, Florian Buettner, Hinrich Schütze

Context information around words helps in determining their actual meaning, for example "networks" used in contexts of artificial neural networks or biological neuron networks.

Language Modelling Retrieval +2

LISA: Explaining Recurrent Neural Network Judgments via Layer-wIse Semantic Accumulation and Example to Pattern Transformation

no code implementations WS 2018 Pankaj Gupta, Hinrich Schütze

Recurrent neural networks (RNNs) are temporal networks, cumulative in nature, that have shown promising results in various natural language processing tasks.

Decision Making Relation Classification +2

News Article Teaser Tweets and How to Generate Them

2 code implementations NAACL 2019 Sanjeev Kumar Karn, Mark Buckley, Ulli Waltinger, Hinrich Schütze

In this work, we define the task of teaser generation and provide an evaluation benchmark and baseline systems for the process of generating teasers.

Replicated Siamese LSTM in Ticketing System for Similarity Learning and Retrieval in Asymmetric Texts

no code implementations COLING 2018 Pankaj Gupta, Bernt Andrassy, Hinrich Schütze

The task is challenging due to significant term mismatch in the query and ticket pairs of asymmetric lengths, where subject is a short text but description and solution are multi-sentence texts.

Retrieval Sentence

Fortification of Neural Morphological Segmentation Models for Polysynthetic Minimal-Resource Languages

no code implementations NAACL 2018 Katharina Kann, Manuel Mager, Ivan Meza-Ruiz, Hinrich Schütze

Morphological segmentation for polysynthetic languages is challenging, because a word may consist of many individual morphemes and training data can be extremely scarce.

Cross-Lingual Transfer Data Augmentation +1

Neural Architectures for Open-Type Relation Argument Extraction

no code implementations 5 Mar 2018 Benjamin Roth, Costanza Conforti, Nina Poerner, Sanjeev Karn, Hinrich Schütze

In this work, we introduce the task of Open-Type Relation Argument Extraction (ORAE): Given a corpus, a query entity Q and a knowledge base relation (e.g., "Q authored notable work with title X"), the model has to extract an argument of non-standard entity type (entities that cannot be extracted by a standard named entity tagger, e.g., X: the title of a book or a work of art) from the corpus.

Question Answering Relation +2

Deep Temporal-Recurrent-Replicated-Softmax for Topical Trends over Time

no code implementations NAACL 2018 Pankaj Gupta, Subburam Rajaram, Hinrich Schütze, Bernt Andrassy

We also introduce a metric (named SPAN) to quantify the capability of a dynamic topic model to capture word evolution in topics over time.

Dynamic Topic Modeling

Impact of Coreference Resolution on Slot Filling

no code implementations 26 Oct 2017 Heike Adel, Hinrich Schütze

In this paper, we demonstrate the importance of coreference resolution for natural language processing, using the TAC Slot Filling shared task as an example.

coreference-resolution slot-filling +1

Attentive Convolution: Equipping CNNs with RNN-style Attention Mechanisms

1 code implementation TACL 2018 Wenpeng Yin, Hinrich Schütze

We hypothesize that this is because the attention in CNNs has been mainly implemented as attentive pooling (i.e., it is applied to pooling) rather than as attentive convolution (i.e., it is integrated into convolution).

Claim Verification Natural Language Inference +3

Corpus-level Fine-grained Entity Typing

no code implementations 7 Aug 2017 Yadollah Yaghoobzadeh, Heike Adel, Hinrich Schütze

This paper addresses the problem of corpus-level entity typing, i.e., inferring from a large corpus that an entity is a member of a class such as "food" or "artist".

Entity Typing Knowledge Base Completion

Unlabeled Data for Morphological Generation With Character-Based Sequence-to-Sequence Models

no code implementations WS 2017 Katharina Kann, Hinrich Schütze

We present a semi-supervised way of training a character-based encoder-decoder recurrent neural network for morphological reinflection, the task of generating one inflected word form from another.

Past, Present, Future: A Computational Investigation of the Typology of Tense in 1000 Languages

no code implementations EMNLP 2017 Ehsaneddin Asgari, Hinrich Schütze

We present SuperPivot, an analysis method for low-resource languages that occur in a superparallel corpus, i.e., in a corpus that contains an order of magnitude more languages than parallel corpora currently in use.

One-Shot Neural Cross-Lingual Transfer for Paradigm Completion

no code implementations ACL 2017 Katharina Kann, Ryan Cotterell, Hinrich Schütze

We present a novel cross-lingual transfer method for paradigm completion, the task of mapping a lemma to its inflected forms, using a neural encoder-decoder model, the state of the art for the monolingual task.

Cross-Lingual Transfer LEMMA +1

Comparative Study of CNN and RNN for Natural Language Processing

4 code implementations 7 Feb 2017 Wenpeng Yin, Katharina Kann, Mo Yu, Hinrich Schütze

Deep neural networks (DNN) have revolutionized the field of natural language processing (NLP).

Position

Task-Specific Attentive Pooling of Phrase Alignments Contributes to Sentence Matching

no code implementations EACL 2017 Wenpeng Yin, Hinrich Schütze

This work studies comparatively two typical sentence matching tasks: textual entailment (TE) and answer selection (AS), observing that weaker phrase alignments are more critical in TE, while stronger phrase alignments deserve more attention in AS.

Answer Selection Natural Language Inference +2

Joint Semantic Synthesis and Morphological Analysis of the Derived Word

no code implementations TACL 2018 Ryan Cotterell, Hinrich Schütze

Since morphology obeys the principle of compositionality, the semantics of the word can be systematically derived from the meaning of its parts.

Additive models Morphological Analysis

Noise Mitigation for Neural Entity Typing and Relation Extraction

no code implementations EACL 2017 Yadollah Yaghoobzadeh, Heike Adel, Hinrich Schütze

For the second noise type, we propose ways to improve the integration of noisy entity type predictions into relation extraction.

Entity Typing Multi-Label Learning +3

Exploring Different Dimensions of Attention for Uncertainty Detection

no code implementations EACL 2017 Heike Adel, Hinrich Schütze

Neural networks with attention have proven effective for many natural language processing tasks.

Neural Multi-Source Morphological Reinflection

no code implementations EACL 2017 Katharina Kann, Ryan Cotterell, Hinrich Schütze

We explore the task of multi-source morphological reinflection, which generalizes the standard, single-source version.

LEMMA TAG

Corpus-level Fine-grained Entity Typing Using Contextual Information

no code implementations EMNLP 2015 Yadollah Yaghoobzadeh, Hinrich Schütze

This paper addresses the problem of corpus-level entity typing, i.e., inferring from a large corpus that an entity is a member of a class such as "food" or "artist".

Entity Typing Knowledge Base Completion +1

Simple Question Answering by Attentive Convolutional Neural Network

no code implementations COLING 2016 Wenpeng Yin, Mo Yu, Bing Xiang, Bo-Wen Zhou, Hinrich Schütze

In fact selection, we match the subject entity in a fact candidate with the entity mention in the question by a character-level convolutional neural network (char-CNN), and match the predicate in that fact with the question by a word-level CNN (word-CNN).

Entity Linking Fact Selection +1

Single-Model Encoder-Decoder with Explicit Morphological Representation for Reinflection

1 code implementation ACL 2016 Katharina Kann, Hinrich Schütze

Morphological reinflection is the task of generating a target form given a source form, a source tag and a target tag.

TAG

Combining Recurrent and Convolutional Neural Networks for Relation Classification

no code implementations NAACL 2016 Ngoc Thang Vu, Heike Adel, Pankaj Gupta, Hinrich Schütze

This paper investigates two different neural architectures for the task of relation classification: convolutional neural networks and recurrent neural networks.

Classification General Classification +2

Why and How to Pay Different Attention to Phrase Alignments of Different Intensities

no code implementations 23 Apr 2016 Wenpeng Yin, Hinrich Schütze

We address the problems of identifying phrase alignments of flexible granularity and pooling alignments of different intensities for these tasks.

Answer Selection Natural Language Inference +3

Online Updating of Word Representations for Part-of-Speech Tagging

no code implementations EMNLP 2015 Wenpeng Yin, Tobias Schnabel, Hinrich Schütze

We propose online unsupervised domain adaptation (DA), which is performed incrementally as data comes in and is applicable when batch DA is not possible.

Online unsupervised domain adaptation Part-Of-Speech Tagging +2

Discriminative Phrase Embedding for Paraphrase Identification

no code implementations HLT 2015 Wenpeng Yin, Hinrich Schütze

This work on the paraphrase identification task contributes, on the one hand, to expanding deep learning embeddings to include continuous and discontinuous linguistic phrases.

Paraphrase Identification

Comparing Convolutional Neural Networks to Traditional Models for Slot Filling

no code implementations NAACL 2016 Heike Adel, Benjamin Roth, Hinrich Schütze

We address relation classification in the context of slot filling, the task of finding and evaluating fillers like "Steve Jobs" for the slot X in "X founded Apple".

Classification General Classification +5

ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs

8 code implementations TACL 2016 Wenpeng Yin, Hinrich Schütze, Bing Xiang, Bo-Wen Zhou

(ii) We propose three attention schemes that integrate mutual influence between sentences into CNN; thus, the representation of each sentence takes into consideration its counterpart.

Answer Selection Natural Language Inference +2
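
The attention scheme can be sketched as a matrix of match scores between the two sentences' feature maps, which is then used to reweight each representation. Below, a NumPy sketch using 1/(1 + |x - y|) as one plausible match score; the feature maps are random stand-ins:

```python
import numpy as np

def attention_matrix(F0, F1):
    """F0: (d, m), F1: (d, n) feature maps -> (m, n) match scores."""
    diff = F0[:, :, None] - F1[:, None, :]              # (d, m, n)
    return 1.0 / (1.0 + np.linalg.norm(diff, axis=0))   # score per unit pair

F0 = np.random.randn(50, 7)   # sentence 0: 7 positions, 50-dim features
F1 = np.random.randn(50, 9)   # sentence 1: 9 positions
print(attention_matrix(F0, F1).shape)  # (7, 9)
```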

Learning Meta-Embeddings by Using Ensembles of Embedding Sets

1 code implementation 18 Aug 2015 Wenpeng Yin, Hinrich Schütze

Word embeddings -- distributed representations of words -- in deep learning are beneficial for many tasks in natural language processing (NLP).

Part-Of-Speech Tagging Word Embeddings +1

Distributional Models and Deep Learning Embeddings: Combining the Best of Both Worlds

no code implementations 19 Dec 2013 Irina Sergienya, Hinrich Schütze

There are two main approaches to the distributed representation of words: low-dimensional deep learning embeddings and high-dimensional distributional models, in which each dimension corresponds to a context word.
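
The high-dimensional side of this contrast is directly constructible: each dimension of a word's vector is the count of one context word. A toy sketch with a ±1-word window:

```python
from collections import defaultdict

corpus = "the cat sat on the mat the dog sat on the rug".split()
vectors = defaultdict(lambda: defaultdict(int))
for i, w in enumerate(corpus):
    for j in (i - 1, i + 1):                 # +-1-word context window
        if 0 <= j < len(corpus):
            vectors[w][corpus[j]] += 1

print(dict(vectors["sat"]))  # {'cat': 1, 'on': 2, 'dog': 1}
```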

Deep Learning Embeddings for Discontinuous Linguistic Units

no code implementations 18 Dec 2013 Wenpeng Yin, Hinrich Schütze

Deep learning embeddings have been successfully used for many natural language processing problems.

coreference-resolution
