Search Results for author: Marco Baroni

Found 89 papers, 29 papers with code

MemoryPrompt: A Light Wrapper to Improve Context Tracking in Pre-trained Language Models

1 code implementation • 23 Feb 2024 • Nathanaël Carraz Rakotonirina, Marco Baroni

Transformer-based language models (LMs) track contextual information through large, hard-coded input windows.
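
The snippet points at the core limitation: context beyond the fixed window is simply dropped. A minimal sketch of the wrapper idea, assuming a soft-prompt interface to a frozen LM (module names and shapes are hypothetical, not the authors' code): a small recurrent module summarizes each processed chunk and emits a few memory vectors that are prepended to the next chunk's input embeddings.

```python
# Illustrative sketch of a light "memory wrapper" around a frozen LM
# (hypothetical names/shapes; not the MemoryPrompt reference code).
import torch
import torch.nn as nn

class MemoryWrapper(nn.Module):
    def __init__(self, d_model: int, n_mem_tokens: int = 4):
        super().__init__()
        self.rnn = nn.GRUCell(d_model, d_model)   # small recurrent memory
        self.to_prefix = nn.Linear(d_model, n_mem_tokens * d_model)
        self.n_mem_tokens, self.d_model = n_mem_tokens, d_model

    def forward(self, chunk_embeds: torch.Tensor, h: torch.Tensor):
        # chunk_embeds: (batch, seq, d_model) embeddings of the current chunk
        summary = chunk_embeds.mean(dim=1)         # crude chunk summary
        h = self.rnn(summary, h)                   # update persistent memory
        prefix = self.to_prefix(h).view(-1, self.n_mem_tokens, self.d_model)
        # Prepend memory tokens; the frozen LM consumes the concatenation,
        # so information can survive across window boundaries.
        return torch.cat([prefix, chunk_embeds], dim=1), h
```

Training only the wrapper's parameters while the LM stays frozen is one way such a scheme stays "light".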

Unnatural language processing: How do language models handle machine-generated prompts?

no code implementations • 24 Oct 2023 • Corentin Kervadec, Francesca Franzon, Marco Baroni

Language model prompt optimization research has shown that semantically and grammatically well-formed manually crafted prompts are routinely outperformed by automatically generated token sequences with no apparent meaning or syntactic structure, including sequences of vectors from a model's embedding space.

Language Modelling

Bridging Information-Theoretic and Geometric Compression in Language Models

1 code implementation • 20 Oct 2023 • Emily Cheng, Corentin Kervadec, Marco Baroni

For a language model (LM) to faithfully model human language, it must compress vast, potentially infinite information into relatively few dimensions.

Language Modelling

Cross-Domain Image Captioning with Discriminative Finetuning

1 code implementation • CVPR 2023 • Roberto Dessì, Michele Bevilacqua, Eleonora Gualdoni, Nathanael Carraz Rakotonirina, Francesca Franzon, Marco Baroni

However, when the model is used without further tuning to generate captions for out-of-domain datasets, our discriminatively-finetuned captioner generates descriptions that resemble human references more than those produced by the same captioner without finetuning.

Descriptive Image Captioning

Can discrete information extraction prompts generalize across language models?

1 code implementation • 20 Feb 2023 • Nathanaël Carraz Rakotonirina, Roberto Dessì, Fabio Petroni, Sebastian Riedel, Marco Baroni

We study whether automatically-induced prompts that effectively extract information from a language model can also be used, out-of-the-box, to probe other language models for the same information.

Language Modelling • slot-filling +1

Referential communication in heterogeneous communities of pre-trained visual deep networks

1 code implementation • 4 Feb 2023 • Matéo Mahaut, Francesca Franzon, Roberto Dessì, Marco Baroni

As a first step in this direction, we systematically explore the task of referential communication in a community of heterogeneous state-of-the-art pre-trained visual networks, showing that they can develop, in a self-supervised way, a shared protocol to refer to a target object among a set of candidates.

Self-Driving Cars

Communication breakdown: On the low mutual intelligibility between human and neural captioning

1 code implementation • 20 Oct 2022 • Roberto Dessì, Eleonora Gualdoni, Francesca Franzon, Gemma Boleda, Marco Baroni

We compare the 0-shot performance of a neural caption-based image retriever when given as input either human-produced captions or captions generated by a neural captioner.

Retrieval

How BPE Affects Memorization in Transformers

no code implementations • 6 Oct 2021 • Eugene Kharitonov, Marco Baroni, Dieuwke Hupkes

In this work, we demonstrate that the size of the subword vocabulary learned by Byte-Pair Encoding (BPE) greatly affects both the ability and the tendency of standard Transformer models to memorize training data, even when we control for the number of learned parameters.

Memorization
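
Since the variable of interest is vocabulary size, it helps to see where that knob lives: it is simply the number of merge operations performed during BPE learning. A minimal sketch of the standard BPE learning loop (simplified Sennrich-style algorithm; not the paper's code):

```python
# Minimal BPE learner: `num_merges` controls the subword vocabulary size,
# the quantity the paper varies when studying memorization.
from collections import Counter

def learn_bpe(corpus_words, num_merges):
    # Each word starts as a tuple of characters plus an end-of-word marker.
    vocab = Counter(tuple(w) + ("</w>",) for w in corpus_words)
    merges = []
    for _ in range(num_merges):
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for word, freq in vocab.items():
            for a, b in zip(word, word[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        merges.append(best)
        # Apply the winning merge everywhere.
        merged = Counter()
        for word, freq in vocab.items():
            out, i = [], 0
            while i < len(word):
                if i < len(word) - 1 and (word[i], word[i + 1]) == best:
                    out.append(word[i] + word[i + 1]); i += 2
                else:
                    out.append(word[i]); i += 1
            merged[tuple(out)] += freq
        vocab = merged
    return merges

# e.g. learn_bpe(["low", "lower", "lowest"] * 10, num_merges=10)
```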

On the proper role of linguistically-oriented deep net analysis in linguistic theorizing

no code implementations • 16 Jun 2021 • Marco Baroni

A lively research field has recently emerged that uses experimental methods to probe the linguistic behavior of modern deep networks.

Interpretable agent communication from scratch (with a generic visual processor emerging on the side)

1 code implementation • NeurIPS 2021 • Roberto Dessì, Eugene Kharitonov, Marco Baroni

As deep networks begin to be deployed as autonomous agents, the issue of how they can communicate with each other becomes important.

Self-Supervised Learning

Mechanisms for Handling Nested Dependencies in Neural-Network Language Models and Humans

no code implementations • 19 Jun 2020 • Yair Lakretz, Dieuwke Hupkes, Alessandra Vergallito, Marco Marelli, Marco Baroni, Stanislas Dehaene

We studied whether a modern artificial neural network trained with "deep learning" methods mimics a central aspect of human sentence processing, namely the storing of grammatical number and gender information in working memory and its use in long-distance agreement (e.g., capturing the correct number agreement between subject and verb when they are separated by other phrases).

Sentence

Emergent Multi-Agent Communication in the Deep Learning Era

no code implementations • 3 Jun 2020 • Angeliki Lazaridou, Marco Baroni

The ability to cooperate through language is a defining feature of humans.

Syntactic Structure from Deep Learning

no code implementations • 22 Apr 2020 • Tal Linzen, Marco Baroni

Modern deep neural networks achieve impressive performance in engineering applications that require extensive linguistic skills, such as machine translation.

Language Acquisition • Machine Translation +1

Compositionality and Generalization in Emergent Languages

1 code implementation • ACL 2020 • Rahma Chaabouni, Eugene Kharitonov, Diane Bouchacourt, Emmanuel Dupoux, Marco Baroni

Third, while compositionality is not necessary for generalization, it provides an advantage in terms of language transmission: The more compositional a language is, the more easily it will be picked up by new learners, even when the latter differ in architecture from the original agents.

Disentanglement

Emergent Language Generalization and Acquisition Speed are not tied to Compositionality

1 code implementation • EMNLP (BlackboxNLP) 2020 • Eugene Kharitonov, Marco Baroni

Studies of discrete languages emerging when neural agents communicate to solve a joint task often look for evidence of compositional structure.

Rat big, cat eaten! Ideas for a useful deep-agent protolanguage

no code implementations • 17 Mar 2020 • Marco Baroni

Deep-agent communities developing their own language-like communication protocol are a hot (or at least warm) topic in AI.

A Benchmark for Systematic Generalization in Grounded Language Understanding

4 code implementations • NeurIPS 2020 • Laura Ruis, Jacob Andreas, Marco Baroni, Diane Bouchacourt, Brenden M. Lake

In this paper, we introduce a new benchmark, gSCAN, for evaluating compositional generalization in situated language understanding.

Systematic Generalization

Focus on What's Informative and Ignore What's not: Communication Strategies in a Referential Game

no code implementations • 5 Nov 2019 • Roberto Dessì, Diane Bouchacourt, Davide Crepaldi, Marco Baroni

Research in multi-agent cooperation has shown that artificial agents are able to learn to play a simple referential game while developing a shared lexicon.

On the Distribution of Deep Clausal Embeddings: A Large Cross-linguistic Study

no code implementations • ACL 2019 • Damian Blasi, Ryan Cotterell, Lawrence Wolf-Sonkin, Sabine Stoll, Balthasar Bickel, Marco Baroni

Embedding a clause inside another ("the girl [who likes cars [that run fast]] has arrived") is a fundamental resource that has been argued to be a key driver of linguistic expressiveness.

EGG: a toolkit for research on Emergence of lanGuage in Games

no code implementations • IJCNLP 2019 • Eugene Kharitonov, Rahma Chaabouni, Diane Bouchacourt, Marco Baroni

There is renewed interest in simulating language emergence among deep neural agents that communicate to jointly solve a task, spurred by the practical aim to develop language-enabled interactive AIs, as well as by theoretical questions about the evolution of human language.

Entropy Minimization In Emergent Languages

1 code implementation • ICML 2020 • Eugene Kharitonov, Rahma Chaabouni, Diane Bouchacourt, Marco Baroni

There is growing interest in studying the languages that emerge when neural agents are jointly trained to solve tasks requiring communication through a discrete channel.

Representation Learning

Word-order biases in deep-agent emergent communication

1 code implementation • ACL 2019 • Rahma Chaabouni, Eugene Kharitonov, Alessandro Lazaric, Emmanuel Dupoux, Marco Baroni

We train models to communicate about paths in a simple gridworld, using miniature languages that reflect or violate various natural language trends, such as the tendency to avoid redundancy or to minimize long-distance dependencies.

Anti-efficient encoding in emergent communication

1 code implementation • NeurIPS 2019 • Rahma Chaabouni, Eugene Kharitonov, Emmanuel Dupoux, Marco Baroni

Despite renewed interest in emergent language simulations with neural networks, little is known about the basic properties of the induced code, and how they compare to human language.

Miss Tools and Mr Fruit: Emergent communication in agents learning about object affordances

1 code implementation • ACL 2019 • Diane Bouchacourt, Marco Baroni

Recent research studies communication emergence in communities of deep network agents assigned a joint task, hoping to gain insights on human language evolution.

CNNs found to jump around more skillfully than RNNs: Compositional generalization in seq2seq convolutional networks

no code implementations • ACL 2019 • Roberto Dessì, Marco Baroni

Lake and Baroni (2018) introduced the SCAN dataset probing the ability of seq2seq models to capture compositional generalizations, such as inferring the meaning of "jump around" 0-shot from the component words.

Linguistic generalization and compositionality in modern artificial neural networks

no code implementations • 30 Mar 2019 • Marco Baroni

In the last decade, deep artificial neural networks have achieved astounding performance in many natural language processing tasks.

Systematic Generalization

The emergence of number and syntax units in LSTM language models

1 code implementation • NAACL 2019 • Yair Lakretz, German Kruszewski, Theo Desbordes, Dieuwke Hupkes, Stanislas Dehaene, Marco Baroni

Importantly, the behaviour of these units is partially controlled by other units independently shown to track syntactic structure.

Language Modelling

Human few-shot learning of compositional instructions

2 code implementations • 14 Jan 2019 • Brenden M. Lake, Tal Linzen, Marco Baroni

There have been striking recent improvements in machine learning for natural language processing, yet the best algorithms require vast amounts of experience and struggle to generalize new concepts in compositional ways.

Few-Shot Learning

Jump to better conclusions: SCAN both left and right

1 code implementation • WS 2018 • Jasmijn Bastings, Marco Baroni, Jason Weston, Kyunghyun Cho, Douwe Kiela

Lake and Baroni (2018) recently introduced the SCAN data set, which consists of simple commands paired with action sequences and is intended to test the strong generalization abilities of recurrent sequence-to-sequence models.
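
For concreteness, a few pairs in SCAN's command-to-action format (illustrative examples in the dataset's style):

```python
# SCAN-style (command -> action sequence) pairs. The benchmark tests whether
# a model that has seen how other verbs compose can apply the same rules to
# a verb (e.g. "jump") it has mostly seen in isolation.
scan_examples = {
    "jump":             "JUMP",
    "jump twice":       "JUMP JUMP",
    "jump left":        "LTURN JUMP",
    "walk and jump":    "WALK JUMP",
    "jump around left": "LTURN JUMP LTURN JUMP LTURN JUMP LTURN JUMP",
}
```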

How agents see things: On visual representations in an emergent language game

no code implementations • EMNLP 2018 • Diane Bouchacourt, Marco Baroni

There is growing interest in the language developed by agents interacting in emergent-communication settings.

Rearranging the Familiar: Testing Compositional Generalization in Recurrent Networks

no code implementations • WS 2018 • João Loula, Marco Baroni, Brenden M. Lake

Systematic compositionality is the ability to recombine meaningful units with regular and predictable outcomes, and it's seen as key to humans' capacity for generalization in language.

What you can cram into a single vector: Probing sentence embeddings for linguistic properties

6 code implementations • 3 May 2018 • Alexis Conneau, German Kruszewski, Guillaume Lample, Loïc Barrault, Marco Baroni

Although much effort has recently been devoted to training high-quality sentence embeddings, we still have a poor understanding of what they are capturing.

General Classification • Sentence +2

Colorless green recurrent networks dream hierarchically

2 code implementations • NAACL 2018 • Kristina Gulordava, Piotr Bojanowski, Edouard Grave, Tal Linzen, Marco Baroni

Recurrent neural networks (RNNs) have achieved impressive results in a variety of linguistic processing tasks, suggesting that they can induce non-trivial properties of language.

Language Modelling

Memorize or generalize? Searching for a compositional RNN in a haystack

1 code implementation • 18 Feb 2018 • Adam Liška, Germán Kruszewski, Marco Baroni

Neural networks are very powerful learning systems, but they do not readily generalize from one task to the other.

Causal Discovery Using Proxy Variables

no code implementations • 23 Feb 2017 • Mateo Rojas-Carulla, Marco Baroni, David Lopez-Paz

In this paper, we develop a framework to estimate the cause-effect relation between two static entities $x$ and $y$: for instance, an art masterpiece $x$ and its fraudulent copy $y$.

Causal Discovery • Relation

Living a discrete life in a continuous world: Reference with distributed representations

no code implementations • 6 Feb 2017 • Gemma Boleda, Sebastian Padó, Nghia The Pham, Marco Baroni

Reference is a crucial property of language that allows us to connect linguistic expressions to the world.

CommAI: Evaluating the first steps towards a useful general AI

no code implementations • 31 Jan 2017 • Marco Baroni, Armand Joulin, Allan Jabri, Germán Kruszewski, Angeliki Lazaridou, Klemen Simonic, Tomas Mikolov

With machine learning successfully applied to new daunting problems almost every day, general AI starts looking like an attainable goal.

BIG-bench Machine Learning • Continual Learning +2

Multi-Agent Cooperation and the Emergence of (Natural) Language

1 code implementation • 21 Dec 2016 • Angeliki Lazaridou, Alexander Peysakhovich, Marco Baroni

The sender is told one of them is the target and is allowed to send a message from a fixed, arbitrary vocabulary to the receiver.

"Show me the cup": Reference with Continuous Representations

no code implementations • 28 Jun 2016 • Gemma Boleda, Sebastian Padó, Marco Baroni

One of the most basic functions of language is to refer to objects in a shared scene.

Towards Multi-Agent Communication-Based Language Learning

no code implementations • 23 May 2016 • Angeliki Lazaridou, Nghia The Pham, Marco Baroni

We propose an interactive multimodal framework for language learning.

The red one!: On learning to refer to things based on their discriminative properties

no code implementations • 8 Mar 2016 • Angeliki Lazaridou, Nghia The Pham, Marco Baroni

As a first step towards agents learning to communicate about their visual environment, we propose a system that, given visual representations of a referent (cat) and a context (sofa), identifies their discriminative attributes, i.e., properties that distinguish them (has_tail).

Attribute
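
One simple way to operationalize "discriminative attributes", assuming each visual representation has been mapped to a vector of attribute probabilities (the function name and threshold below are hypothetical, not the paper's exact model):

```python
# Sketch: keep attributes much more probable for the referent than for the
# context. With referent "cat" vs context "sofa", has_tail should survive.
import numpy as np

def discriminative_attributes(referent_probs, context_probs,
                              attribute_names, margin=0.5):
    diff = np.asarray(referent_probs) - np.asarray(context_probs)
    return [name for name, d in zip(attribute_names, diff) if d > margin]
```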

A Roadmap towards Machine Intelligence

1 code implementation • 25 Nov 2015 • Tomas Mikolov, Armand Joulin, Marco Baroni

The development of intelligent machines is one of the biggest unsolved challenges in computer science.

Unveiling the Dreams of Word Embeddings: Towards Language-Driven Image Generation

no code implementations • 10 Jun 2015 • Angeliki Lazaridou, Dat Tien Nguyen, Raffaella Bernardi, Marco Baroni

We introduce language-driven image generation, the task of generating an image visualizing the semantic contents of a word embedding, e.g., given the word embedding of grasshopper, we generate a natural image of a grasshopper.

Image Generation • Word Embeddings

From Visual Attributes to Adjectives through Decompositional Distributional Semantics

no code implementations • TACL 2015 • Angeliki Lazaridou, Georgiana Dinu, Adam Liska, Marco Baroni

By building on the recent "zero-shot learning" approach, and paying attention to the linguistic nature of attributes as noun modifiers, and specifically adjectives, we show that it is possible to tag images with attribute-denoting adjectives even when no training data containing the relevant annotation are available.

Attribute • Object +4

Combining Language and Vision with a Multimodal Skip-gram Model

no code implementations • HLT 2015 • Angeliki Lazaridou, Nghia The Pham, Marco Baroni

We extend the SKIP-GRAM model of Mikolov et al. (2013a) by taking visual information into account.

Retrieval
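
A hedged sketch of the multimodal idea: alongside the usual skip-gram context-prediction loss, words that come with images also pay a max-margin cost for being far from their visual vector. The function below is an illustrative reading of that extra term, not the paper's exact objective:

```python
# Max-margin visual term: pull a word vector toward its image vector and
# away from a randomly sampled negative image (margin value is illustrative).
import torch
import torch.nn.functional as F

def visual_term(word_vec, visual_vec, negative_visual, margin=0.5):
    pos = F.cosine_similarity(word_vec, visual_vec, dim=-1)
    neg = F.cosine_similarity(word_vec, negative_visual, dim=-1)
    return torch.clamp(margin - pos + neg, min=0.0).mean()
```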

Deriving Boolean structures from distributional vectors

no code implementations • TACL 2015 • German Kruszewski, Denis Paperno, Marco Baroni

Corpus-based distributional semantic models capture degrees of semantic relatedness among the words of very large vocabularies, but have problems with logical phenomena such as entailment, that are instead elegantly handled by model-theoretic approaches, which, in turn, do not scale up.

Improving zero-shot learning by mitigating the hubness problem

4 code implementations • 20 Dec 2014 • Georgiana Dinu, Angeliki Lazaridou, Marco Baroni

The zero-shot paradigm exploits vector-based word representations extracted from text corpora with unsupervised methods to learn general mapping functions from other feature spaces onto word space, where the words associated to the nearest neighbours of the mapped vectors are used as their linguistic labels.

Image Retrieval • Retrieval +1
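
The pipeline the snippet describes is compact enough to sketch: fit a least-squares map from the source feature space into word space, then label each mapped query with its nearest-neighbour word. Hubness means a few words end up nearest neighbours of almost everything; subtracting each word's average similarity to the mapped queries is one simple way to demote such hubs (the paper's actual correction is rank-based, so treat this as an illustration of the idea only):

```python
# Zero-shot labelling sketch with a crude hubness correction (illustrative).
import numpy as np

def fit_map(X_src, Y_word):
    # Least-squares W minimizing ||X_src @ W - Y_word||^2.
    W, *_ = np.linalg.lstsq(X_src, Y_word, rcond=None)
    return W

def label(queries, W, word_vecs, words):
    mapped = queries @ W
    mapped /= np.linalg.norm(mapped, axis=1, keepdims=True)
    V = word_vecs / np.linalg.norm(word_vecs, axis=1, keepdims=True)
    sims = mapped @ V.T                       # cosine similarities
    sims -= sims.mean(axis=0, keepdims=True)  # demote "hub" words
    return [words[i] for i in sims.argmax(axis=1)]
```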
