Search Results for author: Olivier Ferret

Found 62 papers, 9 papers with code

Can We Guide a Multi-Hop Reasoning Language Model to Incrementally Learn at Each Single-Hop?

1 code implementation • COLING 2022 • Jesus Lovon-Melgarejo, Jose G. Moreno, Romaric Besançon, Olivier Ferret, Lynda Tamine

Despite the success of state-of-the-art pre-trained language models (PLMs) on a series of multi-hop reasoning tasks, they still suffer from their limited abilities to transfer learning from simple to complex tasks and vice-versa.

Language Modelling Multiple-choice +3

CLISTER : A Corpus for Semantic Textual Similarity in French Clinical Narratives

no code implementations • LREC 2022 • Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol

We introduce a definition of similarity that is guided by clinical facts and apply it to the development of a new French corpus of 1,000 sentence pairs manually annotated according to similarity scores.

Re-train or Train from Scratch? Comparing Pre-training Strategies of BERT in the Medical Domain

no code implementations • LREC 2022 • Hicham El Boukkouri, Olivier Ferret, Thomas Lavergne, Pierre Zweigenbaum

BERT models used in specialized domains all seem to be the result of a simple strategy: initializing with the original BERT and then resuming pre-training on a specialized corpus.

Building Static Embeddings from Contextual Ones: Is It Useful for Building Distributional Thesauri?

no code implementations • LREC 2022 • Olivier Ferret

In this article, we propose a new method for building word or type-level embeddings from contextual models.

Décontextualiser des plongements contextuels pour construire des thésaurus distributionnels (Decontextualizing contextual embeddings for building distributional thesauri)

no code implementations • JEP/TALN/RECITAL 2022 • Olivier Ferret

Even though contextual language models are now dominant in natural language processing, the representations they build are not always suited to every use.

Intérêt des modèles de caractères pour la détection d’événements (The interest of character-level models for event detection)

no code implementations • JEP/TALN/RECITAL 2021 • Emanuela Boros, Romaric Besançon, Olivier Ferret, Brigitte Grau

This article addresses the task of event detection, which aims to identify and categorize event mentions in texts.

Event Detection

Exploration des relations sémantiques sous-jacentes aux plongements contextuels de mots (Exploring semantic relations underlying contextual word embeddings)

no code implementations • JEP/TALN/RECITAL 2021 • Olivier Ferret

Many studies have recently investigated the properties of contextual language models but, surprisingly, only a few of them focus on the properties of these models in terms of semantic similarity.

Word Embeddings

CLISTER : Un corpus pour la similarité sémantique textuelle dans des cas cliniques en français (CLISTER : A Corpus for Semantic Textual Similarity in French Clinical Narratives)

no code implementations • JEP/TALN/RECITAL 2022 • Nicolas Hiebel, Karën Fort, Aurélie Névéol, Olivier Ferret

NLP relies on the availability of annotated corpora for training and evaluating models.

Semantic Textual Similarity STS

Un jeu de données pour répondre à des questions visuelles à propos d’entités nommées en utilisant des bases de connaissances (ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities)

no code implementations • JEP/TALN/RECITAL 2022 • Paul Lerner, Olivier Ferret, Camille Guinaudeau, Hervé Le Borgne, Romaric Besançon, Jose Moreno, Jesús Lovón-Melgarejo

In the general context of multimodal processing, we focus on the task of Knowledge-based Visual Question Answering about named Entities (KVQAE).

Question Answering Visual Question Answering

Stratégies d’adaptation pour la reconnaissance d’entités médicales en français (Adaptation strategies for biomedical named entity recognition in French)

no code implementations • JEP/TALN/RECITAL 2022 • Tiphaine Le Clercq de Lannoy, Romaric Besançon, Olivier Ferret, Julien Tourille, Frédérique Brin-Henry, Bianca Vieru

In a context where few annotated corpora are available for medical entity extraction, this article studies a hybrid approach that combines the use of specialized knowledge with language model adaptation, with an emphasis on the effect of pre-training a general-purpose language model (CamemBERT) on different corpora.

named-entity-recognition Named Entity Recognition +1

Mieux utiliser BERT pour la détection d’évènements à partir de peu d’exemples (Better exploitation of BERT for few-shot event detection)

no code implementations • JEP/TALN/RECITAL 2022 • Aboubacar Tuo, Romaric Besançon, Olivier Ferret, Julien Tourille

Current methods for event detection, which rely mainly on deep supervised learning, prove very costly in annotated data.

Event Detection

Cross-modal Retrieval for Knowledge-based Visual Question Answering

1 code implementation • 11 Jan 2024 • Paul Lerner, Olivier Ferret, Camille Guinaudeau

Knowledge-based Visual Question Answering about Named Entities is a challenging task that requires retrieving information from a multimodal Knowledge Base.

Cross-Modal Retrieval Question Answering +2

Probing Pretrained Language Models with Hierarchy Properties

no code implementations • 15 Dec 2023 • Jesús Lovón-Melgarejo, Jose G. Moreno, Romaric Besançon, Olivier Ferret, Lynda Tamine

In this work, we propose a task-agnostic evaluation method able to evaluate to what extent PLMs can capture complex taxonomy relations, such as ancestors and siblings.

Hypernym Discovery Information Retrieval +2

TIAM -- A Metric for Evaluating Alignment in Text-to-Image Generation

1 code implementation • 11 Jul 2023 • Paul Grimal, Hervé Le Borgne, Olivier Ferret, Julien Tourille

While several metrics have been proposed to assess the rendering of images, it is crucial for Text-to-Image (T2I) models, which generate images based on a prompt, to consider additional aspects such as to which extent the generated image matches the important content of the prompt.

Text-to-Image Generation

Multimodal Inverse Cloze Task for Knowledge-based Visual Question Answering

1 code implementation • 11 Jan 2023 • Paul Lerner, Olivier Ferret, Camille Guinaudeau

We present a new pre-training method, Multimodal Inverse Cloze Task, for Knowledge-based Visual Question Answering about named Entities (KVQAE).

Question Answering Reading Comprehension +3

ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities

1 code implementation • SIGIR 2022 • Paul Lerner, Olivier Ferret, Camille Guinaudeau, Hervé Le Borgne, Romaric Besançon, Jose G Moreno, Jesús Lovón Melgarejo

To benchmark this task, called KVQAE (Knowledge-based Visual Question Answering about named Entities), we provide ViQuAE, a dataset of 3.7K questions paired with images.

Few-Shot Learning Information Retrieval +4

Using Distributional Principles for the Semantic Study of Contextual Language Models

no code implementations • 23 Nov 2021 • Olivier Ferret

Many studies were recently done for investigating the properties of contextual language models but surprisingly, only a few of them consider the properties of these models in terms of semantic similarity.

Multimodal Entity Linking for Tweets

2 code implementations • 7 Apr 2021 • Omar Adjali, Romaric Besançon, Olivier Ferret, Herve Le Borgne, Brigitte Grau

In many information extraction applications, entity linking (EL) has emerged as a crucial task that allows leveraging information about named entities from a knowledge base.

Entity Linking

CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters

2 code implementations • COLING 2020 • Hicham El Boukkouri, Olivier Ferret, Thomas Lavergne, Hiroshi Noji, Pierre Zweigenbaum, Junichi Tsujii

Due to the compelling improvements brought by BERT, many recent representation models adopted the Transformer architecture as their main building block, consequently inheriting the wordpiece tokenization system despite it not being intrinsically linked to the notion of Transformers.

Ranked #1 on Semantic Similarity on ClinicalSTS

Clinical Concept Extraction Drug–drug Interaction Extraction +3

Modèle neuronal pour la résolution de la coréférence dans les dossiers médicaux électroniques (Neural approach for coreference resolution in electronic health records)

no code implementations • JEPTALNRECITAL 2020 • Julien Tourille, Olivier Ferret, Aurélie Névéol, Xavier Tannier

Coreference resolution is an essential component for automatically building medical timelines from electronic health records.

coreference-resolution

Représentation dynamique et spécifique du contexte textuel pour l'extraction d'événements (Dynamic and specific textual context representation for event extraction)

no code implementations • JEPTALNRECITAL 2020 • Dorian Kodelja, Romaric Besançon, Olivier Ferret

In this article, focused on the supervised extraction of event mentions in texts, we propose to extend a model that operates at the sentence level and relies on a graph convolutional neural architecture exploiting syntactic dependencies.

Event Extraction

Building a Multimodal Entity Linking Dataset From Tweets

no code implementations • LREC 2020 • Omar Adjali, Romaric Besançon, Olivier Ferret, Hervé Le Borgne, Brigitte Grau

The method collects text and images to jointly build a corpus of tweets with ambiguous mentions along with a Twitter KB defining the entities.

Entity Linking Event Detection +1

Extrinsic Evaluation of French Dependency Parsers on a Specialized Corpus: Comparison of Distributional Thesauri

no code implementations • LREC 2020 • Ludovic Tanguy, Pauline Brunet, Olivier Ferret

We present a study in which we compare 11 different French dependency parsers on a specialized corpus (consisting of research articles on NLP from the proceedings of the TALN conference).

Which Dependency Parser to Use for Distributional Semantics in a Specialized Domain?

no code implementations • LREC 2020 • Pauline Brunet, Olivier Ferret, Ludovic Tanguy

We present a study whose objective is to compare several dependency parsers for English applied to a specialized corpus for building distributional count-based models from syntactic dependencies.

Embedding Strategies for Specialized Domains: Application to Clinical Entity Recognition

1 code implementation • ACL 2019 • Hicham El Boukkouri, Olivier Ferret, Thomas Lavergne, Pierre Zweigenbaum

Using pre-trained word embeddings in conjunction with Deep Learning models has become the “de facto” approach in Natural Language Processing (NLP).

Ranked #4 on Clinical Concept Extraction on 2010 i2b2/VA

Clinical Concept Extraction Word Embeddings

Comparaison qualitative et extrinsèque d'analyseurs syntaxiques du français : confrontation de modèles distributionnels sur un corpus spécialisé (Extrinsic evaluation of French dependency parsers on a specialised corpus : comparison of distributional thesauri)

no code implementations • JEPTALNRECITAL 2019 • Ludovic Tanguy, Pauline Brunet, Olivier Ferret

We present a study that compares 11 different French dependency parsers on a specialized corpus (made up of the archives of articles from the TALN conference).

Searching News Articles Using an Event Knowledge Graph Leveraged by Wikidata

no code implementations • 11 Apr 2019 • Charlotte Rudnik, Thibault Ehrhart, Olivier Ferret, Denis Teyssou, Raphaël Troncy, Xavier Tannier

News agencies produce thousands of multimedia stories describing events happening in the world that are either scheduled such as sports competitions, political summits and elections, or breaking events such as military conflicts, terrorist attacks, natural disasters, etc.

Evaluation of a Sequence Tagging Tool for Biomedical Texts

1 code implementation • WS 2018 • Julien Tourille, Matthieu Doutreligne, Olivier Ferret, Aurélie Névéol, Nicolas Paris, Xavier Tannier

Many applications in biomedical natural language processing rely on sequence tagging as an initial step to perform more complex analysis.

named-entity-recognition Named Entity Recognition +4

Using pseudo-senses for improving the extraction of synonyms from word embeddings

no code implementations • ACL 2018 • Olivier Ferret

The methods proposed recently for specializing word embeddings according to a particular perspective generally rely on external knowledge.

Dimensionality Reduction Semantic Similarity +3

Des pseudo-sens pour améliorer l'extraction de synonymes à partir de plongements lexicaux (Pseudo-senses for improving the extraction of synonyms from word embeddings)

no code implementations • JEPTALNRECITAL 2018 • Olivier Ferret

Beyond models designed to build word embeddings from corpora, methods for specializing these representations along different orientations have been proposed.

Word Embeddings

Intégration de contexte global par amorçage pour la détection d'événements (Integrating global context via bootstrapping for event detection)

no code implementations • JEPTALNRECITAL 2018 • Dorian Kodelja, Romaric Besançon, Olivier Ferret

Neural approaches have been achieving interesting results in event extraction for several years.

Event Detection SENTS

Utilisation de Représentations Distribuées de Relations pour la Désambiguïsation d'Entités Nommées (Exploiting Relation Embeddings to Improve Entity Linking)

no code implementations • JEPTALNRECITAL 2018 • Nicolas Wagner, Romaric Besançon, Olivier Ferret

Identifying named entities in a text is a fundamental step for many information extraction tasks.

Entity Linking

Taking into account Inter-sentence Similarity for Update Summarization

no code implementations • IJCNLP 2017 • Maâli Mnasri, Gaël de Chalendar, Olivier Ferret

Following Gillick and Favre (2009), a lot of work about extractive summarization has modeled this task by associating two contrary constraints: one aims at maximizing the coverage of the summary with respect to its information content while the other represents its size limit.

Document Summarization Extractive Summarization +5

Turning Distributional Thesauri into Word Vectors for Synonym Extraction and Expansion

no code implementations • IJCNLP 2017 • Olivier Ferret

In this article, we propose to investigate a new problem consisting in turning a distributional thesaurus into dense word vectors.

Graph Embedding Semantic Textual Similarity +1

Unsupervised Event Clustering and Aggregation from Newswire and Web Articles

no code implementations • WS 2017 • Swen Ribeiro, Olivier Ferret, Xavier Tannier

In this paper, we present an unsupervised pipeline approach for clustering news articles based on identified event instances in their content.

Clustering Document Summarization +1

LIMSI-COT at SemEval-2017 Task 12: Neural Architecture for Temporal Information Extraction from Clinical Narratives

no code implementations • SEMEVAL 2017 • Julien Tourille, Olivier Ferret, Xavier Tannier, Aurélie Névéol

In this paper we present our participation to SemEval 2017 Task 12.

Domain Adaptation Entity Extraction using GAN +3

Neural Architecture for Temporal Relation Extraction: A Bi-LSTM Approach for Detecting Narrative Containers

no code implementations • ACL 2017 • Julien Tourille, Olivier Ferret, Aurélie Névéol, Xavier Tannier

We present a neural architecture for containment relation identification between medical events and/or temporal expressions.

Relation Temporal Information Extraction +1

Construire des représentations denses à partir de thésaurus distributionnels (Distributional Thesaurus Embedding and its Applications)

no code implementations • JEPTALNRECITAL 2017 • Olivier Ferret

In this article, we address a new problem, called thesaurus embedding, which consists in turning a distributional thesaurus into a dense representation of words.

Temporal information extraction from clinical text

no code implementations • EACL 2017 • Julien Tourille, Olivier Ferret, Xavier Tannier, Aurélie Névéol

In this paper, we present a method for temporal relation extraction from clinical narratives in French and in English.

Relation Temporal Information Extraction +1

Intégration de la similarité entre phrases comme critère pour le résumé multi-document (Integrating sentence similarity as a constraint for multi-document summarization)

no code implementations • JEPTALNRECITAL 2016 • Maâli Mnasri, Gaël de Chalendar, Olivier Ferret

In this article, we adopt the framework defined by Gillick & Favre (2009) but examine how and to what extent explicitly taking the semantic similarity of sentences into account can improve the performance of a multi-document summarization system.

Document Summarization Multi-Document Summarization +2

Extraction de relations temporelles dans des dossiers électroniques patient (Extracting Temporal Relations from Electronic Health Records)

no code implementations • JEPTALNRECITAL 2016 • Julien Tourille, Olivier Ferret, Aurélie Névéol, Xavier Tannier

This analysis relies on the extraction of events, temporal expressions, and the relations between them.

Utilisation des relations d'une base de connaissances pour la désambiguïsation d'entités nommées (Using the Relations of a Knowledge Base to Improve Entity Linking)

no code implementations • JEPTALNRECITAL 2016 • Romaric Besançon, Hani Daher, Olivier Ferret, Hervé Le Borgne

Identifying named entities in a text is an essential task for information extraction tools in many applications.

Entity Linking

LIMSI-COT at SemEval-2016 Task 12: Temporal relation identification using a pipeline of classifiers

no code implementations • SEMEVAL 2016 • Julien Tourille, Olivier Ferret, Aurélie Névéol, Xavier Tannier

Entity Extraction using GAN Relation +2

A Dataset for Open Event Extraction in English

no code implementations • LREC 2016 • Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret, Romaric Besançon

We detail the methodology used for building the corpus and evaluate some existing systems on this new data.

Event Extraction

Generative Event Schema Induction with Entity Disambiguation

no code implementations • IJCNLP 2015 • Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret, Romaric Besançon

Entity Disambiguation

Early and Late Combinations of Criteria for Reranking Distributional Thesauri

no code implementations • IJCNLP 2015 • Olivier Ferret

Dimensionality Reduction

Déclasser les voisins non sémantiques pour améliorer les thésaurus distributionnels

no code implementations • JEPTALNRECITAL 2015 • Olivier Ferret

Most methods for improving distributional thesauri focus on the means (representations or similarity measures) of better detecting semantic similarity between words.

Désambiguïsation d'entités pour l'induction non supervisée de schémas événementiels

no code implementations • JEPTALNRECITAL 2015 • Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret, Romaric Besançon

Previous methods in the literature use only the heads of phrases to represent entities. Yet the full phrase (for example, "an armed man") provides more discriminative information (than "man").

SENTER

Event Role Extraction using Domain-Relevant Word Representations

no code implementations • EMNLP 2014 • Emanuela Boroş, Romaric Besançon, Olivier Ferret, Brigitte Grau

Machine Translation Named Entity Recognition (NER) +6

Improving distributional thesauri by exploring the graph of neighbors

no code implementations • COLING 2014 • Vincent Claveau, Ewa Kijak, Olivier Ferret

Information Retrieval

Exploring the neighbor graph to improve distributional thesauri (Explorer le graphe de voisinage pour améliorer les thésaurus distributionnels) [in French]

no code implementations • JEPTALNRECITAL 2014 • Vincent Claveau, Ewa Kijak, Olivier Ferret

Information Retrieval

Using a generic neural model for lexical substitution (Utiliser un modèle neuronal générique pour la substitution lexicale) [in French]

no code implementations • JEPTALNRECITAL 2014 • Olivier Ferret

Semantic Textual Similarity

Event Role Labelling using a Neural Network Model (Étiquetage en rôles événementiels fondé sur l'utilisation d'un modèle neuronal) [in French]

no code implementations • JEPTALNRECITAL 2014 • Emanuela Boroş, Romaric Besançon, Olivier Ferret, Brigitte Grau

Word Embeddings

Compounds and distributional thesauri

no code implementations • LREC 2014 • Olivier Ferret

Lemmatization Machine Translation +2

Evaluation of different strategies for domain adaptation in opinion mining

no code implementations • LREC 2014 • Anne Garcia-Fernandez, Olivier Ferret, Marco Dinarelli

The work presented in this article takes place in the field of opinion mining and aims more particularly at finding the polarity of a text by relying on machine learning methods.

Domain Adaptation Opinion Mining +2