Search Results for author: Ant{\'o}nio Branco

Found 35 papers, 2 papers with code

The BDCam\~oes Collection of Portuguese Literary Documents: a Research Resource for Digital Humanities and Language Technology

no code implementations LREC 2020 Sara Grilo, M{\'a}rcia Bolrinha, Jo{\~a}o Silva, Rui Vaz, Ant{\'o}nio Branco

This paper presents the BDCam{\~o}es Collection of Portuguese Literary Documents, a new corpus of literary texts written in Portuguese that in its inaugural version includes close to 4 million words from over 200 complete documents from 83 authors in 14 genres, covering a time span from the 16th to the 21st century, and adhering to different orthographic conventions.

Genre classification

Reproduction and Revival of the Argument Reasoning Comprehension Task

no code implementations LREC 2020 Jo{\~a}o Ant{\'o}nio Rodrigues, Ruben Branco, Jo{\~a}o Silva, Ant{\'o}nio Branco

Given a recent publication that pointed out spurious statistical cues in the data set used in the shared task, and that produced a revised version of it, we also evaluated the reproduced systems with this new data set.

Infrastructure for the Science and Technology of Language PORTULAN CLARIN

no code implementations LREC 2020 Ant{\'o}nio Branco, Am{\'a}lia Mendes, Paulo Quaresma, Lu{\'\i}s Gomes, Jo{\~a}o Silva, Andrea Teixeira

This paper presents the PORTULAN CLARIN Research Infrastructure for the Science and Technology of Language, which is part of the European research infrastructure CLARIN ERIC as its Portuguese national node, and belongs to the Portuguese National Roadmap of Research Infrastructures of Strategic Relevance.

Computational Complexity of Natural Languages: A Reasoned Overview

no code implementations WS 2018 Ant{\'o}nio Branco

There has been an upsurge of research interest in natural language complexity.

Predicting Brain Activation with WordNet Embeddings

1 code implementation WS 2018 Jo{\~a}o Ant{\'o}nio Rodrigues, Ruben Branco, Jo{\~a}o Silva, Chakaveh Saedi, Ant{\'o}nio Branco

The task of taking a semantic representation of a noun and predicting the brain activity triggered by it in terms of fMRI spatial patterns was pioneered by Mitchell et al. 2008.

Word Embeddings

Ways of Asking and Replying in Duplicate Question Detection

no code implementations SEMEVAL 2017 Jo{\~a}o Ant{\'o}nio Rodrigues, Chakaveh Saedi, Vladislav Maraev, Jo{\~a}o Silva, Ant{\'o}nio Branco

This paper presents the results of systematic experimentation on the impact in duplicate question detection of different types of questions across both a number of established approaches and a novel, superior one used to address this language processing task.

Machine Translation Question Answering +1

Bootstrapping a Hybrid MT System to a New Language Pair

no code implementations LREC 2016 Jo{\~a}o Ant{\'o}nio Rodrigues, Nuno Rendeiro, Andreia Querido, Sanja {\v{S}}tajner, Ant{\'o}nio Branco

The usual concern when opting for a rule-based or a hybrid machine translation (MT) system is how much effort is required to adapt the system to a different language pair or a new domain.

Machine Translation Translation

Word Sense-Aware Machine Translation: Including Senses as Contextual Features for Improved Translation Models

no code implementations LREC 2016 Steven Neale, Lu{\'\i}s Gomes, Eneko Agirre, Oier Lopez de Lacalle, Ant{\'o}nio Branco

Although it is commonly assumed that word sense disambiguation (WSD) should help to improve lexical choice and improve the quality of machine translation systems, how to successfully integrate word senses into such systems remains an unanswered question.

Machine Translation Translation +1

Use of Domain-Specific Language Resources in Machine Translation

no code implementations LREC 2016 Sanja {\v{S}}tajner, Andreia Querido, Nuno Rendeiro, Jo{\~a}o Ant{\'o}nio Rodrigues, Ant{\'o}nio Branco

In this paper, we address the problem of Machine Translation (MT) for a specialised domain in a language pair for which only a very small domain-specific parallel corpus is available.

Machine Translation Translation

Evaluating Machine Translation in a Usage Scenario

no code implementations LREC 2016 Rosa Gaudio, Aljoscha Burchardt, Ant{\'o}nio Branco

In this document we report on a user-scenario-based evaluation aiming at assessing the performance of machine translation (MT) systems in a real context of use.

Machine Translation Translation

A PropBank for Portuguese: the CINTIL-PropBank

no code implementations LREC 2012 Ant{\'o}nio Branco, Catarina Carvalheiro, S{\'\i}lvia Pereira, Sara Silveira, Jo{\~a}o Silva, S{\'e}rgio Castro, Jo{\~a}o Gra{\c{c}}a

With the CINTIL-International Corpus of Portuguese, an ongoing corpus annotated with fully flegded grammatical representation, sentences get not only a high level of lexical, morphological and syntactic annotation but also a semantic analysis that prepares the data to a manual specification step and thus opens the way for a number of tools and resources for which there is a great research focus at the present.

TimeBankPT: A TimeML Annotated Corpus of Portuguese

no code implementations LREC 2012 Francisco Costa, Ant{\'o}nio Branco

This procedure allowed for the detection of some errors in the annotations, that also affect the original English corpus.

Machine Translation Temporal Information Extraction +1

Cannot find the paper you are looking for? You can Submit a new open access paper.