2 code implementations • LREC 2022 • Rémi Calizzano, Malte Ostendorff, Qian Ruan, Georg Rehm
Almost all summarisation methods and datasets focus on a single language and short summaries.
1 code implementation • LREC 2022 • Zdenka Uresova, Karolina Zaczynska, Peter Bourgonje, Eva Fučíková, Georg Rehm, Jan Hajic
We also outline the next steps for adapting the annotation process, data structures, formats and tools so that adding a new language in the future becomes smoother and more efficient, and possibly so that various teams can work on SynSemClass extensions to many languages concurrently.
1 code implementation • LREC 2022 • Niklas Dehio, Malte Ostendorff, Georg Rehm
We investigate an automated approach to extract legal claims from news articles and to match the claims with their corresponding applicable laws.
no code implementations • EAMT 2022 • Itziar Aldabe, Jane Dunne, Aritz Farwell, Owen Gallagher, Federico Gaspari, Maria Giagkou, Jan Hajic, Jens Peter Kückens, Teresa Lynn, Georg Rehm, German Rigau, Katrin Marheinecke, Stelios Piperidis, Natalia Resende, Tea Vojtěchová, Andy Way
This paper provides an overview of the ongoing European Language Equality (ELE) project, an 18-month action funded by the European Commission which involves 52 partners.
1 code implementation • ACL (WOAH) 2021 • Dmitrii Aksenov, Peter Bourgonje, Karolina Zaczynska, Malte Ostendorff, Julian Moreno-Schneider, Georg Rehm
We present a data set consisting of German news articles labeled for political bias on a five-point scale in a semi-supervised way.
1 code implementation • GermEval 2021 • Remi Calizzano, Malte Ostendorff, Georg Rehm
Finally, the combination of the two techniques allows us to obtain an F1 score of 0.6899 with XLM-RoBERTa and 0.6859 with MT5.
no code implementations • TDLE (LREC) 2022 • Annika Grützner-Zahn, Georg Rehm
To assess the current state of play with regard to Europe’s languages, we developed, in the project European Language Equality, a metric for digital language equality that consists of two parts, technological and contextual (i.e., non-technological) factors.
no code implementations • TDLE (LREC) 2022 • Federico Gaspari, Owen Gallagher, Georg Rehm, Maria Giagkou, Stelios Piperidis, Jane Dunne, Andy Way
The paper situates this ongoing work with a strong European focus in the broader context of related efforts, and explains how the DLE Metric can help track the progress towards DLE for all languages of Europe, focusing in particular on the role played by the TFs.
no code implementations • LREC 2022 • Michael Raring, Malte Ostendorff, Georg Rehm
Essential is the automated processing of text segments extracted from different content resources by identifying the relevance of a text segment to a topic and its semantic relation to other text segments.
no code implementations • LREC 2022 • Melina Plakidis, Georg Rehm
We present a dataset consisting of German offensive and non-offensive tweets, annotated for speech acts.
no code implementations • ISA (LREC) 2022 • Julian Moreno-Schneider, Rémi Calizzano, Florian Kintzel, Georg Rehm, Dimitris Galanis, Ian Roberts
Interoperability is a necessity for the resolution of complex tasks that require the interconnection of several NLP services.
1 code implementation • 17 Apr 2024 • Orhun Caglidil, Malte Ostendorff, Georg Rehm
However, prior research has primarily focused on the English language, especially in the context of gender bias.
no code implementations • 12 Apr 2024 • Raia Abu Ahmad, Jennifer D'Souza, Matthäus Zloch, Wolfgang Otto, Georg Rehm, Allard Oelen, Stefan Dietze, Sören Auer
We design a specific application of the ORKG-Dataset semantic model based on 40 diverse research datasets on scientific information extraction.
no code implementations • 23 Jan 2023 • Malte Ostendorff, Georg Rehm
To address this problem, we introduce a cross-lingual and progressive transfer learning approach, called CLP-Transfer, that transfers models from a source language, for which pretrained models are publicly available, like English, to a new target language.
1 code implementation • 29 Apr 2022 • Konstantin Schulz, Jens Rauenbusch, Jan Fillies, Lisa Rutenburg, Dimitrios Karvelas, Georg Rehm
The increasingly rapid spread of information about COVID-19 on the web calls for automatic measures of quality assurance.
1 code implementation • 28 Mar 2022 • Malte Ostendorff, Till Blume, Terry Ruas, Bela Gipp, Georg Rehm
We compare and analyze three generic document embeddings, six specialized document embeddings and a pairwise classification baseline in the context of research paper recommendations.
no code implementations • Findings (ACL) 2022 • Qian Ruan, Malte Ostendorff, Georg Rehm
Using various experimental settings on three datasets (i.e., CNN/DailyMail, PubMed and arXiv), our HiStruct+ model outperforms a strong baseline collectively, which differs from our model only in that the hierarchical structure information is not injected.
Ranked #12 on Text Summarization on Pubmed
1 code implementation • 14 Feb 2022 • Malte Ostendorff, Nils Rethmeier, Isabelle Augenstein, Bela Gipp, Georg Rehm
Learning scientific document representations can be substantially improved through contrastive learning objectives, where the challenge lies in creating positive and negative training samples that encode the desired similarity semantics.
Ranked #1 on Document Classification on SciDocs (MeSH)
1 code implementation • 28 Apr 2021 • Malte Ostendorff, Elliott Ash, Terry Ruas, Bela Gipp, Julian Moreno-Schneider, Georg Rehm
Simultaneously, legal recommender systems are typically evaluated in small-scale user studies without any publicly available benchmark datasets.
no code implementations • EACL 2021 • Georg Rehm, Stelios Piperidis, Kalina Bontcheva, Jan Hajic, Victoria Arranz, Andrejs Vasiļjevs, Gerhard Backfried, Jose Manuel Gomez-Perez, Ulrich Germann, Rémi Calizzano, Nils Feldhus, Stefanie Hegele, Florian Kintzel, Katrin Marheinecke, Julian Moreno-Schneider, Dimitris Galanis, Penny Labropoulou, Miltos Deligiannis, Katerina Gkirtzou, Athanasia Kolovou, Dimitris Gkoumas, Leon Voukoutis, Ian Roberts, Jana Hamrlova, Dusan Varis, Lukas Kacena, Khalid Choukri, Valérie Mapelli, Mickaël Rigault, Julija Melnika, Miro Janosik, Katja Prinz, Andres Garcia-Silva, Cristian Berrio, Ondrej Klejch, Steve Renals
Europe is a multilingual society, in which dozens of languages are spoken.
1 code implementation • COLING 2020 • Malte Ostendorff, Terry Ruas, Till Blume, Bela Gipp, Georg Rehm
Our findings motivate future research of aspect-based document similarity and the development of a recommender system based on the evaluated techniques.
no code implementations • LREC 2020 • Julian Moreno-Schneider, Peter Bourgonje, Florian Kintzel, Georg Rehm
We present a workflow manager for the flexible creation and customisation of NLP processing pipelines.
no code implementations • 25 Apr 2020 • Georg Rehm, Peter Bourgonje, Stefanie Hegele, Florian Kintzel, Julián Moreno Schneider, Malte Ostendorff, Karolina Zaczynska, Armin Berger, Stefan Grill, Sören Räuchle, Jens Rauenbusch, Lisa Rutenburg, André Schmidt, Mikka Wild, Henry Hoffmann, Julian Fink, Sarah Schulz, Jurica Seva, Joachim Quantz, Joachim Böttger, Josefine Matthey, Rolf Fricke, Jan Thomsen, Adrian Paschke, Jamal Al Qundus, Thomas Hoppe, Naouel Karam, Frauke Weichhardt, Christian Fillies, Clemens Neudecker, Mike Gerber, Kai Labusch, Vahid Rezanezhad, Robin Schaefer, David Zellhöfer, Daniel Siewert, Patrick Bunk, Lydia Pintscher, Elena Aleynikova, Franziska Heine
In all domains and sectors, the demand for intelligent systems to support the processing and generation of digital content is rapidly increasing.
no code implementations • 25 Apr 2020 • Georg Rehm, Karolina Zaczynska, Julián Moreno-Schneider, Malte Ostendorff, Peter Bourgonje, Maria Berger, Jens Rauenbusch, André Schmidt, Mikka Wild
Previous work of ours on Semantic Storytelling uses text analytics procedures including Named Entity Recognition and Event Detection.
no code implementations • 21 Apr 2020 • Georg Rehm
Annotations can be annotated themselves using more abstract annotations.
1 code implementation • LREC 2020 • Georg Rehm, Dimitrios Galanis, Penny Labropoulou, Stelios Piperidis, Martin Welß, Ricardo Usbeck, Joachim Köhler, Miltos Deligiannis, Katerina Gkirtzou, Johannes Fischer, Christian Chiarcos, Nils Feldhus, Julián Moreno-Schneider, Florian Kintzel, Elena Montiel, Víctor Rodríguez Doncel, John P. McCrae, David Laqua, Irina Patricia Theile, Christian Dittmar, Kalina Bontcheva, Ian Roberts, Andrejs Vasiljevs, Andis Lagzdiņš
With regard to the wider area of AI/LT platform interoperability, we concentrate on two core aspects: (1) cross-platform search and discovery of resources and services; (2) composition of cross-platform service workflows.
no code implementations • 16 Apr 2020 • Julián Moreno-Schneider, Peter Bourgonje, Florian Kintzel, Georg Rehm
We present a workflow manager for the flexible creation and customisation of NLP processing pipelines.
no code implementations • LREC 2020 • Penny Labropoulou, Katerina Gkirtzou, Maria Gavriilidou, Miltos Deligiannis, Dimitrios Galanis, Stelios Piperidis, Georg Rehm, Maria Berger, Valérie Mapelli, Mickaël Rigault, Victoria Arranz, Khalid Choukri, Gerhard Backfried, José Manuel Gómez Pérez, Andres Garcia Silva
In this paper we present ELG-SHARE, a rich metadata schema catering for the description of Language Resources and Technologies (processing and generation services and tools, models, corpora, term lists, etc.
no code implementations • LREC 2020 • Georg Rehm, Katrin Marheinecke, Stefanie Hegele, Stelios Piperidis, Kalina Bontcheva, Jan Hajič, Khalid Choukri, Andrejs Vasiļjevs, Gerhard Backfried, Christoph Prinz, José Manuel Gómez Pérez, Luc Meertens, Paul Lukowicz, Josef van Genabith, Andrea Lösch, Philipp Slusallek, Morten Irgens, Patrick Gatellier, Joachim Köhler, Laure Le Bars, Dimitra Anastasiou, Albina Auksoriūtė, Núria Bel, António Branco, Gerhard Budin, Walter Daelemans, Koenraad De Smedt, Radovan Garabík, Maria Gavriilidou, Dagmar Gromann, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Jan Odijk, Maciej Ogrodniczuk, Eiríkur Rögnvaldsson, Mike Rosner, Bolette Sandford Pedersen, Inguna Skadiņa, Marko Tadić, Dan Tufiş, Tamás Váradi, Kadri Vider, Andy Way, François Yvon
Multilingualism is a cultural cornerstone of Europe and firmly anchored in the European treaties including full language equality.
no code implementations • LREC 2020 • Georg Rehm, Maria Berger, Ela Elsholz, Stefanie Hegele, Florian Kintzel, Katrin Marheinecke, Stelios Piperidis, Miltos Deligiannis, Dimitris Galanis, Katerina Gkirtzou, Penny Labropoulou, Kalina Bontcheva, David Jones, Ian Roberts, Jan Hajic, Jana Hamrlová, Lukáš Kačena, Khalid Choukri, Victoria Arranz, Andrejs Vasiļjevs, Orians Anvari, Andis Lagzdiņš, Jūlija Meļņika, Gerhard Backfried, Erinç Dikici, Miroslav Janosik, Katja Prinz, Christoph Prinz, Severin Stampler, Dorothea Thomas-Aniola, José Manuel Gómez Pérez, Andres Garcia Silva, Christian Berrío, Ulrich Germann, Steve Renals, Ondrej Klejch
With 24 official EU and many additional languages, multilingualism in Europe and an inclusive Digital Single Market can only be enabled through Language Technologies (LTs).
1 code implementation • LREC 2020 • Dmitrii Aksenov, Julián Moreno-Schneider, Peter Bourgonje, Robert Schwarzenberg, Leonhard Hennig, Georg Rehm
The results of our models are compared to a baseline and the state-of-the-art models on the CNN/Daily Mail dataset.
no code implementations • LREC 2020 • Sarah Schulz, Jurica Ševa, Samuel Rodriguez, Malte Ostendorff, Georg Rehm
We present a new corpus comprising annotations of medical entities in case reports, originating from PubMed Central's open access library.
1 code implementation • LREC 2020 • Elena Leitner, Georg Rehm, Julián Moreno-Schneider
We describe a dataset developed for Named Entity Recognition in German federal court decisions.
no code implementations • LREC 2020 • Julián Moreno-Schneider, Georg Rehm, Elena Montiel-Ponsoda, Víctor Rodriguez-Doncel, Artem Revenko, Sotirios Karampatakis, Maria Khvalchik, Christian Sageder, Jorge Gracia, Filippo Maganza
Legal technology is currently receiving a lot of attention from various angles.
4 code implementations • 22 Mar 2020 • Malte Ostendorff, Terry Ruas, Moritz Schubotz, Georg Rehm, Bela Gipp
In this paper, we model the problem of finding the relationship between two documents as a pairwise document classification task.
1 code implementation • KONVENS / GermEval 2019 • Malte Ostendorff, Peter Bourgonje, Maria Berger, Julian Moreno-Schneider, Georg Rehm, Bela Gipp
In this paper, we focus on the classification of books using short descriptive texts (cover blurbs) and additional metadata.
no code implementations • WS 2019 • Georg Rehm, Julián Moreno-Schneider, Jorge Gracia, Artem Revenko, Victor Mireles, Maria Khvalchik, Ilan Kernerman, Andis Lagzdins, Marcis Pinnis, Artus Vasilevskis, Elena Leitner, Jan Milde, Pia Weißenhorn
We present a portfolio of natural legal language processing and document curation services currently under development in a collaborative European project.
no code implementations • WS 2017 • Julian Moreno-Schneider, Ankit Srivastava, Peter Bourgonje, David Wabnitz, Georg Rehm
We present a prototypical content curation dashboard, to be used in the newsroom, and several of its underlying semantic content analysis components (such as named entity recognition, entity linking, summarisation and temporal expression analysis).
no code implementations • WS 2017 • Peter Bourgonje, Julian Moreno Schneider, Georg Rehm
We present a system for the detection of the stance of headlines with regard to their corresponding article bodies.
no code implementations • WS 2017 • Georg Rehm, Julian Moreno Schneider, Peter Bourgonje, Ankit Srivastava, Jan Nehring, Armin Berger, Luca König, Sören Räuchle, Jens Gerth
We present an approach to identifying a specific class of events, movement action events (MAEs), in a data set that consists of ca.
no code implementations • SEMEVAL 2017 • Ankit Srivastava, Georg Rehm, Julian Moreno Schneider
We describe our submissions for SemEval-2017 Task 8, Determining Rumour Veracity and Support for Rumours.
no code implementations • CONLL 2017 • Daniel Zeman, Martin Popel, Milan Straka, Jan Hajič, Joakim Nivre, Filip Ginter, Juhani Luotolahti, Sampo Pyysalo, Slav Petrov, Martin Potthast, Francis Tyers, Elena Badmaeva, Memduh Gokirmak, Anna Nedoluzhko, Silvie Cinková, Jan Hajič jr., Jaroslava Hlaváčová, Václava Kettnerová, Zdeňka Urešová, Jenna Kanerva, Stina Ojala, Anna Missilä, Christopher D. Manning, Sebastian Schuster, Siva Reddy, Dima Taji, Nizar Habash, Herman Leung, Marie-Catherine de Marneffe, Manuela Sanguinetti, Maria Simi, Hiroshi Kanayama, Valeria de Paiva, Kira Droganova, Héctor Martínez Alonso, Çağrı Çöltekin, Umut Sulubacak, Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Georg Rehm, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Michael Mandl, Jesse Kirchner, Hector Fernandez Alcalde, Jana Strnadová, Esha Banerjee, Ruli Manurung, Antonio Stella, Atsuko Shimada, Sookyoung Kwak, Gustavo Mendonça, Tatiana Lando, Rattima Nitisaroj, Josie Li
The Conference on Computational Natural Language Learning (CoNLL) features a shared task, in which participants train and test their learning systems on the same data sets.
no code implementations • LREC 2016 • Georg Rehm
The different phases that involve creating and distributing an LR can be conceptualised as a life cycle.
no code implementations • LREC 2016 • Georg Rehm, Jan Hajič, Josef van Genabith, Andrejs Vasiljevs
META-NET is a European network of excellence, founded in 2010, that consists of 60 research centres in 34 European countries.
no code implementations • LREC 2014 • Stelios Piperidis, Harris Papageorgiou, Christian Spurk, Georg Rehm, Khalid Choukri, Olivier Hamon, Nicoletta Calzolari, Riccardo Del Gratta, Bernardo Magnini, Christian Girardi
This paper presents META-SHARE (www.meta-share.eu), an open language resource infrastructure, and its usage since its Europe-wide deployment in early 2013.
no code implementations • LREC 2014 • Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, Núria Bel, Audronė Bielevičienė, Lars Borin, António Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garabík, Marko Grobelnik, Carmen García-Mateo, Josef van Genabith, Jan Hajič, Inma Hernáez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Joseph Mariani, John McNaught, Maite Melero, Monica Monachini, Asunción Moreno, Jan Odijk, Maciej Ogrodniczuk, Piotr Pęzik, Stelios Piperidis, Adam Przepiórkowski, Eiríkur Rögnvaldsson, Michael Rosner, Bolette Pedersen, Inguna Skadiņa, Koenraad De Smedt, Marko Tadić, Paul Thompson, Dan Tufiş, Tamás Váradi, Andrejs Vasiļjevs, Kadri Vider, Jolanta Zabarskaite
This article provides an overview of the dissemination work carried out in META-NET from 2010 until early 2014; we describe its impact on the regional, national and international level, mainly with regard to politics and the situation of funding for LT topics.