no code implementations • 1 Mar 2024 • Margherita Martorana, Tobias Kuhn, Lise Stork, Jacco van Ossenbruggen
This work proposes a novel approach that leverages LLMs for text classification using a controlled topic vocabulary, which has the potential to facilitate automated metadata enrichment, thereby enhancing dataset retrieval and the Findability, Accessibility, Interoperability and Reusability (FAIR) of research data on the Web.
1 code implementation • 3 Mar 2022 • Cristina-Iulia Bucur, Tobias Kuhn, Davide Ceolin, Jacco van Ossenbruggen
With the rapidly increasing amount of scientific literature, it is becoming ever more difficult for researchers in different disciplines to keep up with recent findings in their field of study. Processing scientific articles in an automated fashion has been proposed as a solution to this problem, but the accuracy of such processing remains very poor for extraction tasks beyond the basic ones. Few approaches have tried to change how we publish scientific results in the first place, by making articles machine-interpretable by expressing them with formal semantics from the start. In the work presented here, we set out to demonstrate that we can formally publish high-level scientific claims in formal logic, and publish the results in a special issue of an existing journal. We use the concept and technology of nanopublications for this endeavor, and represent not just the submissions and final papers in this RDF-based format, but also the whole process in between, including reviews, responses, and decisions. We do this by performing a field study with what we call formalization papers, which contribute a novel formalization of a previously published claim. We received 15 submissions from 18 authors, who then went through the whole publication process leading to the publication of their contributions in the special issue. Our evaluation shows the technical and practical feasibility of our approach. The participating authors mostly showed high levels of interest and confidence, and mostly experienced the process as not very difficult, despite the technical nature of the current user interfaces. We believe that these results indicate that it is possible to publish scientific results from different fields with machine-interpretable semantics from the start, which in turn opens countless possibilities to radically improve the effectiveness and efficiency of the scientific endeavor as a whole in the future.
no code implementations • 27 Sep 2021 • Cristina-Iulia Bucur, Tobias Kuhn, Davide Ceolin, Jacco van Ossenbruggen
Analyzing the main claims from a sample of scientific articles from all disciplines, we find that their semantics are more complex than what a straightforward application of formalisms like RDF or OWL accounts for, but we managed to elicit a clear semantic pattern which we call the 'super-pattern'.
1 code implementation • COLING (LAW) 2020 • Timo Lek, Anna de Groot, Tobias Kuhn, Roser Morante
Research in Computational Linguistics is dependent on text corpora for training and testing new tools and methodologies.
2 code implementations • 20 Nov 2019 • Remzi Celebi, Joao Rebelo Moreira, Ahmed A. Hassan, Sandeep Ayyar, Lars Ridder, Tobias Kuhn, Michel Dumontier
Our evaluation shows the high degree to which our FAIRified OpenPREDICT workflow now adheres to the FAIR principles and the practicality and usefulness of being able to answer our new competency questions.
no code implementations • 26 Aug 2019 • Tobias Kuhn, Steven Bourke, Levin Brinkmann, Tobias Buchwald, Conor Digan, Hendrik Hache, Sebastian Jaeger, Patrick Lehmann, Oskar Maier, Stefan Matting, Yura Okulovsky
We do this by surfacing relevant items of clothing during the outfit building process based on what our stylist is doing and what the preferences of our customer are.
no code implementations • 24 Jul 2017 • Tom Jansen, Tobias Kuhn
The number of scientific articles has grown rapidly over the years and there are no signs that this growth will slow down in the near future.
no code implementations • 9 May 2016 • Tobias Kuhn
It is rare that texts or entire books written in a Controlled Natural Language (CNL) become very popular, but exactly this has happened with a book that was published last year.
no code implementations • 23 Sep 2015 • Kurt Winkler, Tobias Kuhn
Overall, the output from the catalogue system can be considered virtually equivalent to a text written by avalanche forecasters and then manually translated by professional translators.
2 code implementations • 20 Aug 2015 • Tobias Kuhn
The concept of nanopublications was first proposed about six years ago, but it lacked openly available implementations.
Digital Libraries
no code implementations • CL 2014 • Tobias Kuhn
The goal of this article is to provide a common terminology and a common model for CNL, to contribute to the understanding of their general nature, to provide a starting point for researchers interested in the area, and to help developers to make design decisions.
1 code implementation • 11 Nov 2014 • Tobias Kuhn, Christine Chichester, Michael Krauthammer, Michel Dumontier
Making available and archiving scientific results is for the most part still considered the task of classical publishing companies, despite the fact that classical forms of publishing centered around printed narrative articles no longer seem well-suited in the digital age.
Digital Libraries
no code implementations • 23 May 2014 • Kurt Winkler, Tobias Kuhn, Martin Volk
Now that the catalogue-based system has been operational for two winter seasons, we assess the quality of the produced texts in an evaluation where participants rate real danger descriptions from both origins: the catalogue of phrases and the manually written and translated texts.
1 code implementation • 16 Jan 2014 • Tobias Kuhn, Michel Dumontier
To make digital resources on the web verifiable, immutable, and permanent, we propose a technique to include cryptographic hash values in URIs.
Cryptography and Security • Digital Libraries • Networking and Internet Architecture • H.3.4; H.3.5
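The idea of embedding a cryptographic hash in a URI can be illustrated with a minimal sketch. This is not the encoding defined in the paper: the use of SHA-256, the base64url token, the `.` separator, and the function names `make_hash_uri` and `verify_hash_uri` are all assumptions chosen for illustration. It only shows the general principle that a client holding the URI can check that retrieved content has not changed.

```python
import base64
import hashlib


def make_hash_uri(base_uri: str, content: bytes) -> str:
    """Append a base64url-encoded SHA-256 digest of `content` to `base_uri`.

    Hypothetical layout: <base_uri>.<token>. The real scheme may differ.
    """
    digest = hashlib.sha256(content).digest()
    # base64url output never contains '.', so '.' is a safe separator here.
    token = base64.urlsafe_b64encode(digest).decode("ascii").rstrip("=")
    return f"{base_uri}.{token}"


def verify_hash_uri(uri: str, content: bytes) -> bool:
    """Return True if the hash token in `uri` matches `content`."""
    base_uri, sep, _token = uri.rpartition(".")
    if not sep:
        return False  # no token present
    return make_hash_uri(base_uri, content) == uri


if __name__ == "__main__":
    data = b"example dataset, version 1"
    uri = make_hash_uri("http://example.org/r1", data)
    print(uri)
    print(verify_hash_uri(uri, data))               # content unchanged
    print(verify_hash_uri(uri, data + b" edited"))  # content modified
```

Because the identifier depends on the content, any modification produces a different URI, which is what makes a resource referenced this way verifiable and effectively immutable.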
no code implementations • 12 Nov 2013 • Tobias Kuhn, Alexandre Bergel
Writing documentation about software internals is rarely considered a rewarding activity.