Search Results for author: Tobias Kuhn

Found 15 papers, 6 papers with code

Text classification of column headers with a controlled vocabulary: leveraging LLMs for metadata enrichment

no code implementations1 Mar 2024 Margherita Martorana, Tobias Kuhn, Lise Stork, Jacco van Ossenbruggen

This work proposes a novel approach that leverages LLMs for text classification using a controlled topic vocabulary, which has the potential to facilitate automated metadata enrichment, thereby enhancing dataset retrieval and the Findability, Accessibility, Interoperability and Reusability (FAIR) of research data on the Web.

Retrieval text-classification +2

Nanopublication-Based Semantic Publishing and Reviewing: A Field Study with Formalization Papers

1 code implementation3 Mar 2022 Cristina-Iulia Bucur, Tobias Kuhn, Davide Ceolin, Jacco van Ossenbruggen

With the rapidly increasing amount of scientific literature, it is getting continuously more difficult for researchers in different disciplines to be updated with the recent findings in their field of study. Processing scientific articles in an automated fashion has been proposed as a solution to this problem, but the accuracy of such processing remains very poor for extraction tasks beyond the basic ones. Few approaches have tried to change how we publish scientific results in the first place, by making articles machine-interpretable by expressing them with formal semantics from the start. In the work presented here, we set out to demonstrate that we can formally publish high-level scientific claims in formal logic, and publish the results in a special issue of an existing journal. We use the concept and technology of nanopublications for this endeavor, and represent not just the submissions and final papers in this RDF-based format, but also the whole process in between, including reviews, responses, and decisions. We do this by performing a field study with what we call formalization papers, which contribute a novel formalization of a previously published claim. We received 15 submissions from 18 authors, who then went through the whole publication process leading to the publication of their contributions in the special issue. Our evaluation shows the technical and practical feasibility of our approach. The participating authors mostly showed high levels of interest and confidence, and mostly experienced the process as not very difficult, despite the technical nature of the current user interfaces. We believe that these results indicate that it is possible to publish scientific results from different fields with machine-interpretable semantics from the start, which in turn opens countless possibilities to radically improve in the future the effectiveness and efficiency of the scientific endeavor as a whole.

Formal Logic

Expressing High-Level Scientific Claims with Formal Semantics

no code implementations27 Sep 2021 Cristina-Iulia Bucur, Tobias Kuhn, Davide Ceolin, Jacco van Ossenbruggen

Analyzing the main claims from a sample of scientific articles from all disciplines, we find that their semantics are more complex than what a straight-forward application of formalisms like RDF or OWL account for, but we managed to elicit a clear semantic pattern which we call the 'super-pattern'.

Formal Logic Vocal Bursts Intensity Prediction

Provenance for Linguistic Corpora Through Nanopublications

1 code implementation COLING (LAW) 2020 Timo Lek, Anna de Groot, Tobias Kuhn, Roser Morante

Research in Computational Linguistics is dependent on text corpora for training and testing new tools and methodologies.

Towards FAIR protocols and workflows: The OpenPREDICT case study

2 code implementations20 Nov 2019 Remzi Celebi, Joao Rebelo Moreira, Ahmed A. Hassan, Sandeep Ayyar, Lars Ridder, Tobias Kuhn, Michel Dumontier

Our evaluation shows the high degree to which our FAIRified OpenPREDICT workflow now adheres to the FAIR principles and the practicality and usefulness of being able to answer our new competency questions.

Supporting stylists by recommending fashion style

no code implementations26 Aug 2019 Tobias Kuhn, Steven Bourke, Levin Brinkmann, Tobias Buchwald, Conor Digan, Hendrik Hache, Sebastian Jaeger, Patrick Lehmann, Oskar Maier, Stefan Matting, Yura Okulovsky

We do this by surfacing relevant items of clothing during the outfit building process based on what our stylist is doing and what the preferences of our customer are.

Extracting Core Claims from Scientific Articles

no code implementations24 Jul 2017 Tom Jansen, Tobias Kuhn

The number of scientific articles has grown rapidly over the years and there are no signs that this growth will slow down in the near future.

Sentence

The Controlled Natural Language of Randall Munroe's Thing Explainer

no code implementations9 May 2016 Tobias Kuhn

It is rare that texts or entire books written in a Controlled Natural Language (CNL) become very popular, but exactly this has happened with a book that has been published last year.

Fully automatic multi-language translation with a catalogue of phrases - successful employment for the Swiss avalanche bulletin

no code implementations23 Sep 2015 Kurt Winkler, Tobias Kuhn

Overall, the output from the catalogue system can be considered virtually equivalent to a text written by avalanche forecasters and then manually translated by professional translators.

Translation

nanopub-java: A Java Library for Nanopublications

2 code implementations20 Aug 2015 Tobias Kuhn

The concept of nanopublications was first proposed about six years ago, but it lacked openly available implementations.

Digital Libraries

A Survey and Classification of Controlled Natural Languages

no code implementations CL 2014 Tobias Kuhn

The goal of this article is to provide a common terminology and a common model for CNL, to contribute to the understanding of their general nature, to provide a starting point for researchers interested in the area, and to help developers to make design decisions.

Classification General Classification +1

Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of Data

1 code implementation11 Nov 2014 Tobias Kuhn, Christine Chichester, Michael Krauthammer, Michel Dumontier

Making available and archiving scientific results is for the most part still considered the task of classical publishing companies, despite the fact that classical forms of publishing centered around printed narrative articles no longer seem well-suited in the digital age.

Digital Libraries

Evaluating the fully automatic multi-language translation of the Swiss avalanche bulletin

no code implementations23 May 2014 Kurt Winkler, Tobias Kuhn, Martin Volk

After being operational for two winter seasons, we assess here the quality of the produced texts based on an evaluation where participants rate real danger descriptions from both origins, the catalogue of phrases versus the manually written and translated texts.

Translation

Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linked Data

1 code implementation16 Jan 2014 Tobias Kuhn, Michel Dumontier

To make digital resources on the web verifiable, immutable, and permanent, we propose a technique to include cryptographic hash values in URIs.

Cryptography and Security Digital Libraries Networking and Internet Architecture H.3.4; H.3.5

Verifiable Source Code Documentation in Controlled Natural Language

no code implementations12 Nov 2013 Tobias Kuhn, Alexandre Bergel

Writing documentation about software internals is rarely considered a rewarding activity.

Cannot find the paper you are looking for? You can Submit a new open access paper.