Search Results for author: Kaspar Beelen

Found 6 papers, 5 papers with code

Metadata Might Make Language Models Better

no code implementations18 Nov 2022 Kaspar Beelen, Daniel van Strien

This paper discusses the benefits of including metadata when training language models on historical collections.

Language Modelling

MapReader: A Computer Vision Pipeline for the Semantic Exploration of Maps at Scale

1 code implementation30 Nov 2021 Kasra Hosseini, Daniel C. S. Wilson, Kaspar Beelen, Katherine McDonough

We present MapReader, a free, open-source software library written in Python for analyzing large map collections (scanned or born-digital).

16k Image Classification

Neural Language Models for Nineteenth-Century English

2 code implementations24 May 2021 Kasra Hosseini, Kaspar Beelen, Giovanni Colavizza, Mariona Coll Ardanuy

We present four types of neural language models trained on a large historical dataset of books in English, published between 1760-1900 and comprised of ~5. 1 billion tokens.

Language Modelling

Words are Malleable: Computing Semantic Shifts in Political and Media Discourse

1 code implementation15 Nov 2017 Hosein Azarbonyad, Mostafa Dehghani, Kaspar Beelen, Alexandra Arkut, Maarten Marx, Jaap Kamps

We propose an approach for detecting semantic shifts between different viewpoints--broadly defined as a set of texts that share a specific metadata feature, which can be a time-period, but also a social entity such as a political party.

Cannot find the paper you are looking for? You can Submit a new open access paper.