Search Results for author: Johann-Mattis List

Found 21 papers, 17 papers with code

The SIGTYP 2022 Shared Task on the Prediction of Cognate Reflexes

1 code implementation NAACL (SIGTYP) 2022 Johann-Mattis List, Ekaterina Vylomova, Robert Forkel, Nathan Hill, Ryan Cotterell

This study describes the structure and the results of the SIGTYP 2022 shared task on the prediction of cognate reflexes from multilingual wordlists.

Image Restoration

A Computational Model for the Assessment of Mutual Intelligibility Among Closely Related Languages

2 code implementations5 Feb 2024 Jessica Nieder, Johann-Mattis List

Closely related languages show linguistic similarities that allow speakers of one language to understand speakers of another language without having actively learned it.

Are Sounds Sound for Phylogenetic Reconstruction?

1 code implementation5 Feb 2024 Luise Häuser, Gerhard Jäger, Taraka Rama, Johann-Mattis List, Alexandros Stamatakis

In traditional studies on language evolution, scholars often emphasize the importance of sound laws and sound correspondences for phylogenetic inference of language family trees.

Representing and Computing Uncertainty in Phonological Reconstruction

1 code implementation19 Oct 2023 Johann-Mattis List, Nathan W. Hill, Robert Forkel, Frederic Blum

Despite the inherently fuzzy nature of reconstructions in historical linguistics, most scholars do not represent their uncertainty when proposing proto-forms.

Trimming Phonetic Alignments Improves the Inference of Sound Correspondence Patterns from Multilingual Wordlists

no code implementations31 Mar 2023 Frederic Blum, Johann-Mattis List

Sound correspondence patterns form the basis of cognate detection and phonological reconstruction in historical language comparison.

Inference of Partial Colexifications from Multilingual Wordlists

1 code implementation1 Feb 2023 Johann-Mattis List

The past years have seen a drastic rise in studies devoted to the investigation of colexification patterns in individual languages families in particular and the languages of the world in specific.

Detecting Lexical Borrowings from Dominant Languages in Multilingual Wordlists

1 code implementation1 Feb 2023 John E. Miller, Johann-Mattis List

Language contact is a pervasive phenomenon reflected in the borrowing of words from donor to recipient languages.

A New Framework for Fast Automated Phonological Reconstruction Using Trimmed Alignments and Sound Correspondence Patterns

1 code implementation LChange (ACL) 2022 Johann-Mattis List, Robert Forkel, Nathan W. Hill

Computational approaches in historical linguistics have been increasingly applied during the past decade and many new methods that implement parts of the traditional comparative method have been proposed.

CLDFBench: Give Your Cross-Linguistic Data a Lift

1 code implementation LREC 2020 Robert Forkel, Johann-Mattis List

With cldfbench, we introduce a framework for the retro-standardization of legacy data and the curation of new datasets that drastically simplifies the creation of CLDF by providing a consistent, reproducible workflow that rigorously supports version control and long term archiving of research data and code.

An Automated Framework for Fast Cognate Detection and Bayesian Phylogenetic Inference in Computational Historical Linguistics

1 code implementation ACL 2019 Taraka Rama, Johann-Mattis List

We present a fully automated workflow for phylogenetic reconstruction on large datasets, consisting of two novel methods, one for fast detection of cognates and one for fast Bayesian phylogenetic inference.

Automatic Inference of Sound Correspondence Patterns across Multiple Languages

2 code implementations CL 2019 Johann-Mattis List

By excluding those patterns that occur in only a few cognate sets, the core of regularly recurring sound correspondences can be inferred.

Are Automatic Methods for Cognate Detection Good Enough for Phylogenetic Reconstruction in Historical Linguistics?

1 code implementation NAACL 2018 Taraka Rama, Johann-Mattis List, Johannes Wahle, Gerhard Jäger

We evaluate the performance of state-of-the-art algorithms for automatic cognate detection by comparing how useful automatically inferred cognates are for the task of phylogenetic inference compared to classical manually annotated cognate sets.

A Web-Based Interactive Tool for Creating, Inspecting, Editing, and Publishing Etymological Datasets

1 code implementation EACL 2017 Johann-Mattis List

The paper presents the Etymological DICtionary ediTOR (EDICTOR), a free, interactive, web-based tool designed to aid historical linguists in creating, editing, analysing, and publishing etymological datasets.

Word Alignment

Using support vector machines and state-of-the-art algorithms for phonetic alignment to identify cognates in multi-lingual wordlists

no code implementations EACL 2017 Gerhard J{\"a}ger, Johann-Mattis List, Pavel Sofroniev

Most current approaches in phylogenetic linguistics require as input multilingual word lists partitioned into sets of etymologically related words (cognates).

Concepticon: A Resource for the Linking of Concept Lists

1 code implementation LREC 2016 Johann-Mattis List, Michael Cysouw, Robert Forkel

We present an attempt to link the large amount of different concept lists which are used in the linguistic literature, ranging from Swadesh lists in historical linguistics to naming tests in clinical studies and psycholinguistics.

A Benchmark Database of Phonetic Alignments in Historical Linguistics and Dialectology

1 code implementation LREC 2014 Johann-Mattis List, Jelena Proki{\'c}

In the last two decades, alignment analyses have become an important technique in quantitative historical linguistics and dialectology.

Cannot find the paper you are looking for? You can Submit a new open access paper.