Search Results for author: Elena Volodina

Found 27 papers, 1 paper with code

Generation of Synthetic Error Data of Verb Order Errors for Swedish

no code implementations NAACL (BEA) 2022 Judit Casademont Moner, Elena Volodina

We report on our work-in-progress to generate a synthetic error dataset for Swedish by replicating errors observed in the authentic error annotated dataset.

Sentence

LEGATO: A flexible lexicographic annotation tool

no code implementations WS (NoDaLiDa) 2019 David Alfter, Therese Lindström Tiedemann, Elena Volodina

This article is a report from an ongoing project aiming at analyzing lexical and grammatical competences of Swedish as a Second language (L2).

Lexical Analysis Morphological Analysis

Grandma Karl is 27 years old -- research agenda for pseudonymization of research data

no code implementations 30 Aug 2023 Elena Volodina, Simon Dobnik, Therese Lindström Tiedemann, Xuan-Son Vu

Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g. names or political opinions.

Crowdsourcing Relative Rankings of Multi-Word Expressions: Experts versus Non-Experts

no code implementations 17 Jun 2022 David Alfter, Therese Lindström Tiedemann, Elena Volodina

In this study we investigate to which degree experts and non-experts agree on questions of difficulty in a crowdsourcing experiment.

DaLAJ - a dataset for linguistic acceptability judgments for Swedish: Format, baseline, sharing

no code implementations 14 May 2021 Elena Volodina, Yousuf Ali Mohammed, Julia Klezl

We present DaLAJ 1.0, a Dataset for Linguistic Acceptability Judgments for Swedish, comprising 9 596 sentences in its first version; and the initial experiment using it for the binary classification task.

Binary Classification Linguistic Acceptability

Towards Privacy by Design in Learner Corpora Research: A Case of On-the-fly Pseudonymization of Swedish Learner Essays

no code implementations COLING 2020 Elena Volodina, Yousuf Ali Mohammed, Sandra Derbring, Arild Matsson, Beata Megyesi

The process includes three steps: identification of personal information in an unstructured text, labeling for a category, and pseudonymization.

Investigating the importance of linguistic complexity features across different datasets related to language learning

no code implementations WS 2018 Ildikó Pilán, Elena Volodina

We present the results of our investigations aiming at identifying the most informative linguistic complexity features for classifying language learning levels in three different datasets.

General Classification Reading Comprehension

Towards Single Word Lexical Complexity Prediction

no code implementations WS 2018 David Alfter, Elena Volodina

In this paper we present work-in-progress where we investigate the usefulness of previously created word lists to the task of single-word lexical complexity analysis and prediction of the complexity level for learners of Swedish as a second language.

General Classification Lexical Complexity Prediction

Candidate sentence selection for language learning exercises: from a comprehensive framework to an empirical evaluation

no code implementations 12 Jun 2017 Ildikó Pilán, Elena Volodina, Lars Borin

We present a framework and its implementation relying on Natural Language Processing methods, which aims at the identification of exercise item candidates from corpora.

Sentence

Coursebook Texts as a Helping Hand for Classifying Linguistic Complexity in Language Learners' Writings

no code implementations WS 2016 Ildikó Pilán, David Alfter, Elena Volodina

We bring together knowledge from two different types of language learning data, texts learners read and texts they write, to improve linguistic complexity classification in the latter.

Classification Domain Adaptation

SVALex: a CEFR-graded Lexical Resource for Swedish Foreign and Second Language Learners

no code implementations LREC 2016 Thomas François, Elena Volodina, Ildikó Pilán, Anaïs Tack

The paper introduces SVALex, a lexical resource primarily aimed at learners and teachers of Swedish as a foreign and second language that describes the distribution of 15,681 words and expressions across the Common European Framework of Reference (CEFR).

A Readable Read: Automatic Assessment of Language Learning Materials based on Linguistic Complexity

no code implementations 29 Mar 2016 Ildikó Pilán, Sowmya Vajjala, Elena Volodina

Corpora and web texts can become a rich language learning resource if we have a means of assessing whether they are linguistically appropriate for learners at a given proficiency level.

Sentence

Reusing Swedish FrameNet for training semantic roles

no code implementations LREC 2014 Ildikó Pilán, Elena Volodina

In this article we present the first experiences of reusing the Swedish FrameNet (SweFN) as a resource for training semantic roles.

Multiple-choice

Introducing the Swedish Kelly-list, a new lexical e-resource for Swedish

no code implementations LREC 2012 Elena Volodina, Sofie Johansson Kokkinakis

We provide a short description of the KELLY project; examine the methodological approach and mention some details on the compiling of the corpus from which the list has been derived.

Information Retrieval Retrieval
