Language Acquisition
69 papers with code • 1 benchmarks • 6 datasets
Language acquisition refers to tasks related to the learning of a second language.
Libraries
Use these libraries to find Language Acquisition models and implementationsDatasets
Latest papers
BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models
Self-supervised techniques for learning speech representations have been shown to develop linguistic competence from exposure to speech without the need for human labels.
How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Accurate syntactic representations are essential for robust generalization in natural language.
Analysing the Impact of Audio Quality on the Use of Naturalistic Long-Form Recordings for Infant-Directed Speech Research
Our results show that the use of modest and high audio quality naturalistic speech data result in largely similar conclusions on IDS and ADS in terms of acoustic analyses and modelling experiments.
Computational Language Acquisition with Theory of Mind
We also find some evidence that increasing task difficulty in the training process results in more fluent and precise utterances in evaluation.
Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus
In partnership with CoNLL and CMCL, we provide a platform for approaches to pretraining with a limited-size corpus sourced from data inspired by the input to children.
Learning a Grammar Inducer from Massive Uncurated Instructional Videos
While previous work focuses on building systems for inducing grammars on text that are well-aligned with video content, we investigate the scenario, in which text and video are only in loose correspondence.
Is neural language acquisition similar to natural? A chronological probing study
The probing methodology allows one to obtain a partial representation of linguistic phenomena stored in the inner layers of the neural network, using external classifiers and statistical analysis.
Color Overmodification Emerges from Data-Driven Learning and Pragmatic Reasoning
It seems likely that these patterns are shaped by the environment a speaker is exposed to in complex ways.
A Computational Acquisition Model for Multimodal Word Categorization
Recent advances in self-supervised modeling of text and images open new opportunities for computational models of child language acquisition, which is believed to rely heavily on cross-modal signals.
YACLC: A Chinese Learner Corpus with Multidimensional Annotation
This resource is of great relevance for second language acquisition research, foreign-language teaching, and automatic grammatical error correction.