Search Results for author: Afra Alishahi

Found 21 papers, 12 papers with code

Encoding of lexical tone in self-supervised models of spoken language

no code implementations · 25 Mar 2024 · Gaofei Shen, Michaela Watkins, Afra Alishahi, Arianna Bisazza, Grzegorz Chrupała

Interpretability research has shown that self-supervised Spoken Language Models (SLMs) encode a wide variety of features in human speech, ranging from the acoustic, phonetic, phonological, syntactic, and semantic levels to speaker characteristics.

Homophone Disambiguation Reveals Patterns of Context Mixing in Speech Transformers

1 code implementation · 15 Oct 2023 · Hosein Mohebbi, Grzegorz Chrupała, Willem Zuidema, Afra Alishahi

Transformers have become a key architecture in speech processing, but our understanding of how they build up representations of acoustic and linguistic structure is limited.

Tasks: Decoder, Speech Recognition +1

Wave to Syntax: Probing spoken language models for syntax

1 code implementation · 30 May 2023 · Gaofei Shen, Afra Alishahi, Arianna Bisazza, Grzegorz Chrupała

Understanding which information is encoded in deep models of spoken and written language has been the focus of much research in recent years, as it is crucial for debugging and improving these architectures.

Quantifying Context Mixing in Transformers

1 code implementation · 30 Jan 2023 · Hosein Mohebbi, Willem Zuidema, Grzegorz Chrupała, Afra Alishahi

Self-attention weights and their transformed variants have been the main source of information for analyzing token-to-token interactions in Transformer-based models.
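As a hedged illustration of the attention-based baseline this paper examines (not its proposed measure), the sketch below reads raw self-attention weights from a pretrained Transformer as a token-to-token mixing map; the checkpoint and sentence are placeholders.

```python
# Hedged sketch: read raw self-attention weights as a token-to-token
# mixing map. Checkpoint and sentence are placeholders; this is the
# attention-based baseline the paper examines, not its proposed measure.
import torch
from transformers import AutoModel, AutoTokenizer

name = "bert-base-uncased"  # placeholder checkpoint
tok = AutoTokenizer.from_pretrained(name)
model = AutoModel.from_pretrained(name, output_attentions=True)

inputs = tok("The keys to the cabinet are on the table", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs)

# out.attentions: one (batch, heads, seq, seq) tensor per layer.
# Average over heads to get a per-layer mixing map.
mix = out.attentions[0].mean(dim=1)[0]  # (seq, seq), layer 0
tokens = tok.convert_ids_to_tokens(inputs["input_ids"][0])
for i, t in enumerate(tokens):
    print(f"{t:>12} attends most to {tokens[mix[i].argmax().item()]}")
```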

Learning English with Peppa Pig

1 code implementation · 25 Feb 2022 · Mitja Nikolaus, Afra Alishahi, Grzegorz Chrupała

In the real world, the coupling between the linguistic and the visual modality is loose, and is often confounded by correlations with non-semantic aspects of the speech signal.

Tasks: Descriptive

Discrete representations in neural models of spoken language

1 code implementation · EMNLP (BlackboxNLP) 2021 · Bertrand Higy, Lieke Gelderloos, Afra Alishahi, Grzegorz Chrupała

The distributed and continuous representations used by neural networks are at odds with representations employed in linguistics, which are typically symbolic.

Tasks: Attribute, Quantization
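Vector quantization is one standard way to impose discrete, symbol-like units on continuous activations. The sketch below is a minimal, hypothetical illustration of that general idea (nearest-codebook lookup), not the exact architectures evaluated in the paper.

```python
# Hedged sketch of vector quantization: map continuous activations to
# their nearest entries in a discrete codebook. Sizes are arbitrary;
# this illustrates the general technique, not the paper's models.
import torch

def quantize(z, codebook):
    """z: (n, d) continuous vectors; codebook: (k, d) discrete units."""
    ids = torch.cdist(z, codebook).argmin(dim=1)  # nearest code per vector
    return codebook[ids], ids

z = torch.randn(5, 16)          # toy continuous activations
codebook = torch.randn(32, 16)  # 32 hypothetical "symbols"
zq, ids = quantize(z, codebook)
print(ids)  # discrete code indices, one per input vector
```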

Learning to Understand Child-directed and Adult-directed Speech

no code implementations · ACL 2020 · Lieke Gelderloos, Grzegorz Chrupała, Afra Alishahi

Speech directed to children differs from adult-directed speech in linguistic aspects such as repetition, word choice, and sentence length, as well as in aspects of the speech signal itself, such as prosodic and phonemic variation.

Tasks: Language Acquisition, Sentence

Analyzing analytical methods: The case of phonology in neural models of spoken language

1 code implementation · ACL 2020 · Grzegorz Chrupała, Bertrand Higy, Afra Alishahi

Given the fast development of analysis techniques for NLP and speech processing systems, few systematic studies have been conducted to compare the strengths and weaknesses of each method.

Bootstrapping Disjoint Datasets for Multilingual Multimodal Representation Learning

no code implementations · 9 Nov 2019 · Ákos Kádár, Grzegorz Chrupała, Afra Alishahi, Desmond Elliott

However, we do find that using an external machine translation model to generate the synthetic data sets results in better performance.

Tasks: Machine Translation, Representation Learning +4

Correlating neural and symbolic representations of language

1 code implementation · ACL 2019 · Grzegorz Chrupała, Afra Alishahi

Analysis methods which enable us to better understand the representations and functioning of neural models of language are increasingly needed as deep learning becomes the dominant approach in NLP.
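One family of methods used in this line of work is representational similarity analysis (RSA), which compares two representation spaces through their pairwise-similarity structure. The sketch below is a minimal illustration on synthetic data; the inputs and metric choices are assumptions, not the paper's exact setup.

```python
# Hedged sketch of representational similarity analysis (RSA) on
# synthetic data: correlate the pairwise-dissimilarity structure of a
# neural space with that of a symbolic feature space. All inputs and
# metric choices here are assumptions, not the paper's setup.
import numpy as np
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr

rng = np.random.default_rng(0)
neural = rng.normal(size=(50, 128))    # stand-in sentence activations
symbolic = rng.normal(size=(50, 32))   # stand-in symbolic feature vectors

rdm_a = pdist(neural, metric="cosine")   # condensed dissimilarity vectors
rdm_b = pdist(symbolic, metric="cosine")
rho, _ = spearmanr(rdm_a, rdm_b)         # second-order correlation
print(rho)
```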

Analyzing and Interpreting Neural Networks for NLP: A Report on the First BlackboxNLP Workshop

no code implementations · 5 Apr 2019 · Afra Alishahi, Grzegorz Chrupała, Tal Linzen

The EMNLP 2018 workshop BlackboxNLP was dedicated to resources and techniques specifically developed for analyzing and understanding the inner workings and representations acquired by neural models of language.

Revisiting the Hierarchical Multiscale LSTM

no code implementations · COLING 2018 · Ákos Kádár, Marc-Alexandre Côté, Grzegorz Chrupała, Afra Alishahi

Hierarchical Multiscale LSTM (Chung et al., 2016a) is a state-of-the-art language model that learns interpretable structure from character-level input.

Tasks: Language Modelling

On the difficulty of a distributional semantics of spoken language

no code implementations · WS 2019 · Grzegorz Chrupała, Lieke Gelderloos, Ákos Kádár, Afra Alishahi

In the domain of unsupervised learning most work on speech has focused on discovering low-level constructs such as phoneme inventories or word-like units.

Encoding of phonology in a recurrent neural model of grounded speech

1 code implementation · CoNLL 2017 · Afra Alishahi, Marie Barking, Grzegorz Chrupała

We study the representation and encoding of phonemes in a recurrent neural network model of grounded speech.

Tasks: Clustering
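Analyses of this kind typically rely on diagnostic (probing) classifiers: a simple model trained to predict linguistic labels from hidden activations. The sketch below shows the general recipe on synthetic stand-in data; the activation source and labels are placeholders, not the paper's data.

```python
# Hedged sketch of a diagnostic (probing) classifier on synthetic
# stand-in data: predict phoneme labels from hidden activations.
# A real analysis would use activations from the grounded-speech model.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
acts = rng.normal(size=(1000, 64))          # stand-in hidden states
phonemes = rng.integers(0, 10, size=1000)   # stand-in phoneme labels

X_tr, X_te, y_tr, y_te = train_test_split(acts, phonemes, random_state=0)
probe = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
# Accuracy clearly above chance (0.1 here) would suggest phoneme
# identity is linearly decodable from the representation.
print(probe.score(X_te, y_te))
```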

Representations of language in a model of visually grounded speech signal

2 code implementations · ACL 2017 · Grzegorz Chrupała, Lieke Gelderloos, Afra Alishahi

We present a visually grounded model of speech perception which projects spoken utterances and images to a joint semantic space.
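A common training objective for such joint-space models is a margin-based ranking loss that scores matching utterance-image pairs above mismatched ones. The sketch below is a minimal version of that objective with random tensors standing in for encoder outputs; it is an illustrative assumption, not the paper's exact loss or architecture.

```python
# Hedged sketch of a margin-based ranking loss over a shared semantic
# space, with random tensors standing in for speech and image encoder
# outputs. The margin and batch handling are illustrative assumptions.
import torch
import torch.nn.functional as F

def ranking_loss(speech_emb, image_emb, margin=0.2):
    # Cosine similarity between every utterance and every image in the batch.
    s = F.normalize(speech_emb, dim=1) @ F.normalize(image_emb, dim=1).T
    pos = s.diag()  # matching pairs sit on the diagonal
    cost = (margin + s - pos.unsqueeze(0)).clamp(min=0) \
         + (margin + s - pos.unsqueeze(1)).clamp(min=0)
    cost.fill_diagonal_(0)  # matched pairs incur no cost against themselves
    return cost.mean()

speech = torch.randn(8, 128)  # toy speech-encoder outputs
images = torch.randn(8, 128)  # toy image-encoder outputs
print(ranking_loss(speech, images))
```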

Representation of linguistic form and function in recurrent neural networks

1 code implementation · CL 2017 · Ákos Kádár, Grzegorz Chrupała, Afra Alishahi

We present novel methods for analyzing the activation patterns of RNNs from a linguistic point of view and explore the types of linguistic structure they learn.

Tasks: Language Modelling, Sentence +1

Learning language through pictures

1 code implementation · IJCNLP 2015 · Grzegorz Chrupała, Ákos Kádár, Afra Alishahi

We propose Imaginet, a model of learning visually grounded representations of language from coupled textual and visual input.

Tasks: Sentence, Word Embeddings
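A minimal sketch of the multi-task idea behind Imaginet follows: a shared word embedding feeds two pathways, one predicting visual features of the associated image and one predicting the next word. All dimensions and module choices below are illustrative assumptions, not the published configuration.

```python
# Hedged sketch of the multi-task idea: a shared embedding feeds a
# visual pathway (predict image features) and a textual pathway
# (predict the next word). Sizes and modules are illustrative only.
import torch
import torch.nn as nn

class TinyImaginet(nn.Module):
    def __init__(self, vocab=1000, dim=64, img_dim=128):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim)  # shared between tasks
        self.visual = nn.GRU(dim, dim, batch_first=True)
        self.textual = nn.GRU(dim, dim, batch_first=True)
        self.to_img = nn.Linear(dim, img_dim)  # image-feature prediction
        self.to_word = nn.Linear(dim, vocab)   # next-word prediction

    def forward(self, words):                  # words: (batch, seq)
        e = self.embed(words)
        v, _ = self.visual(e)
        t, _ = self.textual(e)
        return self.to_img(v[:, -1]), self.to_word(t)

model = TinyImaginet()
img_pred, word_logits = model(torch.randint(0, 1000, (2, 7)))
print(img_pred.shape, word_logits.shape)  # (2, 128) and (2, 7, 1000)
```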
