Can Self-Supervised Neural Representations Pre-Trained on Human Speech Distinguish Animal Callers?

23 May 2023  ·  Eklavya Sarkar, Mathew Magimai.-Doss

Self-supervised learning (SSL) models use only the intrinsic structure of a given signal, independent of its acoustic domain, to extract essential information from the input into an embedding space. This implies that the utility of such representations is not limited to modeling human speech alone. Building on this understanding, this paper explores the cross-transferability of SSL neural representations learned from human speech to analyze bio-acoustic signals. We conduct a caller discrimination analysis and a caller detection study on marmoset vocalizations using eleven SSL models pre-trained with various pretext tasks. The results show that the embedding spaces carry meaningful caller information and can successfully distinguish the individual identities of marmoset callers without fine-tuning. This demonstrates that representations pre-trained on human speech can be effectively applied to the bio-acoustics domain, providing valuable insights for future investigations in this field.
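As a rough illustration of this setup, the sketch below embeds calls with a frozen SSL model pre-trained on human speech and fits a light-weight classifier on mean-pooled utterance embeddings. This is a minimal sketch under assumptions, not the paper's exact pipeline: WavLM is one of the eleven evaluated models, but the `microsoft/wavlm-base` checkpoint, the mean pooling, the logistic-regression back end, and the file names and caller labels are all illustrative placeholders.

```python
# Minimal sketch (not the paper's exact pipeline): embed calls with a frozen
# SSL model pre-trained on human speech, then fit a light-weight classifier.
# Checkpoint choice, mean pooling, and the logistic-regression back end are
# illustrative assumptions.
import torch
import torchaudio
from sklearn.linear_model import LogisticRegression
from transformers import Wav2Vec2FeatureExtractor, WavLMModel

extractor = Wav2Vec2FeatureExtractor.from_pretrained("microsoft/wavlm-base")
ssl_model = WavLMModel.from_pretrained("microsoft/wavlm-base").eval()

def embed(path: str) -> torch.Tensor:
    """Return one utterance-level embedding: mean over SSL frame outputs."""
    wav, sr = torchaudio.load(path)
    wav = torchaudio.functional.resample(wav, sr, 16000).mean(dim=0)  # mono, 16 kHz
    inputs = extractor(wav.numpy(), sampling_rate=16000, return_tensors="pt")
    with torch.no_grad():
        frames = ssl_model(**inputs).last_hidden_state  # (1, T, 768)
    return frames.mean(dim=1).squeeze(0)                # (768,)

# Placeholder file names and caller labels, not the actual corpus layout.
train_files = ["caller3_call_01.wav", "caller7_call_01.wav"]
train_labels = [3, 7]
X = torch.stack([embed(f) for f in train_files]).numpy()
clf = LogisticRegression(max_iter=1000).fit(X, train_labels)
```

Because the SSL front end stays frozen (no fine-tuning), comparing models reduces to swapping the checkpoint and re-fitting the classifier.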


Datasets

InfantMarmosetsVox

Task              Dataset             Model         Metric     Value    Global Rank
Caller Detection  InfantMarmosetsVox  NPC           Macro AUC  77.32    #1
Caller Detection  InfantMarmosetsVox  WavLM         Macro AUC  0.786    #2
Caller Detection  InfantMarmosetsVox  VQ-APC        Macro AUC  0.7845   #3
Caller Detection  InfantMarmosetsVox  DistilHuBERT  Macro AUC  0.7626   #4
Caller Detection  InfantMarmosetsVox  TERA          Macro AUC  0.7403   #5
Caller Detection  InfantMarmosetsVox  Data2Vec      Macro AUC  0.7304   #6
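
The metric in the table is macro-averaged AUC. Assuming the detector outputs per-caller posterior probabilities, one plausible way to compute such a score is scikit-learn's one-vs-rest `roc_auc_score`; the labels and scores below are toy values, not results from the paper.

```python
# Toy macro-AUC computation (one-vs-rest), assuming per-caller posteriors.
import numpy as np
from sklearn.metrics import roc_auc_score

y_true = np.array([0, 1, 2, 1, 0, 2])  # toy caller labels
y_score = np.array([                   # rows sum to 1: per-caller posteriors
    [0.7, 0.2, 0.1],
    [0.1, 0.8, 0.1],
    [0.2, 0.2, 0.6],
    [0.3, 0.5, 0.2],
    [0.6, 0.3, 0.1],
    [0.1, 0.3, 0.6],
])
macro_auc = roc_auc_score(y_true, y_score, multi_class="ovr", average="macro")
print(f"Macro AUC: {macro_auc:.4f}")
```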
