Search Results for author: Anthony Larcher

Found 18 papers, 2 papers with code

Overlaps and Gender Analysis in the Context of Broadcast Media

no code implementations • LREC 2022 • Martin Lebourdais, Marie Tahon, Antoine Laurent, Sylvain Meignier, Anthony Larcher

Our main goal is to study the interactions between speakers according to their gender and role in broadcast media.

Paper
Add Code

Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection

no code implementations • 13 Feb 2024 • Théo Mariotte, Anthony Larcher, Silvio Montrésor, Jean-Hugh Thomas

A channel-number invariant loss is proposed to learn a unique feature representation regardless of the number of available microphones.

Action Detection Activity Detection +2

Paper
Add Code

Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains

no code implementations • 24 Jul 2023 • Martin Lebourdais, Théo Mariotte, Marie Tahon, Anthony Larcher, Antoine Laurent, Silvio Montresor, Sylvain Meignier, Jean-Hugh Thomas

Voice activity and overlapped speech detection (respectively VAD and OSD) are key pre-processing tasks for speaker diarization.

Multi-class Classification speaker-diarization +1

Paper
Add Code

Multi-microphone Automatic Speech Segmentation in Meetings Based on Circular Harmonics Features

no code implementations • 7 Jun 2023 • Théo Mariotte, Anthony Larcher, Silvio Montrésor, Jean-Hugh Thomas

Pipeline systems rely on speech segmentation to extract speakers' segments and achieve robust speaker diarization.

Action Detection Activity Detection +4

Paper
Add Code

Evaluation of Speaker Anonymization on Emotional Speech

no code implementations • 15 Apr 2023 • Hubert Nourtel, Pierre Champion, Denis Jouvet, Anthony Larcher, Marie Tahon

This paper studies the impact of the speaker anonymization baseline system of the VPC on emotional information present in speech utterances.

Automatic Speech Recognition Emotion Recognition +3

Paper
Add Code

Are disentangled representations all you need to build speaker anonymization systems?

no code implementations • 22 Aug 2022 • Pierre Champion, Denis Jouvet, Anthony Larcher

We propose enhancing the disentanglement by removing speaker information from the acoustic model using vector quantization.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Privacy-Preserving Speech Representation Learning using Vector Quantization

no code implementations • 15 Mar 2022 • Pierre Champion, Denis Jouvet, Anthony Larcher

With the popularity of virtual assistants (e. g., Siri, Alexa), the use of speech recognition is now becoming more and more widespread. However, speech signals contain a lot of sensitive information, such as the speaker's identity, which raises privacy concerns. The presented experiments show that the representations extracted by the deep layers of speech recognition networks contain speaker information. This paper aims to produce an anonymous representation while preserving speech recognition performance. To this end, we propose to use vector quantization to constrain the representation space and induce the network to suppress the speaker identity. The choice of the quantization dictionary size allows to configure the trade-off between utility (speech recognition) and privacy (speaker identity concealment).

Privacy Preserving Quantization +3

Paper
Add Code

On the invertibility of a voice privacy system using embedding alignement

1 code implementation • 8 Oct 2021 • Pierre Champion, Thomas Thebaud, Gaël Le Lan, Anthony Larcher, Denis Jouvet

This paper explores various attack scenarios on a voice anonymization system using embeddings alignment techniques.

Translation

Paper
Code

Evaluating X-vector-based Speaker Anonymization under White-box Assessment

no code implementations • 24 Sep 2021 • Pierre Champion, Denis Jouvet, Anthony Larcher

In the scenario of the Voice Privacy challenge, anonymization is achieved by converting all utterances from a source speaker to match the same target identity; this identity being randomly selected.

Paper
Add Code

A Study of F0 Modification for X-Vector Based Speech Pseudonymization Across Gender

no code implementations • 21 Jan 2021 • Pierre Champion, Denis Jouvet, Anthony Larcher

Speech pseudonymization aims at altering a speech signal to map the identifiable personal characteristics of a given speaker to another identity.

Paper
Add Code

End-to-end anti-spoofing with RawNet2

1 code implementation • 2 Nov 2020 • Hemlata Tak, Jose Patino, Massimiliano Todisco, Andreas Nautsch, Nicholas Evans, Anthony Larcher

Spoofing countermeasures aim to protect automatic speaker verification systems from attempts to manipulate their reliability with the use of spoofed speech signals.

Speaker Verification

Paper
Code

\'Evaluation de syst\`emes apprenant tout au long de la vie (Evaluation of lifelong learning systems )

no code implementations • JEPTALNRECITAL 2020 • Yevhenii Prokopalo, Sylvain Meignier, Olivier Galibert, Lo{\"\i}c Barrault, Anthony Larcher

Une adaptation de leur mod{\`e}le par des experts en apprentissage automatique est possible mais tr{\`e}s co{\^u}teuse alors que les soci{\'e}t{\'e}s utilisant ces syst{\`e}mes disposent d{'}experts du domaine qui pourraient accompagner ces syst{\`e}mes dans un apprentissage tout au long de la vie.

Paper
Add Code

Evaluation of Lifelong Learning Systems

no code implementations • LREC 2020 • Yevhenii Prokopalo, Sylvain Meignier, Olivier Galibert, Loic Barrault, Anthony Larcher

Current intelligent systems need the expensive support of machine learning experts to sustain their performance level when used on a daily basis.

BIG-bench Machine Learning

Paper
Add Code

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences

no code implementations • 16 Apr 2019 • Kong Aik Lee, Ville Hautamaki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Hector Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Huy Dat Tran, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-Francois Bonastre, Cheng-Lin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas Evans

The I4U consortium was established to facilitate a joint entry to NIST speaker recognition evaluations (SRE).

Domain Adaptation Speaker Recognition

Paper
Add Code

Autoapprentissage pour le regroupement en locuteurs : premi\`eres investigations (First investigations on self trained speaker diarization )

no code implementations • JEPTALNRECITAL 2016 • Ga{\"e}l Le Lan, Sylvain Meignier, Delphine Charlet, Anthony Larcher

This paper investigates self trained cross-show speaker diarization applied to collections of French TV archives, based on an \textit{i-vector/PLDA} framework.

Domain Adaptation speaker-diarization +1

Paper
Add Code

Exploration de param\`etres acoustiques d\'eriv\'es de GMM pour l'adaptation non supervis\'ee de mod\`eles acoustiques \`a base de r\'eseaux de neurones profonds (Exploring GMM-derived features for unsupervised adaptation of deep neural network acoustic models)

no code implementations • JEPTALNRECITAL 2016 • Natalia Tomashenko, Yuri Khokhlov, Anthony Larcher, Yannick Est{\`e}ve

L{'}{\'e}tude pr{\'e}sent{\'e}e dans cet article am{\'e}liore une m{\'e}thode r{\'e}cemment propos{\'e}e pour l{'}adaptation de mod{\`e}les acoustiques markoviens coupl{\'e}s {\`a} un r{\'e}seau de neurones profond (DNN-HMM).

Paper
Add Code

Fantastic 4 system for NIST 2015 Language Recognition Evaluation

no code implementations • 5 Feb 2016 • Kong Aik Lee, Ville Hautamäki, Anthony Larcher, Wei Rao, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Aleksandr Sizov, Ivan Kukanov, Amir Poorjam, Trung Ngo Trong, Xiong Xiao, Cheng-Lin Xu, Hai-Hua Xu, Bin Ma, Haizhou Li, Sylvain Meignier

This article describes the systems jointly submitted by Institute for Infocomm (I$^2$R), the Laboratoire d'Informatique de l'Universit\'e du Maine (LIUM), Nanyang Technology University (NTU) and the University of Eastern Finland (UEF) for 2015 NIST Language Recognition Evaluation (LRE).

regression

Paper
Add Code

Analyse en Composante Principale pour l'extraction des i-vecteurs en v\'erification du locuteur (Principal Component Analysis for i-vector extraction in speaker verification.) [in French]

no code implementations • JEPTALNRECITAL 2012 • Anthony Larcher, Pierre-Michel Bousquet, Driss Matrouf, Jean-Francois Bonastre

Dimensionality Reduction Speaker Verification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.