Search Results for author: Prachi Singh

Found 12 papers, 5 papers with code

Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization

no code implementations23 Jan 2024 Prachi Singh, Sriram Ganapathy

Speaker diarization, the task of segmenting an audio recording based on speaker identity, constitutes an important speech pre-processing step for several downstream applications.

Clustering Graph Clustering +4

Audio Retrieval for Multimodal Design Documents: A New Dataset and Algorithms

no code implementations28 Feb 2023 Prachi Singh, Srikrishna Karanam, Sumit Shekhar

We consider and propose a new problem of retrieving audio files relevant to multimodal design document inputs comprising both textual elements and visual imagery, e. g., birthday/greeting cards.

Retrieval

Supervised Hierarchical Clustering using Graph Neural Networks for Speaker Diarization

no code implementations24 Feb 2023 Prachi Singh, Amrit Kaul, Sriram Ganapathy

We also propose an approach to jointly update the embedding extractor and the GNN model to perform end-to-end speaker diarization (E2E-SHARC).

Clustering Graph Clustering +2

Self-Supervised Metric Learning With Graph Clustering For Speaker Diarization

1 code implementation14 Sep 2021 Prachi Singh, Sriram Ganapathy

In this paper, we propose an approach that jointly learns the speaker embeddings and the similarity metric using principles of self-supervised learning.

Clustering Graph Clustering +5

Self-supervised Representation Learning With Path Integral Clustering For Speaker Diarization

1 code implementation19 Apr 2021 Prachi Singh, Sriram Ganapathy

In this paper, we propose a representation learning and clustering algorithm that can be iteratively performed for improved speaker diarization.

Clustering Representation Learning +3

LEAP Submission for the Third DIHARD Diarization Challenge

no code implementations6 Apr 2021 Prachi Singh, Rajat Varma, Venkat Krishnamohan, Srikanth Raj Chetupalli, Sriram Ganapathy

This paper describes the challenge submission, the post-evaluation analysis and improvements observed on the DIHARD-III dataset.

Clustering speaker-diarization +1

The Third DIHARD Diarization Challenge

3 code implementations2 Dec 2020 Neville Ryant, Prachi Singh, Venkat Krishnamohan, Rajat Varma, Kenneth Church, Christopher Cieri, Jun Du, Sriram Ganapathy, Mark Liberman

DIHARD III was the third in a series of speaker diarization challenges intended to improve the robustness of diarization systems to variability in recording equipment, noise conditions, and conversational domain.

speaker-diarization Speaker Diarization +1

Deep Self-Supervised Hierarchical Clustering for Speaker Diarization

1 code implementation10 Aug 2020 Prachi Singh, Sriram Ganapathy

In this paper, we propose a novel algorithm for hierarchical clustering which combines the speaker clustering along with a representation learning framework.

Audio and Speech Processing

LEAP System for SRE19 CTS Challenge -- Improvements and Error Analysis

no code implementations7 Feb 2020 Shreyas Ramoji, Prashant Krishnan, Bhargavram Mysore, Prachi Singh, Sriram Ganapathy

In this paper, we provide a detailed account of the LEAP SRE system submitted to the CTS challenge focusing on the novel components in the back-end system modeling.

Speaker Recognition Speaker Verification

Pairwise Discriminative Neural PLDA for Speaker Verification

1 code implementation20 Jan 2020 Shreyas Ramoji, Prashant Krishnan V, Prachi Singh, Sriram Ganapathy

The pre-processing steps of linear discriminant analysis (LDA), unit length normalization and within class covariance normalization are all modeled as layers of a neural model and the speaker verification cost functions can be back-propagated through these layers during training.

Speaker Verification

Cannot find the paper you are looking for? You can Submit a new open access paper.