Search Results for author: Prachi Singh

Found 12 papers, 5 papers with code

Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization

no code implementations • 23 Jan 2024 • Prachi Singh, Sriram Ganapathy

Speaker diarization, the task of segmenting an audio recording based on speaker identity, constitutes an important speech pre-processing step for several downstream applications.

Clustering Graph Clustering +4

Paper
Add Code

Summary of the DISPLACE Challenge 2023 -- DIarization of SPeaker and LAnguage in Conversational Environments

no code implementations • 21 Nov 2023 • Shikha Baghel, Shreyas Ramoji, Somil Jain, Pratik Roy Chowdhuri, Prachi Singh, Deepu Vijayasenan, Sriram Ganapathy

In multi-lingual societies, where multiple languages are spoken in a small geographic vicinity, informal conversations often involve mix of languages.

speaker-diarization Speaker Diarization

Paper
Add Code

DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments

no code implementations • 1 Mar 2023 • Shikha Baghel, Shreyas Ramoji, Sidharth, Ranjana H, Prachi Singh, Somil Jain, Pratik Roy Chowdhuri, Kaustubh Kulkarni, Swapnil Padhi, Deepu Vijayasenan, Sriram Ganapathy

The challenge attempts to highlight outstanding issues in speaker diarization (SD) in multilingual settings with code-mixing.

speaker-diarization Speaker Diarization

Paper
Add Code

Audio Retrieval for Multimodal Design Documents: A New Dataset and Algorithms

no code implementations • 28 Feb 2023 • Prachi Singh, Srikrishna Karanam, Sumit Shekhar

We consider and propose a new problem of retrieving audio files relevant to multimodal design document inputs comprising both textual elements and visual imagery, e. g., birthday/greeting cards.

Retrieval

Paper
Add Code

Supervised Hierarchical Clustering using Graph Neural Networks for Speaker Diarization

no code implementations • 24 Feb 2023 • Prachi Singh, Amrit Kaul, Sriram Ganapathy

We also propose an approach to jointly update the embedding extractor and the GNN model to perform end-to-end speaker diarization (E2E-SHARC).

Clustering Graph Clustering +2

Paper
Add Code

Self-Supervised Metric Learning With Graph Clustering For Speaker Diarization

1 code implementation • 14 Sep 2021 • Prachi Singh, Sriram Ganapathy

In this paper, we propose an approach that jointly learns the speaker embeddings and the similarity metric using principles of self-supervised learning.

Clustering Graph Clustering +5

Paper
Code

Self-supervised Representation Learning With Path Integral Clustering For Speaker Diarization

1 code implementation • 19 Apr 2021 • Prachi Singh, Sriram Ganapathy

In this paper, we propose a representation learning and clustering algorithm that can be iteratively performed for improved speaker diarization.

Clustering Representation Learning +3

Paper
Code

LEAP Submission for the Third DIHARD Diarization Challenge

no code implementations • 6 Apr 2021 • Prachi Singh, Rajat Varma, Venkat Krishnamohan, Srikanth Raj Chetupalli, Sriram Ganapathy

This paper describes the challenge submission, the post-evaluation analysis and improvements observed on the DIHARD-III dataset.

Clustering speaker-diarization +1

Paper
Add Code

The Third DIHARD Diarization Challenge

3 code implementations • 2 Dec 2020 • Neville Ryant, Prachi Singh, Venkat Krishnamohan, Rajat Varma, Kenneth Church, Christopher Cieri, Jun Du, Sriram Ganapathy, Mark Liberman

DIHARD III was the third in a series of speaker diarization challenges intended to improve the robustness of diarization systems to variability in recording equipment, noise conditions, and conversational domain.

speaker-diarization Speaker Diarization +1

5,086

Paper
Code

Deep Self-Supervised Hierarchical Clustering for Speaker Diarization

1 code implementation • 10 Aug 2020 • Prachi Singh, Sriram Ganapathy

In this paper, we propose a novel algorithm for hierarchical clustering which combines the speaker clustering along with a representation learning framework.

Audio and Speech Processing

Paper
Code

LEAP System for SRE19 CTS Challenge -- Improvements and Error Analysis

no code implementations • 7 Feb 2020 • Shreyas Ramoji, Prashant Krishnan, Bhargavram Mysore, Prachi Singh, Sriram Ganapathy

In this paper, we provide a detailed account of the LEAP SRE system submitted to the CTS challenge focusing on the novel components in the back-end system modeling.

Speaker Recognition Speaker Verification

Paper
Add Code

Pairwise Discriminative Neural PLDA for Speaker Verification

1 code implementation • 20 Jan 2020 • Shreyas Ramoji, Prashant Krishnan V, Prachi Singh, Sriram Ganapathy

The pre-processing steps of linear discriminant analysis (LDA), unit length normalization and within class covariance normalization are all modeled as layers of a neural model and the speaker verification cost functions can be back-propagated through these layers during training.

Speaker Verification

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.