1 code implementation • 17 Apr 2024 • Amit Kumar Singh Yadav, Kratika Bhagtani, Davide Salvi, Paolo Bestagini, Edward J. Delp
In this work, we examine bias in existing synthetic speech detectors to determine whether they unfairly target particular gender, age, and accent groups.
no code implementations • 22 Feb 2024 • Amit Kumar Singh Yadav, Ziyue Xiang, Kratika Bhagtani, Paolo Bestagini, Stefano Tubaro, Edward J. Delp
We evaluate the detection performance of PS3DT on the ASVspoof2019 dataset.
no code implementations • 6 Apr 2023 • Amit Kumar Singh Yadav, Kratika Bhagtani, Ziyue Xiang, Paolo Bestagini, Stefano Tubaro, Edward J. Delp
We also visualize the representations obtained from DSVAE for 17 different speech synthesizers and verify that they are indeed interpretable and discriminate between bona fide speech and synthetic speech from each of the synthesizers.
no code implementations • 26 Apr 2022 • Kratika Bhagtani, Amit Kumar Singh Yadav, Emily R. Bartusiak, Ziyue Xiang, Ruiting Shao, Sriram Baireddy, Edward J. Delp
In this paper, we review recent work in media forensics for digital images, video, audio (specifically speech), and documents.