Search Results for author: Ugur Sahin

Found 2 papers, 0 papers with code

Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining

no code implementations • 7 Nov 2023 • Ugur Sahin, Hang Li, Qadeer Khan, Daniel Cremers, Volker Tresp

Leveraging these generative hard negative samples, we significantly enhance VLMs' performance in tasks involving multimodal compositional reasoning.

Introducing Model Inversion Attacks on Automatic Speaker Recognition

no code implementations • 9 Jan 2023 • Karla Pizzi, Franziska Boenisch, Ugur Sahin, Konstantin Böttinger

To the best of our knowledge, our work is the first to extend model inversion (MI) attacks to audio data, and our results highlight the security risks arising from the extraction of biometric data in that setup.

Speaker Recognition
