no code implementations • 7 Nov 2023 • Ugur Sahin, Hang Li, Qadeer Khan, Daniel Cremers, Volker Tresp
Leveraging these generative hard negative samples, we significantly enhance VLMs' performance in tasks involving multimodal compositional reasoning.
no code implementations • 9 Jan 2023 • Karla Pizzi, Franziska Boenisch, Ugur Sahin, Konstantin Böttinger
To the best of our knowledge, our work is the first one extending MI attacks to audio data, and our results highlight the security risks resulting from the extraction of the biometric data in that setup.