Search Results for author: David Glukhov

Found 2 papers, 0 papers with code

LLM Censorship: A Machine Learning Challenge or a Computer Security Problem?

no code implementations • 20 Jul 2023 • David Glukhov, Ilia Shumailov, Yarin Gal, Nicolas Papernot, Vardan Papyan

Specifically, we demonstrate that semantic censorship can be perceived as an undecidable problem, highlighting the inherent challenges in censorship that arise due to LLMs' programmatic and instruction-following capabilities.

Computer Security Instruction Following

Paper
Add Code

Augment then Smooth: Reconciling Differential Privacy with Certified Robustness

no code implementations • 14 Jun 2023 • Jiapeng Wu, Atiyeh Ashari Ghomi, David Glukhov, Jesse C. Cresswell, Franziska Boenisch, Nicolas Papernot

Differential privacy and randomized smoothing are effective defenses that provide certifiable guarantees for each of these threats, however, it is not well understood how implementing either defense impacts the other.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.