no code implementations • 29 Jan 2024 • Michael Feffer, Anusha Sinha, Zachary C. Lipton, Hoda Heidari
In response to rising concerns surrounding the safety, security, and trustworthiness of Generative AI (GenAI) models, practitioners and regulators alike have pointed to AI red-teaming as a key component of their strategies for identifying and mitigating these risks.
no code implementations • 7 Apr 2022 • Violet Turri, Rachel Dzombak, Eric Heim, Nathan VanHoudnos, Jay Palat, Anusha Sinha
Current test and evaluation (T&E) methods for assessing machine learning (ML) system performance often rely on incomplete metrics.