Search Results for author: Zhuowen Yuan

Found 3 papers, 1 papers with code

RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content

no code implementations19 Mar 2024 Zhuowen Yuan, Zidi Xiong, Yi Zeng, Ning Yu, Ruoxi Jia, Dawn Song, Bo Li

The innovative use of constrained optimization and a fusion-based guardrail approach represents a significant step forward in developing more secure and reliable LLMs, setting a new standard for content moderation frameworks in the face of evolving digital threats.

Data Augmentation

SecretGen: Privacy Recovery on Pre-Trained Models via Distribution Discrimination

1 code implementation25 Jul 2022 Zhuowen Yuan, Fan Wu, Yunhui Long, Chaowei Xiao, Bo Li

We first explore different statistical information which can discriminate the private training distribution from other distributions.

Model Selection Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.