Search Results for author: Simon Goldstein

Found 1 papers, 0 papers with code

AI Deception: A Survey of Examples, Risks, and Potential Solutions

no code implementations28 Aug 2023 Peter S. Park, Simon Goldstein, Aidan O'Gara, Michael Chen, Dan Hendrycks

This paper argues that a range of current AI systems have learned how to deceive humans.

Cannot find the paper you are looking for? You can Submit a new open access paper.