Search Results for author: Rebecca Gorman

Found 5 papers, 0 papers with code

CoinRun: Solving Goal Misgeneralisation

no code implementations · 28 Sep 2023 · Stuart Armstrong, Alexandre Maranhão, Oliver Daniels-Koch, Patrick Leask, Rebecca Gorman

Goal misgeneralisation is a key challenge in AI alignment -- the task of getting powerful Artificial Intelligences to align their goals with human intentions and human morality.

Strengthening the EU AI Act: Defining Key Terms on AI Manipulation

no code implementations · 30 Aug 2023 · Matija Franklin, Philip Moreira Tomei, Rebecca Gorman

The European Union's Artificial Intelligence Act aims to regulate manipulative and harmful uses of AI, but lacks precise definitions for key concepts.

Concept Extrapolation: A Conceptual Primer

no code implementations · 19 Jun 2023 · Matija Franklin, Rebecca Gorman, Hal Ashton, Stuart Armstrong

This article is a primer on concept extrapolation - the ability to take a concept, a feature, or a goal that is defined in one context and extrapolate it safely to a more general context.

Recognising the importance of preference change: A call for a coordinated multidisciplinary research effort in the age of AI

no code implementations · 20 Mar 2022 · Matija Franklin, Hal Ashton, Rebecca Gorman, Stuart Armstrong

We operationalize preference to incorporate concepts from various disciplines, outlining the importance of meta-preferences and preference-change preferences, and proposing a preliminary framework for how preferences change.

Recommendation Systems

The dangers in algorithms learning humans' values and irrationalities

no code implementations · 28 Feb 2022 · Rebecca Gorman, Stuart Armstrong

For an artificial intelligence (AI) to be aligned with human values (or human preferences), it must first learn those values.
