Search Results for author: Gavin Leech

Found 6 papers, 1 papers with code

Ten Hard Problems in Artificial Intelligence We Must Get Right

no code implementations6 Feb 2024 Gavin Leech, Simson Garfinkel, Misha Yagudin, Alexander Briand, Aleksandr Zhuravlev

We explore the AI2050 "hard problems" that block the promise of AI and cause AI risks: (1) developing general capabilities of the systems; (2) assuring the performance of AI systems and their training processes; (3) aligning system goals with human goals; (4) enabling great applications of AI in real life; (5) addressing economic disruptions; (6) ensuring the participation of all; (7) at the same time ensuring socially responsible deployment; (8) addressing any geopolitical disruptions that AI causes; (9) promoting sound governance of the technology; and (10) managing the philosophical disruptions for humans living in the age of AI.

Activation Addition: Steering Language Models Without Optimization

1 code implementation20 Aug 2023 Alexander Matt Turner, Lisa Thiergart, David Udell, Gavin Leech, Ulisse Mini, Monte MacDiarmid

We demonstrate ActAdd on GPT-2 on OpenWebText and ConceptNet, and replicate the effect on Llama-13B and GPT-J-6B.

Prompt Engineering

Massively Parallel Reweighted Wake-Sleep

no code implementations18 May 2023 Thomas Heap, Gavin Leech, Laurence Aitchison

Attaining such a large number of importance samples is intractable in all but the smallest models.

Bayesian Inference

Decision trees compensate for model misspecification

no code implementations8 Feb 2023 Hugh Panton, Gavin Leech, Laurence Aitchison

These perform well in the presence of complex interactions, with tree depth governing the order of interactions.

Legally grounded fairness objectives

no code implementations24 Sep 2020 Dylan Holden-Sim, Gavin Leech, Laurence Aitchison

Recent work has identified a number of formally incompatible operational measures for the unfairness of a machine learning (ML) system.

Fairness

Cannot find the paper you are looking for? You can Submit a new open access paper.