Search Results for author: Michael Pieler

Found 7 papers, 3 papers with code

Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning

no code implementations14 Oct 2022 Louis Castricato, Alexander Havrilla, Shahbuland Matiana, Michael Pieler, Anbang Ye, Ian Yang, Spencer Frazier, Mark Riedl

However, simply fine-tuning a generative language model with a contrastive reward model does not always reliably result in a story generation system capable of generating stories that meet user preferences.

Contrastive Learning Language Modelling +4

Few-shot Adaptation Works with UnpredicTable Data

1 code implementation1 Aug 2022 Jun Shern Chan, Michael Pieler, Jonathan Jao, Jérémy Scheurer, Ethan Perez

Finetuning on the resulting dataset leads to improved FSL performance on Natural Language Processing (NLP) tasks, but not proportionally to dataset scale.

Domain Adaptation Few-Shot Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.