Search Results for author: Peter Chatain

Found 3 papers, 1 papers with code

Markovian Agents for Truthful Language Modeling

no code implementations29 Apr 2024 Scott Viteri, Max Lamparth, Peter Chatain, Clark Barrett

We formalize the idea that the truthfulness of a sender to a receiver LM is the degree to which the sender helps the receiver predict their future observations.

Language Modelling

SuperHF: Supervised Iterative Learning from Human Feedback

1 code implementation25 Oct 2023 Gabriel Mukobi, Peter Chatain, Su Fong, Robert Windesheim, Gitta Kutyniok, Kush Bhatia, Silas Alberti

Here, we focus on two prevalent methods used to align these models, Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF).

Language Modelling

Do Neural Networks Generalize from Self-Averaging Sub-classifiers in the Same Way As Adaptive Boosting?

no code implementations14 Feb 2023 Michael Sun, Peter Chatain

In recent years, neural networks (NNs) have made giant leaps in a wide variety of domains.

Cannot find the paper you are looking for? You can Submit a new open access paper.