Search Results for author: Mathieu Rita

Countering Reward Over-optimization in LLM with Demonstration-Guided Reinforcement Learning

While Reinforcement Learning (RL) has been proven essential for tuning large language models (LLMs), it can lead to reward over-optimization (ROO).

Paper
Code

Computational modeling plays an essential role in the study of language emergence.

Paper
Add Code

Compositionality is a hallmark of human language that not only enables linguistic generalization, but also potentially facilitates acquisition.

Paper
Add Code

Lewis signaling games are a class of simple communication games for simulating the emergence of language.

Paper
Code

Previous work has shown that artificial neural agents naturally develop surprisingly non-efficient codes.

Paper
Add Code

Previous work has shown that artificial neural agents naturally develop surprisingly non-efficient codes.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.