1 code implementation • 30 Apr 2024 • Mathieu Rita, Florian Strub, Rahma Chaabouni, Paul Michel, Emmanuel Dupoux, Olivier Pietquin
While Reinforcement Learning (RL) has been proven essential for tuning large language models (LLMs), it can lead to reward over-optimization (ROO).
no code implementations • 18 Mar 2024 • Mathieu Rita, Paul Michel, Rahma Chaabouni, Olivier Pietquin, Emmanuel Dupoux, Florian Strub
Computational modeling plays an essential role in the study of language emergence.
no code implementations • 22 May 2023 • Emily Cheng, Mathieu Rita, Thierry Poibeau
Compositionality is a hallmark of human language that not only enables linguistic generalization, but also potentially facilitates acquisition.
1 code implementation • 30 Sep 2022 • Mathieu Rita, Corentin Tallec, Paul Michel, Jean-Bastien Grill, Olivier Pietquin, Emmanuel Dupoux, Florian Strub
Lewis signaling games are a class of simple communication games for simulating the emergence of language.
no code implementations • CoNLL 2020 • Mathieu Rita, Rahma Chaabouni, Emmanuel Dupoux
Previous work has shown that artificial neural agents naturally develop surprisingly inefficient codes.
1 code implementation • 5 Oct 2020 • Mathieu Rita, Rahma Chaabouni, Emmanuel Dupoux
Previous work has shown that artificial neural agents naturally develop surprisingly inefficient codes.