no code implementations • 21 Feb 2024 • Amit Dhurandhar, Rahul Nair, Moninder Singh, Elizabeth Daly, Karthikeyan Natesan Ramamurthy
Given a dataset of prompts and a set of LLMs, we rank the LLMs without access to any ground truth or reference responses.
no code implementations • 1 Dec 2023 • Svetoslav Nizhnichenkov, Rahul Nair, Elizabeth Daly, Brian Mac Namee
In this paper, we aim to characterise impacted cohorts when mitigation interventions are applied.
1 code implementation • 30 Aug 2023 • Jasmina Gajcin, James McCarthy, Rahul Nair, Radu Marinescu, Elizabeth Daly, Ivana Dusparic
Our approach allows the user to provide trajectory-level feedback on the agent's behavior during training, which can be integrated as a reward shaping signal in the following training iteration.
no code implementations • 29 Jul 2022 • Stefano Teso, Öznur Alkan, Wolfgang Stammer, Elizabeth Daly
Explanations have gained an increasing level of interest in the AI and Machine Learning (ML) communities in order to improve model transparency and allow users to form a mental model of a trained ML model.
no code implementations • 18 Jul 2022 • James McCarthy, Rahul Nair, Elizabeth Daly, Radu Marinescu, Ivana Dusparic
Explainability of Reinforcement Learning (RL) policies remains a challenging research problem, particularly when considering RL in a safety context.
no code implementations • 17 Dec 2021 • Jasmina Gajcin, Rahul Nair, Tejaswini Pedapati, Radu Marinescu, Elizabeth Daly, Ivana Dusparic
In complex tasks where the reward function is not straightforward and consists of a set of objectives, multiple reinforcement learning (RL) policies that perform the task adequately but employ different strategies can be trained by adjusting the impact of individual objectives on the reward function.
no code implementations • 2 Jul 2021 • Paulito P. Palmes, Akihiro Kishimoto, Radu Marinescu, Parikshit Ram, Elizabeth Daly
The pipeline optimization problem in machine learning requires simultaneous optimization of pipeline structures and parameter adaptation of their elements.
no code implementations • 15 Jun 2020 • Oznur Alkan, Elizabeth Daly
However, temporal aspects of a user profile may not always be explicitly available and so we may need to infer this information from available resources.
no code implementations • 2 Feb 2019 • Adi Botea, Christian Muise, Shubham Agarwal, Oznur Alkan, Ondrej Bajgar, Elizabeth Daly, Akihiro Kishimoto, Luis Lastras, Radu Marinescu, Josef Ondrej, Pablo Pedemonte, Miroslav Vodolan
Dialogue systems have many applications such as customer support or question answering.
no code implementations • NAACL 2018 • Léa Deleris, Francesca Bonin, Elizabeth Daly, Stéphane Deparis, Yufang Hou, Charles Jochim, Yassine Lassoued, Killian Levacher
Having an understanding of interpersonal relationships is helpful in many contexts.