no code implementations • 2 Aug 2023 • Michel Aractingi, Pierre-Alexandre Léziart, Thomas Flayols, Julien Perez, Tomi Silander, Philippe Souères
We detail the learning procedure and method for transfer on the real robot.
no code implementations • 20 Nov 2020 • Maxime Pietrantoni, Boris Chidlovskii, Tomi Silander
The navigation pipeline is decomposed as a localization module, a planning module and a local navigation module.
no code implementations • ICLR 2020 • Michel Aractingi, Christopher Dance, Julien Perez, Tomi Silander
The results of this method, called invariance regularization, show an improvement in the generalization of policies to environments not seen during training.
no code implementations • 27 Sep 2018 • Morgan Funtowicz, Tomi Silander, Arnaud Sors, Julien Perez
More precisely, our forward model is trained to produce realistic observations of the future while a discriminator model is trained to distinguish between real images and the model’s prediction of the future.
no code implementations • ICLR 2018 • julien perez, Tomi Silander
In this paper, we propose to introduce the paradigm of contextual bandits as framework for pro-active dialog systems.
no code implementations • 31 May 2017 • Julien Perez, Tomi Silander
In this paper, we explore the use of a recently proposed attention-based model, the Gated End-to-End Memory Network, for sequential control.
no code implementations • 29 Mar 2017 • Christopher R. Dance, Tomi Silander
We discuss computation of that index, give closed-form formulae for it, and compare the performance of the associated index policy with heuristic policies.
no code implementations • NeurIPS 2015 • Christopher R. Dance, Tomi Silander
We study the restless bandit associated with an extremely simple scalar Kalman filter model in discrete time.
no code implementations • NeurIPS 2012 • Trung Nguyen, Tomi Silander, Tze Y. Leong
We study how to automatically select and adapt multiple abstractions or representations of the world to support model-based reinforcement learning.
Model-based Reinforcement Learning reinforcement-learning +2