Search Results for author: Tomi Silander

Found 9 papers, 0 papers with code

Controlling the Solo12 Quadruped Robot with Deep Reinforcement Learning

no code implementations • 2 Aug 2023 • Michel Aractingi, Pierre-Alexandre Léziart, Thomas Flayols, Julien Perez, Tomi Silander, Philippe Souères

We detail the learning procedure and method for transfer on the real robot.

reinforcement-learning

Paper
Add Code

Learning Synthetic to Real Transfer for Localization and Navigational Tasks

no code implementations • 20 Nov 2020 • Maxime Pietrantoni, Boris Chidlovskii, Tomi Silander

The navigation pipeline is decomposed as a localization module, a planning module and a local navigation module.

Autonomous Navigation Image Retrieval +2

Paper
Add Code

Improving the Generalization of Visual Navigation Policies using Invariance Regularization

no code implementations • ICLR 2020 • Michel Aractingi, Christopher Dance, Julien Perez, Tomi Silander

The results of this method, called invariance regularization, show an improvement in the generalization of policies to environments not seen during training.

Reinforcement Learning (RL) Visual Navigation

Paper
Add Code

DEEP ADVERSARIAL FORWARD MODEL

no code implementations • 27 Sep 2018 • Morgan Funtowicz, Tomi Silander, Arnaud Sors, Julien Perez

More precisely, our forward model is trained to produce realistic observations of the future while a discriminator model is trained to distinguish between real images and the model’s prediction of the future.

Image Generation Reinforcement Learning (RL)

Paper
Add Code

Contextual memory bandit for pro-active dialog engagement

no code implementations • ICLR 2018 • julien perez, Tomi Silander

In this paper, we propose to introduce the paradigm of contextual bandits as framework for pro-active dialog systems.

Multi-Armed Bandits

Paper
Add Code

Non-Markovian Control with Gated End-to-End Memory Policy Networks

no code implementations • 31 May 2017 • Julien Perez, Tomi Silander

In this paper, we explore the use of a recently proposed attention-based model, the Gated End-to-End Memory Network, for sequential control.

OpenAI Gym

Paper
Add Code

Optimal Policies for Observing Time Series and Related Restless Bandit Problems

no code implementations • 29 Mar 2017 • Christopher R. Dance, Tomi Silander

We discuss computation of that index, give closed-form formulae for it, and compare the performance of the associated index policy with heuristic policies.

Time Series Time Series Analysis

Paper
Add Code

When are Kalman-filter restless bandits indexable?

no code implementations • NeurIPS 2015 • Christopher R. Dance, Tomi Silander

We study the restless bandit associated with an extremely simple scalar Kalman filter model in discrete time.

Paper
Add Code

Transferring Expectations in Model-based Reinforcement Learning

no code implementations • NeurIPS 2012 • Trung Nguyen, Tomi Silander, Tze Y. Leong

We study how to automatically select and adapt multiple abstractions or representations of the world to support model-based reinforcement learning.

Model-based Reinforcement Learning reinforcement-learning +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.