Search Results for author: Jean Mercat

Found 9 papers, 3 papers with code

Residual Q-Learning: Offline and Online Policy Customization without Value

no code implementations NeurIPS 2023 Chenran Li, Chen Tang, Haruki Nishimura, Jean Mercat, Masayoshi Tomizuka, Wei Zhan

Specifically, we formulate the customization problem as a Markov Decision Process (MDP) with a reward function that combines 1) the inherent reward of the demonstration; and 2) the add-on reward specified by the downstream task.

Imitation Learning Q-Learning

RAP: Risk-Aware Prediction for Robust Planning

1 code implementation 4 Oct 2022 Haruki Nishimura, Jean Mercat, Blake Wulfe, Rowan Mcallister, Adrien Gaidon

Robust planning in interactive scenarios requires predicting the uncertain future to make risk-aware decisions.

Control-Aware Prediction Objectives for Autonomous Driving

no code implementations 28 Apr 2022 Rowan Mcallister, Blake Wulfe, Jean Mercat, Logan Ellis, Sergey Levine, Adrien Gaidon

Autonomous vehicle software is typically structured as a modular pipeline of individual components (e.g., perception, prediction, and planning) to help separate concerns into interpretable sub-tasks.

Autonomous Driving Trajectory Prediction

Dynamics-Aware Comparison of Learned Reward Functions

no code implementations ICLR 2022 Blake Wulfe, Ashwin Balakrishna, Logan Ellis, Jean Mercat, Rowan Mcallister, Adrien Gaidon

The ability to learn reward functions plays an important role in enabling the deployment of intelligent agents in the real world.

Higher Order Linear Transformer

no code implementations 28 Oct 2020 Jean Mercat

Following up on the linear transformer part of the article by Katharopoulos et al., which takes this idea from Shen et al., the trick that yields linear complexity for the attention mechanism is reused and extended to a second-order approximation of the softmax normalization.

Social Attention for Autonomous Decision-Making in Dense Traffic

no code implementations 27 Nov 2019 Edouard Leurent, Jean Mercat

We study the design of learning architectures for behavioural planning in a dense traffic setting.

Decision Making
