Search Results for author: Shivam Singhal

Found 2 papers, 1 papers with code

Preventing Reward Hacking with Occupancy Measure Regularization

1 code implementation5 Mar 2024 Cassidy Laidlaw, Shivam Singhal, Anca Dragan

Thus, we propose regularizing based on the OM divergence between policies instead of AD divergence to prevent reward hacking.

Desk Organization: Effect of Multimodal Inputs on Spatial Relational Learning

no code implementations3 Aug 2021 Ryan Rowe, Shivam Singhal, Daqing Yi, Tapomayukh Bhattacharjee, Siddhartha S. Srinivasa

We examine the problem of desk organization: learning how humans spatially position different objects on a planar surface according to organizational ''preference''.

Position Relational Reasoning

Cannot find the paper you are looking for? You can Submit a new open access paper.