no code implementations • 3 Nov 2023 • Durgesh Kalwar, Omkar Shelke, Harshad Khadilkar
We consider the inventory management problem, where the goal is to balance conflicting objectives such as availability and wastage of a large range of products in a store.
no code implementations • 3 Nov 2023 • Durgesh Kalwar, Vineeth B. S
At each step, the agent receives an observation of the function's value at a point chosen by the agent.
no code implementations • 2 Mar 2022 • Durgesh Kalwar, Omkar Shelke, Somjit Nath, Hardik Meisheri, Harshad Khadilkar
Exploration methods have been used to sample better trajectories in large environments, while auxiliary tasks have been incorporated where the reward is sparse.