no code implementations • ACL 2022 • Govardana Sachithanandam Ramachandran, Kazuma Hashimoto, Caiming Xiong
Furthermore, we demonstrate sample efficiency: our method, trained on only 20% of the data, is comparable to the current state-of-the-art method trained on 100% of the data on two out of three evaluation metrics.
1 code implementation • 10 Mar 2021 • Govardana Sachithanandam Ramachandran, Kazuma Hashimoto, Caiming Xiong
This method gives guarantees on the dialogue policy's performance and also learns to shape rewards according to the intentions behind human responses, rather than just mimicking demonstration data; this, coupled with batch RL, improves the overall sample efficiency of the framework.
1 code implementation • 7 Dec 2020 • Govardana Sachithanandam Ramachandran, Ivan Brugere, Lav R. Varshney, Caiming Xiong
Similarly, social networks within universities and organizations may enable certain groups to more easily access people with valuable information or influence.
no code implementations • 30 Jun 2019 • Niranjan Balachandar, Justin Dieter, Govardana Sachithanandam Ramachandran
We train and evaluate our multi-agent methods against a team operating with a smart hand-coded policy.
2 code implementations • 11 Mar 2017 • Govardana Sachithanandam Ramachandran, Ajay Sohmshetty
We propose extensions to the Dynamic Memory Network (DMN), specifically within the attention mechanism; we call the resulting neural architecture the Dynamic Memory Tensor Network (DMTN).