no code implementations • 12 Jul 2020 • Faruk Kucuksubasi, Elif Surer
We reached similar policy optimization performance results with the PrediNet architecture and MHDPA; additionally, we achieved to extract the propositional representation explicitly ---which makes the agent's statistical policy logic more interpretable and tractable.