no code implementations • 4 Sep 2020 • Seiji Ishihara, Harukazu Igarashi
As experimental results of an application of our method on speed control of an automobile, it was confirmed that the proposed method has the effect of suppressing the undesirable fluctuation in time-series of the output value.
no code implementations • 30 Jan 2019 • Harukazu Igarashi, Yuichi Morioka, Kazumasa Yamamoto
In our new proposals, evaluation functions are learned by Monte Carlo sampling, which is performed with the backup policy in the search tree produced by Monte Carlo Softmax Search.