no code implementations • 11 Dec 2022 • Shuo Yang, Yang Jiao, Shaoyu Dou, Mana Zheng, Chen Zhu
The bilevel optimization is used to automatically update the hyperparameter, and the gradient of the hyperparameter is the approximate gradient based on the best response function.