Search Results for author: Huacong Jiang

Found 1 paper, 0 papers with code

Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent

No code implementations • 11 Oct 2021 • Weiming Liu, Huacong Jiang, Bin Li, Houqiang Li

Follow-the-Regularized-Leader (FTRL) and Online Mirror Descent (OMD) are regret minimization algorithms for Online Convex Optimization (OCO). They are mathematically elegant but less practical for solving Extensive-Form Games (EFGs).
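To make the regret-minimization setting concrete, here is a minimal sketch of OMD on the probability simplex with the negative-entropy regularizer (the multiplicative-weights update). It is not taken from the paper: the function names `omd_entropy_step` and `run_omd`, the step size `eta`, and the toy loss sequence are all assumptions chosen for illustration, and the sketch covers only a single simplex decision, not a full extensive-form game.

```python
import math


def omd_entropy_step(x, loss, eta):
    """One OMD step on the simplex with the negative-entropy regularizer.

    x    : current strategy (probabilities summing to 1)
    loss : linear loss vector observed this round
    eta  : step size (hypothetical parameter, chosen by the caller)
    """
    # Multiplicative update followed by renormalization onto the simplex.
    weights = [xi * math.exp(-eta * li) for xi, li in zip(x, loss)]
    total = sum(weights)
    return [w / total for w in weights]


def run_omd(losses, eta):
    """Play OMD against a fixed sequence of loss vectors.

    Returns the final strategy and the regret against the best
    fixed action in hindsight.
    """
    n = len(losses[0])
    x = [1.0 / n] * n  # start from the uniform strategy
    cumulative = 0.0
    for loss in losses:
        # Expected loss of the current strategy, paid before updating.
        cumulative += sum(xi * li for xi, li in zip(x, loss))
        x = omd_entropy_step(x, loss, eta)
    best_fixed = min(sum(loss[i] for loss in losses) for i in range(n))
    return x, cumulative - best_fixed


if __name__ == "__main__":
    # Toy sequence (an assumption): action 0 always costs 1, action 1 costs 0.
    toy_losses = [[1.0, 0.0]] * 50
    strategy, regret = run_omd(toy_losses, eta=0.5)
    print(strategy, regret)
```

On this toy sequence the strategy concentrates on the zero-loss action, and the regret stays bounded as the number of rounds grows.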

