On-line Variance Minimization in $O(n^2)$ per Trial?

Consider the following canonical online learning problem with matrices [WK06]: in each trial $t$ the algorithm chooses a density matrix $W_t \in \mathbb{R}^{n \times n}$ (i.e., a symmetric positive semi-definite matrix with trace one). Then nature chooses a symmetric loss matrix $L_t \in \mathbb{R}^{n \times n}$ whose eigenvalues lie in the interval $[0,1]$, and the algorithm incurs loss $\operatorname{tr}(W_t L_t)$. The goal is to find algorithms that, for any sequence of trials, have small regret against the best dyad chosen in hindsight. Here a dyad is an outer product $uu^\top$ of a unit vector $u \in \mathbb{R}^n$. More precisely, the regret after $T$ trials is defined as follows:
$$\sum_{t=1}^{T} \operatorname{tr}(W_t L_t) \;-\; \min_{u \in \mathbb{R}^n :\, \|u\|_2 = 1} \sum_{t=1}^{T} u^\top L_t u.$$
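As a concrete numerical illustration of this protocol, here is a minimal sketch in Python/NumPy. It relies only on the definitions above plus the Rayleigh-quotient fact that $\min_{\|u\|_2=1} u^\top M u$ equals the smallest eigenvalue of a symmetric matrix $M$, so the loss of the best dyad in hindsight is $\lambda_{\min}\bigl(\sum_t L_t\bigr)$. The uniform strategy $W_t = I/n$ and the helper name `regret` are hypothetical placeholders for illustration, not the algorithm the open problem asks for.

```python
import numpy as np

def regret(loss_matrices, density_matrices):
    """Regret of a sequence of density matrices W_t against the
    best dyad uu^T chosen in hindsight (definition above)."""
    # Cumulative loss of the algorithm: sum_t tr(W_t L_t).
    alg_loss = sum(np.trace(W @ L)
                   for W, L in zip(density_matrices, loss_matrices))
    # The best dyad's loss is min_{||u||=1} u^T (sum_t L_t) u,
    # i.e., the smallest eigenvalue of the summed loss matrix.
    best_dyad_loss = np.linalg.eigvalsh(sum(loss_matrices))[0]
    return alg_loss - best_dyad_loss

# Toy run: random symmetric losses with eigenvalues in [0, 1],
# played against the (placeholder) uniform strategy W_t = I/n.
rng = np.random.default_rng(0)
n, T = 4, 50
losses = []
for _ in range(T):
    Q, _ = np.linalg.qr(rng.standard_normal((n, n)))   # random orthogonal Q
    losses.append(Q @ np.diag(rng.uniform(0, 1, n)) @ Q.T)
print(regret(losses, [np.eye(n) / n] * T))
```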