论文信息 - Divergences, surrogate loss functions and experimental design

Divergences, surrogate loss functions and experimental design

In this paper, we provide a general theorem that establishes a correspondence between surrogate loss functions in classification and the family of f-divergences. Moreover, we provide constructive procedures for determining the f-divergence induced by a given surrogate loss, and conversely for finding all surrogate loss functions that realize a given f-divergence. Next we introduce the notion of universal equivalence among loss functions and corresponding f-divergences, and provide necessary and sufficient conditions for universal equivalence to hold. These ideas have applications to classification problems that also involve a component of experiment design; in particular, we leverage our results to prove consistency of a procedure for learning a classifier under decentralization requirements.

Martin J. Wainwright | Michael I. Jordan | XuanLong Nguyen

[1] S. M. Ali,et al. A General Class of Coefficients of Divergence of One Distribution from Another , 1966 .

[2] Martin J. Wainwright,et al. On divergences, surrogate loss functions, and decentralized detection , 2005, ArXiv.

[3] Flemming Topsøe,et al. Some inequalities for information divergence and related measures of discrimination , 2000, IEEE Trans. Inf. Theory.

[4] 丸山徹. Convex Analysisの二,三の進展について , 1977 .

[5] Michael I. Jordan,et al. Nonparametric decentralized detection using kernel methods , 2005, IEEE Transactions on Signal Processing.

[6] Tong Zhang. Statistical behavior and consistency of classification methods based on convex risk minimization , 2003 .

[7] John N. Tsitsiklis,et al. Extremal properties of likelihood-ratio quantizers , 1993, IEEE Trans. Commun..

[8] Ingo Steinwart,et al. Consistency of support vector machines and other regularized kernel classifiers , 2005, IEEE Transactions on Information Theory.

[9] D. Blackwell. Equivalent Comparisons of Experiments , 1953 .

[10] H. V. Poor,et al. Applications of Ali-Silvey Distance Measures in the Design of Generalized Quantizers for Binary Decision Systems , 1977, IEEE Trans. Commun..

[11] T. Kailath. The Divergence and Bhattacharyya Distance Measures in Signal Selection , 1967 .

[12] Michael I. Jordan,et al. Convexity, Classification, and Risk Bounds , 2006 .