Paper information - Online learning versus offline learning

We present an off-line variant of the mistake-bound model of learning. As in the well-studied on-line model, a learner in the off-line model has to learn an unknown concept from a sequence of elements of the instance space, on which it makes “guess and test” trials. In both models, the aim of the learner is to make as few mistakes as possible. The models differ in what the learner knows beforehand: in the on-line model only the set of possible elements is known, whereas in the off-line model the learner knows the whole sequence in advance, that is, both the identity of the elements and the order in which they will be presented.
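To make the “guess and test” protocol concrete, here is a minimal Python sketch of one run in which mistakes are counted. It is an illustration under stated assumptions, not the paper's construction: the learner uses the standard halving (majority-vote) strategy, the concept class is a toy family of thresholds on the integers, and the names `halving_mistakes`, `predict`, `thresholds`, `target`, and `sequence` are hypothetical. The halving learner looks at one element at a time, so it behaves like an on-line learner; an off-line learner would additionally be allowed to inspect the whole of `sequence` before its first guess.

```python
def predict(threshold, x):
    """Toy concept (an assumption for illustration): label x positive iff x >= threshold."""
    return x >= threshold


def halving_mistakes(thresholds, target, sequence):
    """Run guess-and-test trials with the halving strategy and count mistakes.

    thresholds: finite concept class, each concept given by its threshold
    target:     the unknown concept (must be one of `thresholds`)
    sequence:   elements of the instance space, in presentation order
    """
    version_space = list(thresholds)  # concepts consistent with all feedback so far
    mistakes = 0
    for x in sequence:
        # Guess: majority vote of the consistent concepts (ties go to True).
        votes = sum(predict(t, x) for t in version_space)
        guess = 2 * votes >= len(version_space)
        # Test: the true label is revealed after the guess.
        truth = predict(target, x)
        if guess != truth:
            mistakes += 1
        # Discard every concept that disagrees with the revealed label.
        version_space = [t for t in version_space if predict(t, x) == truth]
    return mistakes


if __name__ == "__main__":
    thresholds = range(9)            # 9 concepts over the instance space {0, ..., 7}
    sequence = [3, 5, 1, 6, 0, 7, 2, 4]
    print(halving_mistakes(thresholds, target=5, sequence=sequence))
```

Each mistake removes at least half of the remaining consistent concepts, so this learner makes at most log2 of the class size mistakes on any sequence, whatever the order of presentation.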
