Paper Information - Understanding the Behavior of Co-training

Understanding the Behavior of Co-training

Recently there has been significant interest in supervised learning algorithms that combine labeled and unlabeled data for text learning tasks. The co-training setting (Blum & Mitchell, 1998) applies to datasets that have a natural separation of their features into two disjoint sets. We demonstrate that when learning from labeled and unlabeled data, algorithms explicitly leveraging a natural independent split of the features outperform algorithms that do not. When a natural split does not exist, co-training algorithms that manufacture a feature split may outperform algorithms not using a split. These results help explain why co-training algorithms are both discriminative in nature and robust to the assumptions of their embedded classifiers.

Kamal Nigam | Rayid Ghani

[1] D. Rubin, et al. Maximum likelihood from incomplete data via the EM algorithm, plus discussions on the paper, 1977.

[2] David Gelperin, et al. On the Optimality of A*, 1977, Artif. Intell.

[3] W. Bruce Croft, et al. Using Probabilistic Models of Document Retrieval without Relevance Information, 1979, J. Documentation.

[4] David A. Cohn, et al. Active Learning with Statistical Models, 1996, NIPS.

[5] David Yarowsky, et al. Unsupervised Word Sense Disambiguation Rivaling Supervised Methods, 1995, ACL.

[6] Chris Buckley, et al. New Retrieval Approaches Using SMART: TREC 4, 1995, TREC.

[7] Thorsten Joachims, et al. A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization, 1997, ICML.

[8] Andrew McCallum, et al. A Comparison of Event Models for Naive Bayes Text Classification, 1998, AAAI 1998.

[9] Avrim Blum, et al. Combining Labeled and Unlabeled Data with Co-Training, 1998, COLT.

[10] David D. Lewis, et al. Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval, 1998, ECML.

[11] Kamal Nigam, et al. Employing EM in Pool-Based Active Learning for Text Classification, 1998.

[12] David Haussler, et al. Exploiting Generative Models in Discriminative Classifiers, 1998, NIPS.

[13] Yoram Singer, et al. Unsupervised Models for Named Entity Classification, 1999, EMNLP.

[14] Ellen Riloff, et al. Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping, 1999, AAAI/IAAI.

[15] Thorsten Joachims, et al. Transductive Inference for Text Classification using Support Vector Machines, 1999, ICML.