Combinatorial Topic Models using Small-Variance Asymptotics

Topic models have emerged as fundamental tools in unsupervised machine learning. Most modern topic modeling algorithms take a probabilistic view and derive inference algorithms based on Latent Dirichlet Allocation (LDA) or its variants. In contrast, we study topic modeling as a combinatorial optimization problem, and propose a new objective function derived from LDA by passing to the small-variance limit. We minimize the derived objective using ideas from combinatorial optimization, yielding a new, fast, and high-quality topic modeling algorithm. In particular, we show that our results are competitive with popular LDA-based topic modeling approaches, and we discuss the (dis)similarities between our approach and its probabilistic counterparts.
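To make the small-variance recipe concrete, the sketch below applies the same idea in its simplest form: as the variance (equivalently, the temperature) of a mixture model shrinks to zero, probabilistic inference collapses into hard assignments, and the posterior objective becomes a k-means-style alternating minimization under a KL (Bregman) divergence. This is a minimal illustrative analogue, assuming a fixed number of topics and a plain multinomial mixture; it is not the paper's actual LDA-derived combinatorial objective or algorithm. The function names, the synthetic corpus, and the pooled-count topic update are all choices made for the example.

```python
# Illustrative sketch only: the generic small-variance-asymptotics (SVA)
# pattern on a multinomial mixture of documents -- hard assignment under a
# KL (Bregman) divergence, alternating with a closed-form topic update.
# NOT the paper's LDA-derived objective; all names here are hypothetical.
import numpy as np

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for probability vectors, smoothed to avoid log(0)."""
    p = p + eps
    q = q + eps
    return np.sum(p * (np.log(p) - np.log(q)))

def sva_hard_topic_clustering(counts, n_topics, n_iters=50, seed=0):
    """Hard-assign each document to one 'topic' (a word distribution).

    counts: (n_docs, vocab) array of word counts.
    Alternates between (1) assigning each document to the nearest topic in
    KL divergence and (2) re-estimating each topic as the normalized pooled
    word counts of its assigned documents (a length-weighted mean step, in
    the spirit of Bregman k-means).
    """
    rng = np.random.default_rng(seed)
    n_docs, _ = counts.shape
    doc_dists = counts / counts.sum(axis=1, keepdims=True)
    # Initialize topics from randomly chosen documents.
    topics = doc_dists[rng.choice(n_docs, size=n_topics, replace=False)]
    assign = np.full(n_docs, -1)  # sentinel so the first pass never converges
    for _ in range(n_iters):
        # Assignment step: nearest topic under KL divergence.
        new_assign = np.array([
            np.argmin([kl_divergence(doc_dists[d], topics[k])
                       for k in range(n_topics)])
            for d in range(n_docs)
        ])
        if np.array_equal(new_assign, assign):
            break  # hard assignments stable -> local optimum reached
        assign = new_assign
        # Update step: each topic becomes the pooled empirical distribution.
        for k in range(n_topics):
            members = counts[assign == k]
            if len(members) > 0:
                pooled = members.sum(axis=0)
                topics[k] = pooled / pooled.sum()
    return assign, topics

if __name__ == "__main__":
    # Tiny synthetic corpus: two word-usage patterns over a 6-word vocabulary.
    rng = np.random.default_rng(1)
    topic_a = np.array([0.4, 0.3, 0.2, 0.05, 0.03, 0.02])
    topic_b = np.array([0.02, 0.03, 0.05, 0.2, 0.3, 0.4])
    docs = np.vstack([rng.multinomial(100, topic_a, size=10),
                      rng.multinomial(100, topic_b, size=10)])
    assign, topics = sva_hard_topic_clustering(docs.astype(float), n_topics=2)
    print("assignments:", assign)
```

The same alternation (a combinatorial hard-assignment step followed by a closed-form update) is the template that small-variance asymptotics produces; for LDA the per-document topic assignments make the resulting objective richer and harder to optimize, which is where the paper's combinatorial-optimization machinery comes in.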
