Variational Inference over Combinatorial Spaces

Since the discovery of sophisticated fully polynomial randomized algorithms for a range of #P problems [1, 2, 3], theoretical work on approximate inference in combinatorial spaces has focused on Markov chain Monte Carlo methods. Despite their strong theoretical guarantees, the slow running time of many of these randomized algorithms and their restrictive assumptions on the potentials have hindered their application to machine learning. As a result, in applications involving combinatorial spaces, simple exact models are often preferred to more complex models that require approximate inference [4]. Variational inference would appear to provide an appealing alternative, given the success of variational methods for graphical models [5]; unfortunately, it is not obvious how to develop variational approximations for combinatorial objects such as matchings, partial orders, plane partitions, and sequence alignments. We propose a new framework that extends variational inference to a wide range of combinatorial spaces. Our method is based on a simple assumption: the existence of a tractable measure factorization, which we show holds in many examples. Simulations on a range of matching models show that the algorithm is more general and empirically faster than a popular fully polynomial randomized algorithm. We also apply the framework to the problem of multiple alignment of protein sequences, obtaining state-of-the-art results on the BAliBASE dataset [6].
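To make concrete why inference over matchings is hard, the following minimal sketch computes the normalization constant and an edge marginal of a bipartite matching model by brute-force enumeration. It is an illustration only, not the variational method proposed here: the function names and the toy weight matrix are assumptions introduced for this example. The normalization constant is the matrix permanent, and the enumeration visits all n! perfect matchings, which is the cost that approximate inference aims to avoid.

```python
from itertools import permutations


def matching_partition_function(weights):
    """Sum, over all perfect matchings, of the product of edge weights.

    For a square non-negative matrix this is the permanent. The loop
    enumerates all n! permutations, so it is only usable for tiny n.
    """
    n = len(weights)
    total = 0.0
    for perm in permutations(range(n)):
        prod = 1.0
        for i, j in enumerate(perm):
            prod *= weights[i][j]
        total += prod
    return total


def edge_marginal(weights, i, j):
    """Probability that row i is matched to column j under the model
    p(matching) proportional to the product of its edge weights."""
    n = len(weights)
    z = matching_partition_function(weights)
    numerator = 0.0
    for perm in permutations(range(n)):
        if perm[i] != j:
            continue
        prod = 1.0
        for r, c in enumerate(perm):
            prod *= weights[r][c]
        numerator += prod
    return numerator / z


if __name__ == "__main__":
    toy = [[1.0, 2.0], [3.0, 4.0]]  # hypothetical toy potentials
    print(matching_partition_function(toy))
    print(edge_marginal(toy, 0, 0))
```

The exact values produced this way can serve as ground truth on small instances when checking any approximate method, whether sampling-based or variational.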

[1] Lars Eilstrup Rasmussen, et al. Approximating the Permanent: A Simple Approach, 1994, Random Struct. Algorithms.

[2] Michael I. Jordan, et al. Graphical Models, Exponential Families, and Variational Inference, 2008, Found. Trends Mach. Learn.

[3] G. Brightwell, et al. Counting linear extensions, 1991.

[4] Lior Pachter, et al. Multiple alignment by sequence annealing, 2007, Bioinform.

[5] Michael Chertkov, et al. Belief propagation and loop calculus for the permanent of a non-negative matrix, 2009, ArXiv.

[6] Ben Taskar, et al. Alignment by Agreement, 2006, NAACL.

[7] D. Haussler, et al. Phylogenetic estimation of context-dependent substitution rates by maximum likelihood, 2003, Molecular biology and evolution.

[8] Daniel Tarlow, et al. Using Combinatorial Optimization within Max-Product Belief Propagation, 2006, NIPS.

[9] W. Freeman, et al. Generalized Belief Propagation, 2000, NIPS.

[10] D. Higgins, et al. Clustal: A Package for Performing Multiple Sequence Alignment on a Minicomputer, 2022.

[11] Bart Selman, et al. Sparse Message Passing Algorithms for Weighted Maximum Satisfiability, 2007.

[12] D. Wilson. Mixing times of lozenge tiling and card shuffling Markov chains, 2001, math/0102193.

[13] David A. Smith, et al. Dependency Parsing by Belief Propagation, 2008, EMNLP.

[14] Richard M. Karp, et al. Reducibility Among Combinatorial Problems, 1972, 50 Years of Integer Programming.

[15] Carsten Peterson, et al. A Mean Field Theory Learning Algorithm for Neural Networks, 1987, Complex Syst.

[16] Chuong B. Do, et al. ProbCons: Probabilistic consistency-based multiple sequence alignment, 2005, Genome research.

[17] Ben Taskar, et al. Max-Margin Parsing, 2004, EMNLP.

[18] Eric Vigoda, et al. A polynomial-time approximation algorithm for the permanent of a matrix with non-negative entries, 2001, STOC '01.

[19] Michael I. Jordan, et al. Graphical Models, Exponential Families, and Variational Inference, 2008.

[20] Dan Klein, et al. Joint Parsing and Alignment with Weakly Synchronized Grammars, 2010, NAACL.

[21] Olivier Poch, et al. BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs, 1999, Bioinform.

[22] David B. Searls, et al. Automata-Theoretic Models of Mutation and Alignment, 1995, ISMB.

[23] Ben Taskar, et al. A Discriminative Matching Approach to Word Alignment, 2005, HLT.

[24] Leslie G. Valiant, et al. The Complexity of Computing the Permanent, 1979, Theor. Comput. Sci.

[25] Michael I. Jordan, et al. Optimization of Structured Mean Field Objectives, 2009, UAI.

[26] Bert Huang, et al. Approximating the Permanent with Belief Propagation, 2009, ArXiv.

[27] Martin J. Wainwright, et al. Tree-reweighted belief propagation algorithms and approximate ML estimation by pseudo-moment matching, 2003, AISTATS.

[28] M. Droste, et al. Handbook of Weighted Automata, 2009.

[29] L. Khachiyan, et al. On the conductance of order Markov chains, 1991.

[30] Harold W. Kuhn, et al. The Hungarian method for the assignment problem, 1955, 50 Years of Integer Programming.

[31] Dan Klein, et al. Agreement-Based Learning, 2007, NIPS.