论文信息 - Mixed Memory Markov Models: Decomposing Complex Stochastic Processes as Mixtures of Simpler Ones

Mixed Memory Markov Models: Decomposing Complex Stochastic Processes as Mixtures of Simpler Ones

We study Markov models whose state spaces arise from the Cartesian product of two or more discrete random variables. We show how to parameterize the transition matrices of these models as a convex combination—or mixture—of simpler dynamical models. The parameters in these models admit a simple probabilistic interpretation and can be fitted iteratively by an Expectation-Maximization (EM) procedure. We derive a set of generalized Baum-Welch updates for factorial hidden Markov models that make use of this parameterization. We also describe a simple iterative procedure for approximately computing the statistics of the hidden states. Throughout, we give examples where mixed memory models provide a useful representation of complex stochastic processes.

Michael I. Jordan | Lawrence K. Saul | L. Saul

[1] L. Baum,et al. An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .

[2] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[3] A. Nadas,et al. Estimation of probabilities in the language model of the IBM speech recognition system , 1984 .

[4] Anne Lohrli. Chapman and Hall , 1985 .

[5] A. Raftery. A model for high-order Markov chains , 1985 .

[6] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[7] Keiji Kanazawa,et al. A model for reasoning about persistence and causation , 1989 .

[8] Geoffrey E. Hinton,et al. Mean field networks that learn to discriminate temporally distorted strings , 1991 .

[9] David S. Touretzky,et al. Connectionist models : proceedings of the 1990 summer school , 1991 .

[10] Junji Yamato,et al. Recognizing human action in time-sequential images using hidden Markov model , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11] D. Haussler,et al. Protein modeling using hidden Markov models: analysis of globins , 1993, [1993] Proceedings of the Twenty-sixth Hawaii International Conference on System Sciences.