Perturbation analysis for parameter estimation in continuous space HMMs

This paper is concerned with parameter identification in continuous space hidden Markov models. We provide a sensitivity analysis on a filtering density with respect to some model parameters to obtain the gradient of the log-likelihood function. The resulting estimator is combined with an efficient variance reduction technique in a stochastic gradient ascent method to identify the parameter. The main ideas consist in using a non-naive Finite Difference approach and an Infinitesimal Perturbation Analysis by defining an augmented Markov chain which allows to express the gradient of the filtering density as an expectation of a functional along this chain. This provides a general methodology that enables us to transform any algorithm that estimates the filtering density, such as interacting particle methods, into an algorithm that estimates its gradient. We propose therefore two algorithms, which are proven to be consistent and whose computational complexity is linear in the number of particles. Numerical experiments show their respective performances in terms of variance, accuracy, and computational complexity.

[1]  Johann Fichou,et al.  Particle-based methods for parameter estimation and tracking: Numerical experiments , 2004 .

[2]  Nando de Freitas,et al.  Sequential Monte Carlo Methods in Practice , 2001, Statistics for Engineering and Information Science.

[3]  R. Douc,et al.  Limit theorems for weighted samples with applications to sequential Monte Carlo methods , 2008 .

[4]  Paul Glasserman,et al.  Gradient Estimation Via Perturbation Analysis , 1990 .

[5]  R. Douc,et al.  Asymptotics of the maximum likelihood estimator for general hidden Markov models , 2001 .

[6]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[7]  Geir Storvik,et al.  Particle filters for state-space models with the presence of unknown static parameters , 2002, IEEE Trans. Signal Process..

[8]  Christophe Andrieu,et al.  Particle methods for change detection, system identification, and control , 2004, Proceedings of the IEEE.

[9]  P. Moral Feynman-Kac Formulae: Genealogical and Interacting Particle Systems with Applications , 2004 .

[10]  M. Schervish Theory of Statistics , 1995 .

[11]  Eric Moulines,et al.  On the use of particle filtering for maximum likelihood parameter estimation , 2005, 2005 13th European Signal Processing Conference.

[12]  François Le Gland,et al.  A particle implementation of the recursive MLE for partially observed diffusions , 2003 .

[13]  Eric Moulines,et al.  Comparison of resampling schemes for particle filtering , 2005, ISPA 2005. Proceedings of the 4th International Symposium on Image and Signal Processing and Analysis, 2005..

[14]  A Orman,et al.  Optimization of Stochastic Models: The Interface Between Simulation and Optimization , 2012, J. Oper. Res. Soc..

[15]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[16]  A. Doucet,et al.  Parameter estimation in general state-space models using particle methods , 2003 .

[17]  Arnaud Doucet,et al.  Particle methods for optimal filter derivative: application to parameter estimation , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[18]  Paul Glasserman,et al.  Monte Carlo Methods in Financial Engineering , 2003 .

[19]  Laurent Mevel,et al.  Exponential Forgetting and Geometric Ergodicity in Hidden Markov Models , 2000, Math. Control. Signals Syst..