论文信息 - Regret-Optimal Filtering

Regret-Optimal Filtering

We consider the problem of filtering in linear state-space models (e.g., the Kalman filter setting) through the lens of regret optimization. Specifically, we study the problem of causally estimating a desired signal, generated by a linear state-space model driven by process noise, based on noisy observations of a related observation process. We define a novel regret criterion for estimator design as the difference of the estimation error energies between a clairvoyant estimator that has access to all future observations (a socalled smoother) and a causal one that only has access to current and past observations. The regret-optimal estimator is the causal estimator that minimizes the worst-case regret across all bounded-energy noise sequences. We provide a solution for the regret filtering problem at two levels. First, an horizonindependent solution at the operator level is obtained by reducing the regret to the wellknown Nehari problem. Secondly, our main result for state-space models is an explicit estimator that achieves the optimal regret. The regret-optimal estimator is represented as a finite-dimensional state-space whose parameters can be computed by solving three Riccati equations and a single Lyapunov equation. We demonstrate the applicability and efficacy of the estimator in a variety of problems and observe that the estimator has average and worst-case performances that are simultaneously close to their optimal values. The authors are with the Department of Electrical Engineering at Caltech, e-mails: {oron,hassibi}@caltech.edu. Proceedings of the 24 International Conference on Artificial Intelligence and Statistics (AISTATS) 2021, San Diego, California, USA. PMLR: Volume 130. Copyright 2021 by the author(s).

Babak Hassibi | Oron Sabag

[1] Max Simchowitz,et al. Logarithmic Regret for Adversarial Online Control , 2020, ICML.

[2] Shahin Shahrampour,et al. Regret Analysis of Distributed Online LQR Control for Unknown LTI Systems , 2021, ArXiv.

[3] Ali H. Sayed,et al. H∞ optimality of the LMS algorithm , 1996, IEEE Trans. Signal Process..

[4] Yonina C. Eldar,et al. Robust mean-squared error estimation in the presence of model uncertainties , 2005, IEEE Transactions on Signal Processing.

[5] L. Rodman,et al. Nehari Interpolation Problem , 1990 .

[6] Tomer Koren,et al. Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with √T Regret , 2021, ICML.

[7] Yonina C. Eldar,et al. A competitive minimax approach to robust estimation of random parameters , 2004, IEEE Transactions on Signal Processing.

[8] T. Kailath,et al. Indefinite-quadratic estimation and control: a unified approach to H 2 and H ∞ theories , 1999 .

[9] Z. Nehari. On Bounded Bilinear Forms , 1957 .

[10] Nevena Lazic,et al. Model-Free Linear Quadratic Control via Reduction to Expert Prediction , 2018, AISTATS.

[11] Sham M. Kakade,et al. Online Control with Adversarial Disturbances , 2019, ICML.

[12] Pramod P. Khargonekar,et al. FILTERING AND SMOOTHING IN AN H" SETTING , 1991 .

[13] Babak Hassibi,et al. Regret-optimal measurement-feedback control , 2020, L4DC.

[14] M. Grimble,et al. H∞ Robust Linear Estimator , 1990 .

[15] Sham M. Kakade,et al. The Nonstochastic Control Problem , 2020, ALT.

[16] Max Simchowitz,et al. Improper Learning for Non-Stochastic Control , 2020, COLT.

[17] Meir Feder,et al. Sequential prediction under log-loss and misspecification , 2021, COLT.

[18] U. Shaked,et al. H,-OPTIMAL ESTIMATION: A TUTORIAL , 1992 .

[19] Michael J. Grimble,et al. Solution of the H∞ optimal linear filtering problem for discrete-time systems , 1990, IEEE Trans. Acoust. Speech Signal Process..

[20] Babak Hassibi,et al. Regret-Optimal Full-Information Control , 2021, ArXiv.

[21] Kristiaan Pelckmans,et al. Worst-Case Prediction Performance Analysis of the Kalman Filter , 2016, IEEE Transactions on Automatic Control.

[22] Minyue Fu,et al. A linear matrix inequality approach to robust H∞ filtering , 1997, IEEE Trans. Signal Process..

[23] Csaba Szepesvári,et al. Regret Bounds for the Adaptive Control of Linear Quadratic Systems , 2011, COLT.

[24] Deepan Muthirayan,et al. Online Learning Robust Control of Nonlinear Dynamical Systems , 2021, ArXiv.

[25] Norbert Wiener,et al. Extrapolation, Interpolation, and Smoothing of Stationary Time Series, with Engineering Applications , 1949 .

[26] Yishay Mansour,et al. Learning Linear-Quadratic Regulators Efficiently with only $\sqrt{T}$ Regret , 2019, ICML.

[27] T. Başar,et al. A New Approach to Linear Filtering and Prediction Problems , 2001 .

[28] Yonina C. Eldar,et al. Minimax MSE-ratio estimation with signal covariance uncertainties , 2005, IEEE Transactions on Signal Processing.

[29] Babak Hassibi,et al. Regret-Optimal Controller for the Full-Information Problem , 2021, 2021 American Control Conference (ACC).