论文信息 - On Robustness / Performance Tradeoffs in Linear Programming and Markov Decision Processes

On Robustness / Performance Tradeoffs in Linear Programming and Markov Decision Processes

Computation of a satisfactory policy for a decision problem when the parameters of the model are uncertain is a problem encountered in many applications. The traditional robust approach is based on a worst-case analysis and may lead to overly conservative solutions. In this paper we directly quantify the robustness to uncertainty and consider the tradeoff between the nominal performance and robustness measures. Optimization in both linear programming and Markov decision processes is discussed. For linear programming we consider the tradeoff between the nominal cost of a solution and a robustness measure that quantifies the magnitude of constraint violation under the most adversarial parameters. We propose an algorithm that computes the whole set of Pareto efficient solutions based on parametric linear programming. For Markov decision processes, we consider the tradeoff between the performance under nominal parameters and the performance under adversarial parameters. For the special case where only the rewards are uncertain, we propose an algorithm that computes the whole set of Pareto efficient policies in a single pass.

Shie Mannor | Huan Xu

[1] Matthias Heger,et al. Consideration of Risk in Reinforcement Learning , 1994, ICML.

[2] Robert J. Vanderbei,et al. Robust Optimization of Large-Scale Systems , 1995, Oper. Res..

[3] John N. Tsitsiklis,et al. Introduction to linear optimization , 1997, Athena scientific optimization and computation series.

[4] E. Polak,et al. On Multicriteria Optimization , 1976 .

[5] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[6] Melvyn Sim,et al. The Price of Robustness , 2004, Oper. Res..

[7] Chelsea C. White,et al. Markov Decision Processes with Imprecise Transition Probabilities , 1994, Oper. Res..

[8] Andrew Y. Ng,et al. Solving Uncertain Markov Decision Processes , 2001 .

[9] Peter Geibel,et al. Reinforcement Learning with Bounded Risk , 2001, ICML.

[10] Yuval Rabani,et al. Linear Programming , 2007, Handbook of Approximation Algorithms and Metaheuristics.

[11] Allen L. Soyster,et al. Technical Note - Convex Programming with Set-Inclusive Constraints and Applications to Inexact Linear Programming , 1973, Oper. Res..

[12] Arkadi Nemirovski,et al. Robust solutions of uncertain linear programs , 1999, Oper. Res. Lett..

[13] Laurent El Ghaoui,et al. Robust Control of Markov Decision Processes with Uncertain Transition Matrices , 2005, Oper. Res..