Repeated Inverse Reinforcement Learning for AI Safety
暂无分享,去创建一个
[1] J. Neumann,et al. Theory of Games and Economic Behavior: 60th Anniversary Commemorative Edition , 2020 .
[2] John Schulman,et al. Concrete Problems in AI Safety , 2016, ArXiv.
[3] Christos Dimitrakakis,et al. Preference elicitation and inverse reinforcement learning , 2011, ECML/PKDD.
[4] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[5] Eyal Amir,et al. Bayesian Inverse Reinforcement Learning , 2007, IJCAI.
[6] Pieter Abbeel,et al. An Application of Reinforcement Learning to Aerobatic Helicopter Flight , 2006, NIPS.
[7] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[8] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[9] Kareem Amin,et al. Towards Resolving Unidentifiability in Inverse Reinforcement Learning , 2016, ArXiv.
[10] S. Schneider. Science fiction and philosophy : from time travel to superintelligence , 2016 .
[11] L. Lovász,et al. Geometric Algorithms and Combinatorial Optimization , 1981 .
[12] J. Andrew Bagnell,et al. Maximum margin planning , 2006, ICML.
[13] Craig Boutilier,et al. Regret-based Reward Elicitation for Markov Decision Processes , 2009, UAI.
[14] Robert E. Schapire,et al. A Game-Theoretic Approach to Apprenticeship Learning , 2007, NIPS.
[15] Stuart J. Russell,et al. Research Priorities for Robust and Beneficial Artificial Intelligence , 2015, AI Mag..
[16] Craig Boutilier,et al. Eliciting Additive Reward Functions for Markov Decision Processes , 2011, IJCAI.
[17] Craig Boutilier,et al. Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies , 2010, AAAI.
[18] Shai Shalev-Shwartz,et al. Online Learning and Online Convex Optimization , 2012, Found. Trends Mach. Learn..
[19] Anca D. Dragan,et al. Cooperative Inverse Reinforcement Learning , 2016, NIPS.
[20] Daphne Koller,et al. Making Rational Decisions Using Adaptive Utility Elicitation , 2000, AAAI/IAAI.