暂无分享,去创建一个
[1] Jaime F. Fisac,et al. A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems , 2017, IEEE Transactions on Automatic Control.
[2] Tomás Svoboda,et al. Safe Exploration Techniques for Reinforcement Learning - An Overview , 2014, MESAS.
[3] Tom Schaul,et al. Deep Q-learning From Demonstrations , 2017, AAAI.
[4] Sergey Levine,et al. Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning , 2017, ICLR.
[5] Ralph Neuneier,et al. Risk-Sensitive Reinforcement Learning , 1998, Machine Learning.
[6] Daan Wierstra,et al. Variational Intrinsic Control , 2016, ICLR.
[7] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[8] John Schulman,et al. Concrete Problems in AI Safety , 2016, ArXiv.
[9] Peter Stone,et al. Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces , 2017, AAAI.
[10] Jessica Taylor,et al. Alignment for Advanced Machine Learning Systems , 2020, Ethics of Artificial Intelligence.
[11] Christoph Salge,et al. Empowerment - an Introduction , 2013, ArXiv.
[12] Shie Mannor,et al. Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach , 2015, NIPS.
[13] Anca D. Dragan,et al. Cooperative Inverse Reinforcement Learning , 2016, NIPS.
[14] Shane Legg,et al. Deep Reinforcement Learning from Human Preferences , 2017, NIPS.
[15] Javier García,et al. A comprehensive survey on safe reinforcement learning , 2015, J. Mach. Learn. Res..
[16] Owain Evans,et al. Trial without Error: Towards Safe Reinforcement Learning via Human Intervention , 2017, AAMAS.
[17] Shakir Mohamed,et al. Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning , 2015, NIPS.
[18] Yuval Tassa,et al. Safe Exploration in Continuous Action Spaces , 2018, ArXiv.
[19] Anca D. Dragan,et al. Inverse Reward Design , 2017, NIPS.
[20] Andreas Krause,et al. Safe Exploration in Finite Markov Decision Processes with Gaussian Processes , 2016, NIPS.
[21] Chrystopher L. Nehaniv,et al. All Else Being Equal Be Empowered , 2005, ECAL.
[22] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[23] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[24] Pieter Abbeel,et al. Safe Exploration in Markov Decision Processes , 2012, ICML.
[25] Jianfeng Gao,et al. Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear , 2016, ArXiv.
[26] Alexandre M. Bayen,et al. A time-dependent Hamilton-Jacobi formulation of reachable sets for continuous dynamic games , 2005, IEEE Transactions on Automatic Control.
[27] Laurent Orseau,et al. AI Safety Gridworlds , 2017, ArXiv.
[28] Stuart Armstrong,et al. Low Impact Artificial Intelligences , 2017, ArXiv.
[29] Claire J. Tomlin,et al. Guaranteed Safe Online Learning via Reachability: tracking a ground target using a quadrotor , 2012, 2012 IEEE International Conference on Robotics and Automation.