Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Matthew W. Hoffman | Martin A. Riedmiller | Jost Tobias Springenberg | N. Heess | Siqi Liu | A. Abdolmaleki | Arunkumar Byravan | Bobak Shahriari | A. Friesen
[1] Daniel Wontae Nam,et al. GMAC: A Distributional Perspective on Actor-Critic Framework , 2021, ICML.
[2] Svetha Venkatesh,et al. Distributional Reinforcement Learning via Moment Matching , 2020, AAAI.
[3] Sergio Gomez Colmenarejo,et al. Acme: A Research Framework for Distributed Reinforcement Learning , 2020, ArXiv.
[4] Kyungjae Lee,et al. Distributional Deep Reinforcement Learning with a Mixture of Gaussians , 2019, 2019 International Conference on Robotics and Automation (ICRA).
[5] Yuval Tassa,et al. Relative Entropy Regularized Policy Iteration , 2018, ArXiv.
[6] Rémi Munos,et al. Implicit Quantile Networks for Distributional Reinforcement Learning , 2018, ICML.
[7] Yee Whye Teh,et al. An Analysis of Categorical Distributional Reinforcement Learning , 2018, AISTATS.
[8] Matthew W. Hoffman,et al. Distributed Distributional Deterministic Policy Gradients , 2018, ICLR.
[9] Yuval Tassa,et al. Maximum a Posteriori Policy Optimisation , 2018, ICLR.
[10] Marc G. Bellemare,et al. Distributional Reinforcement Learning with Quantile Regression , 2017, AAAI.
[11] Marc G. Bellemare,et al. A Distributional Perspective on Reinforcement Learning , 2017, ICML.
[12] Marc G. Bellemare,et al. The Cramer Distance as a Solution to Biased Wasserstein Gradients , 2017, ArXiv.
[13] David Silver,et al. Learning values across many orders of magnitude , 2016, NIPS.
[14] Shie Mannor,et al. Learning the Variance of the Reward-To-Go , 2016, J. Mach. Learn. Res.
[15] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.
[16] Mohammad Ghavamzadeh,et al. Actor-Critic Algorithms for Risk-Sensitive MDPs , 2013, NIPS.
[17] Bernhard Schölkopf,et al. A Kernel Two-Sample Test , 2012, J. Mach. Learn. Res.
[18] Eduardo F. Morales,et al. An Introduction to Reinforcement Learning , 2011.
[19] Masashi Sugiyama,et al. Parametric Return Density Estimation for Reinforcement Learning , 2010, UAI.