P. Thomas
发表
Marc G. Bellemare,
Rémi Munos,
Philip S. Thomas,
2015,
AAAI.
Philip Thomas,
P. Thomas,
2014,
ICML.
P. Thomas,
G. Konidaris,
2011
.
Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces
pdf
Bo Liu,
Ji Liu,
Sridhar Mahadevan,
2014,
ArXiv.
Andrew G. Barto,
Philip S. Thomas,
A. Barto,
2012,
2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL).
Philip S. Thomas,
P. Thomas,
2011,
NIPS.
Andrew G. Barto,
Philip S. Thomas,
A. Barto,
2011,
ICML.
Philip S. Thomas,
Emma Brunskill,
P. Thomas,
2016,
ICML.
George Konidaris,
Philip S. Thomas,
Sarah Osentoski,
2011,
AAAI.
Philip S. Thomas,
Mohammad Ghavamzadeh,
Georgios Theocharous,
2015,
IJCAI.
Philip S. Thomas,
Mohammad Ghavamzadeh,
Georgios Theocharous,
2015,
AAAI.
Philip S. Thomas,
Kathleen M. Jagodnik,
Robert F. Kirsch,
2017,
IEEE Transactions on Neural Systems and Rehabilitation Engineering.
Philip S. Thomas,
Kathleen M. Jagodnik,
Antonie J. van den Bogert,
2009,
IAAI.
Philip S. Thomas,
Mohammad Ghavamzadeh,
Georgios Theocharous,
2015,
WWW.
Philip S. Thomas,
Mohammad Ghavamzadeh,
Georgios Theocharous,
2017,
AAAI.
Philip S. Thomas,
Georgios Theocharous,
James Kostas,
2019,
ICML.
Bruno Castro da Silva,
Andrew G. Barto,
Philip S. Thomas,
2017,
ArXiv.
Philip S. Thomas,
Christoph Dann,
Emma Brunskill,
2018,
ICML.
P. Thomas,
2019
.
Critic,
P. Thomas,
Saket Tiwari,
2018,
AAAI.
Scott M. Jordan,
P. Thomas,
Yash Chandak,
2020,
ICML.
Kathleen M. Jagodnik,
P. Thomas,
M. Branicky,
2022
.
Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines
pdf
Philip S. Thomas,
Emma Brunskill,
P. Thomas,
2017,
ArXiv.
P. Thomas,
S. Zilberstein,
Abhinav Bhatia,
2022,
ArXiv.
Scott Niekum,
George Konidaris,
Philip S. Thomas,
2015,
NIPS 2015.
P. Thomas,
Michael Norrish,
Jared Yeager,
2022,
ITP.
David M. Bossens,
P. Thomas,
2022,
2212.03932.
Philip S. Thomas,
Pinar Ozisik,
P. Thomas,
2020,
NeurIPS.
Bruno Castro da Silva,
Scott Niekum,
Philip S. Thomas,
2021,
NeurIPS.
Philip S. Thomas,
Christoph Dann,
Emma Brunskill,
2017,
ArXiv.
Philip S. Thomas,
Erik G. Learned-Miller,
P. Thomas,
2019,
ICML.
Yuriy Brun,
Ari Kobren,
Philip S. Thomas,
2019,
NeurIPS.
Yuriy Brun,
Philip S Thomas,
Bruno Castro da Silva,
2019,
Science.
Philip S. Thomas,
Georgios Theocharous,
Chris Nota,
2019,
AAAI.
Philip S. Thomas,
Frits de Nijs,
Georgios Theocharous,
2020,
ArXiv.
P. Thomas,
S. Niekum,
Yuriy Brun,
2022,
ICLR.
Philip S. Thomas,
Georgios Theocharous,
Yash Chandak,
2019,
AAAI.
Sridhar Mahadevan,
Philip S. Thomas,
Georgios Theocharous,
2020,
ICML.
Philip S. Thomas,
Emma Brunskill,
Shayan Doroudi,
2017,
UAI.
P. Thomas,
Yash Chandak,
Shiv Shankar,
2023,
arXiv.org.
P. Thomas,
Georgios Theocharous,
James Kostas,
2021,
arXiv.org.
Philip S. Thomas,
Chris Nota,
James E. Kostas,
2019,
ICML.
Philip S. Thomas,
P. Thomas,
2015,
ArXiv.
Philip S. Thomas,
Emma Brunskill,
Zhaohan Guo,
2017,
NIPS.
Philip S. Thomas,
Mohammad Ghavamzadeh,
Georgios Theocharous,
2015,
ICML.
Philip S. Thomas,
Shiv Shankar,
Yash Chandak,
2021,
AAAI.
Philip S. Thomas,
Francisco M. Garcia,
P. Thomas,
2019,
AAMAS.
Scott M. Jordan,
P. Thomas,
Martha White,
2020,
NeurIPS.
Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs
pdf
Romain Laroche,
Joelle Pineau,
Philip S. Thomas,
2021,
NeurIPS.
James E. Kostas,
P. Thomas,
Yuriy Brun,
2023,
2023 IEEE/ACM 45th International Conference on Software Engineering: Companion Proceedings (ICSE-Companion).
P. Thomas,
Yuriy Brun,
B. C. Silva,
2022,
arXiv.org.
Nathaniel D. Bastian,
P. Thomas,
Yash Chandak,
2023,
NeurIPS.
Scott Niekum,
Philip S. Thomas,
Stephen Giguere,
2021,
NeurIPS.
Philip S. Thomas,
Chris Nota,
P. Thomas,
2019,
AAMAS.
Philip S. Thomas,
Georgios Theocharous,
Scott M. Jordan,
2021,
ICML.
Scott M. Jordan,
James E. Kostas,
P. Thomas,
2023,
ArXiv.
David M. Bossens,
P. Thomas,
2022,
arXiv.org.
Philip S. Thomas,
Chris Nota,
Francisco M. Garcia,
2020,
ArXiv.
Philip S. Thomas,
James Kostas,
Chris Nota,
2019
.
Philip S. Thomas,
Francisco M. Garcia,
Bruno C. da Silva,
2017,
AAMAS.
Philip S. Thomas,
Emma Brunskill,
Zhaohan Daniel Guo,
2017,
ArXiv.
Sridhar Mahadevan,
Philip S. Thomas,
Stephen Giguere,
2013,
NIPS.
Josiah P. Hanna,
P. Thomas,
P. Stone,
2017,
International Conference on Machine Learning.
Philip S. Thomas,
Kathleen M. Jagodnik,
Antonie J. van den Bogert,
2016,
IEEE Transactions on Human-Machine Systems.
P. Thomas,
S. Niekum,
Yuriy Brun,
2022,
ICLR.
P. Thomas,
Abhishek Sharma,
R. Kozma,
2019,
ArXiv.
Hananel Hazan,
Robert Kozma,
Philip S. Thomas,
2019,
1910.06489.
Scott Niekum,
George Konidaris,
Philip S. Thomas,
2011,
NIPS.
Kathleen M. Jagodnik,
Philip S Thomas,
Michael Branicky,
2008,
The ... Yale Workshop on Adaptive and Learning Systems.
Philip S. Thomas,
William Dabney,
P. Thomas,
2014,
AAAI.
Philip Thomas,
P. Thomas,
2014,
ICML.
Philip S. Thomas,
Emma Brunskill,
P. Thomas,
2016,
AAAI.
R. S. Sutton,
P. S. Thomas,
R. Sutton,
2017
.