论文信息 - Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution - 字舞流文

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution

Jose A. Arjona-Medina | Vihang P. Patil | S. Hochreiter | Johannes Brandstetter | M. Hofmarcher | Marius-Constantin Dinu | Matthias Dorfer | P. Blies | Vihang Patil | Sepp Hochreiter