Publications
-
Kernel and Rich Regimes in Overparametrized Models
Blake Woodworth, Suriya Gunasekar, Jason D. Lee, Edward Moroshko, Pedro Savarese, Itay Golan, Daniel Soudry, Nathan Srebro
PMLR, 2020
-
RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration
Brahma Pavse, Faraz Torabi, Josiah Hanna, Garrett Warnell, and Peter Stone
IROS, 2020
-
Reducing Sampling Error in Batch Temporal Difference Learning
Brahma Pavse, Ishan Durugkar, Josiah Hanna, and Peter Stone
ICML, 2020
-
The EMPATHIC Framework for Task Learning from Implicit Human Feedback
Yuchen Cui, Qiping Zhang, Alessandro Allievi, Peter Stone, Scott Niekum, W. Bradley Knox
CRL, 2020
-
On Sampling Error in Batch Action-Value Prediction Algorithms
Brahma S. Pavse, Josiah P. Hanna, Ishan Durugkar, and Peter Stone
NeurIPS, 2020
-
Learning and Reasoning for Robot Dialog and Navigation Tasks
Keting Lu, Shiqi Zhang, Peter Stone, and Xiaoping Chen
July, 2020
-
Deep R-Learning for Continual Area Sweeping
Rishi Shah, Yuqian Jiang, Justin Hart, and Peter Stone
IROS, 2020
-
Tracking Objects as Points
Xingyi Zhou, Vladlen Koltun, Philipp Krähenbühl
ECCV, 2020
-
Jointly Improving Parsing and Perception for Natural Language Commands through Human-Robot Dialog
Jesse Thomason, Aishwarya Padmakumar, Jivko Sinapov, Nick Walker, Yuqian Jiang, Harel Yedidsion, Justin Hart, Peter Stone, and Raymond J. Mooney
JAIR, 2020
-
Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks
Lemeng Wu, Bo Liu, Peter Stone, and Qiang Liu
NeurIPS, 2020
-
Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection
M. Ye, G. Gong, L. Nie, D. Zhou, A. Klivans, Q. Liu
ICML, 2020
-
Faster Johnson-Lindenstrauss Transforms via Kronecker Products
Ruhui Jin, Tamara G. Kolda, Rachel Ward
arXiv, 2020
-
An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch
Siddarth Desai, Ishan Durugkar, Haresh Karnan, Garrett Warnell, Josiah Hanna, and Peter Stone
NeurIPS, 2020
-
Finite-Sample Analysis of Stochastic Approximation Using Smooth Convex Envelopes
Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam
NeurIPS, 2020
-
SoundSpaces: Audio-Visual Navigation in 3D Environments
C. Chen, U. Jain, C. Schissler, S. V. Amengual Gari, Z. Al-Halah, V. Ithapu, P. Robinson, K. Grauman
ECCV, 2020
-
Stochastic Grounded Action Transformation for Robot Learning in Simulation
Siddharth Desai, Haresh Karnan, Josiah P. Hanna, Garrett Warnell, and Peter Stone
IROS, 2020
-
Learning Differentiable Programs with Admissible Neural Heuristics
Ameesh Shah, Eric Zhan, Jennifer J. Sun, Abhinav Verma, Yisong Yue, Swarat Chaudhuri
NeurIPS, 2020
-
Balancing Individual Preferences and Shared Objectives in Multiagent Reinforcement Learning
Ishan Durugkar, Elad Liebman, and Peter Stone
IJCAI, 2020