Publications
Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds
Y. Feng, Z. Tang, N. Zhang, Q. Liu
ICLR, 2021
A Scavenger Hunt for Service Robots
Harel Yedidsion, Jennifer Suriadinata, Zifan Xu, Stefan Debruyn, and Peter Stone
ICRA, 2021
Trading off Accuracy for Speedup: Multiplier Bootstraps for Subgraph Counts
Qiaohui Lin, Robert Lunde, Purnamrita Sarkar
arXiv, 2021
Streaming k-PCA: Efficient guarantees for Oja's algorithm, beyond rank-one updates
De Huang, Jonathan Niles-Weed, Rachel Ward. arXiv, 2021
arXiv, 2021
Sequential Online Chore Division for Autonomous Vehicle Convoy Formation
Harel Yedidsion, Shani Alkoby, and Peter Stone
arXiv, 2021
Towards Long-Form Video Understanding
Chao-Yuan Wu, Philipp Krähenbühl
CVPR, 2021
Learning to Set Waypoints for Audio-Visual Navigation
C. Chen, S. Majumder, Z. Al-Halah, R. Gao, S. Ramakrishnan, K. Grauman
ICLR, 2021
A Robust Spectral Clustering Algorithm for Sub-Gaussian Mixture Models with Outliers
Prateek R. Srivastava, Purnamrita Sarkar, Grani A. Hanasusanto
arXiv, 2021
Stochastic Grounded Action Transformation for Robot Learning in Simulation
Siddharth Desai, Haresh Karnan, Josiah P. Hanna, Garrett Warnell, and Peter Stone
IROS, 2020
Learning Differentiable Programs with Admissible Neural Heuristics
Ameesh Shah, Eric Zhan, Jennifer J. Sun, Abhinav Verma, Yisong Yue, Swarat Chaudhuri
NeurIPS, 2020
Balancing Individual Preferences and Shared Objectives in Multiagent Reinforcement Learning
Ishan Durugkar, Elad Liebman, and Peter Stone
IJCAI, 2020
Learning Affordance Landscapes for Interaction Exploration in 3D Environments
T. Nagarajan and. K. Grauman. NeurIPS
NeurIPS, 2020
Neurosymbolic Reinforcement Learning with Formally Verified Exploration
Greg Anderson, Abhinav Verma, Isil Dillig, Swarat Chaudhuri
NeurIPS, 2020
Using Human-Inspired Signals to Disambiguate Navigational Intentions
Justin Hart, Reuth Mirsky, Xuesu Xiao, Stone Tejeda, Bonny Mahajan, Jamin Goo, Kathryn Baldauf, Sydney Owen, and Peter Stone
ICSR, 2020
Meta-learning for mixed linear regression
Weihao Kong, Raghav Somani, Zhao Song, Sham Kakade, Sewoong Oh
arXiv, 2020
The Gossiping Insert-Eliminate Algorithm for Multi-Agent Bandits
Ronshee Chawla, Abishek Sankararaman, Ayalvadi Ganesh, Sanjay Shakkottai
AISTATS, 2020
Modeling Fashion Influence from Photos
Z. Al-Halah and K. Grauman
IEEE, 2020
Reinforced Grounded Action Transformation for Sim-to-Real Transfer
Haresh Karnan, Siddharth Desai, Josiah P. Hanna, Garrett Warnell, and Peter Stone
IROS, 2020