Reinforcement learning with factorization
Devavrat Shah
Massachusetts Institute of Technology
August 3, 2021
A Lyapunov approach for finite-sample convergence bounds with off-policy RL
Sanjay Shakkottai
University of Texas Austin
August 3, 2021
Preference based RL with finite time guarantees
Aarti Singh
Carnegie Mellon University
August 2, 2021
Towards a Theory of Representation Learning for Reinforcement Learning
Alekh Agarwal
Microsoft
August 2, 2021
Reinforcement Learning in High Dimensional Systems (and why "reward" is not enough...)
Sham Kakade
University of Washington
August 2, 2021