Talks

Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature
RL Theory Virtual seminar (video, slides)