Loading Events

Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning

CMSA EVENTS: CMSA NEW TECHNOLOGIES IN MATHEMATICS

When: November 6, 2024
2:00 pm - 3:00 pm
Where: CMSA, 20 Garden St, G10
Address: 20 Garden Street, Cambridge, MA 02138, United States
Speaker: Dylan Foster (Microsoft Research)

Imitation learning (IL) aims to mimic the behavior of an expert in a sequential decision making task by learning from demonstrations, and has been widely applied to robotics, autonomous driving, and autoregressive language generation. The simplest approach to IL, behavior cloning (BC), is thought to incur sample complexity with unfavorable quadratic dependence on the problem horizon, motivating a variety of different online algorithms that attain improved linear horizon dependence under stronger assumptions on the data and the learner’s access to the expert.In this talk, we revisit the apparent gap between offline and online IL from a learning-theoretic perspective, with a focus on general policy classes up to and including deep neural networks. Through a new analysis of behavior cloning with the logarithmic loss, we will show that it is possible to achieve horizon-independent sample complexity in offline IL whenever (i) the range of the cumulative payoffs is controlled, and (ii) an appropriate notion of supervised learning complexity for the policy class is controlled. When specialized to stationary policies, this implies that the gap between offline and online IL is smaller than previously thought. We will then discuss implications of this result and investigate the extent to which it bears out empirically.

Bio: Dylan Foster is a principal researcher at Microsoft Research, New York. Previously, he was a postdoctoral fellow at MIT, and received his PhD in computer science from Cornell University, advised by Karthik Sridharan. His research focuses on problems at the intersection of machine learning, AI, interactive decision making. He has received several awards for his work, including the best paper award at COLT (2019) and best student paper award at COLT (2018, 2019).


In-person and on Zoom:

https://harvard.zoom.us/j/92220006185?pwd=V3mrb4cNSbgRXtNJtRJkTvWFVhmbI5.1
Password: cmsa