Date
Thu, 17 Jun 2021
Time
16:00 - 17:00
Speaker
HAOYANG CAO
Organisation
Alan Turing Institute

Abstract: In this work, we analyze a class of stochastic inverse optimal control problems with entropy regularization. We first characterize the set of solutions for the inverse control problem. This solution set exemplifies the issue of degeneracy in generic inverse control problems that there exist multiple reward or cost functions that can explain the displayed optimal behavior. Then we establish one resolution for the degeneracy issue by providing one additional optimal policy under a different discount factor. This resolution does not depend on any prior knowledge of the solution set. Through a simple numerical experiment with deterministic transition kernel, we demonstrate the ability of accurately extracting the cost function through our proposed resolution.

 

Joint work with Sam Cohen (Oxford) and Lukasz Szpruch (Edinburgh).

Please contact us with feedback and comments about this page. Last updated on 03 Apr 2022 01:32.