Celestial w1+∞ symmetries from twistor space
Adamo, T Mason, L Sharma, A Symmetry, Integrability and Geometry: Methods and Applications (SIGMA) volume 18 (08 Mar 2022)
2-Group symmetries and their classification in 6d
Apruzzi, F Bhardwaj, L Gould, D Schafer-Nameki, S SciPost Physics volume 12 (18 Mar 2022)
Snowmass Whitepaper: Physical Mathematics 2021
Bah, I Freed, D Moore, G Nekrasov, N Razamat, S Schäfer-Nameki, S (09 Mar 2022)
Video conference with David Levy, Ian Griffiths, Sam Cohen and Pete Grindrod

Background
Among the many asylum and sanctuary seekers and academics seeking refuge are mathematicians and statisticians; they are part of the global mathematical sciences community. However, when they come to the UK they may be unable to work and so feel isolated from the subject they love. They are in personal and intellectual limbo.

Convergence of a time-stepping scheme to the free boundary in the supercooled Stefan problem
Song, Z Reisinger, C Kaushansky, V Shkolnikov, M Annals of Applied Probability volume 33 issue 1 274-298 (21 Feb 2023)
Fri, 06 May 2022

14:00 - 15:00
N3.12

Once and Twice Categorified Algebra

Thibault Décoppet
(University of Oxford)
Abstract

I will explain in what sense the theory of finite tensor categories is a categorification of the theory of finite dimensional algebras. In particular, I will introduce finite module categories, review a key result of Ostrik, and present Morita theory for finite categories. I will give many examples to illustrate these ideas. Then, I will explain the elementary properties of finite braided tensor categories. If time permits, I will also mention my own work, which consists in categorifying these ideas once more!

Mon, 09 May 2022

15:30 - 16:30
L3

Exploration-exploitation trade-off for continuous-time episodic reinforcement learning with linear-convex models

LUKASZ SZPRUCH
(University of Edinburgh)
Abstract

 We develop a probabilistic framework for analysing model-based reinforcement learning in the episodic setting. We then apply it to study finite-time horizon stochastic control problems with linear dynamics but unknown coefficients and convex, but possibly irregular, objective function. Using probabilistic representations, we study regularity of the associated cost functions and establish precise estimates for the performance gap between applying optimal feedback control derived from estimated and true model parameters. We identify conditions under which this performance gap is quadratic, improving the linear performance gap in recent work [X. Guo, A. Hu, and Y. Zhang, arXiv preprint, arXiv:2104.09311, (2021)], which matches the results obtained for stochastic linear-quadratic problems. Next, we propose a phase-based learning algorithm for which we show how to optimise exploration-exploitation trade-off and achieve sublinear regrets in high probability and expectation. When assumptions needed for the quadratic performance gap hold, the algorithm achieves an order (N‾‾√lnN) high probability regret, in the general case, and an order ((lnN)2) expected regret, in self-exploration case, over N episodes, matching the best possible results from the literature. The analysis requires novel concentration inequalities for correlated continuous-time observations, which we derive.

 

-----------------------------------------------------------------------
Dr Lukasz Szpruch

Subscribe to