Thu, 25 Apr 2024
16:00
L4

Reinforcement Learning in near-continuous time for continuous state-action spaces

Dr Lorenzo Croissant
(CEREMADE, Université Paris-Dauphine)
Further Information

Please join us for reshments outside the lecture room from 1530.

Abstract

We consider the reinforcement learning problem of controlling an unknown dynamical system to maximise the long-term average reward along a single trajectory. Most of the literature considers system interactions that occur in discrete time and discrete state-action spaces. Although this standpoint is suitable for games, it is often inadequate for systems in which interactions occur at a high frequency, if not in continuous time, or those whose state spaces are large if not inherently continuous. Perhaps the only exception is the linear quadratic framework for which results exist both in discrete and continuous time. However, its ability to handle continuous states comes with the drawback of a rigid dynamic and reward structure.

        This work aims to overcome these shortcomings by modelling interaction times with a Poisson clock of frequency $\varepsilon^{-1}$ which captures arbitrary time scales from discrete ($\varepsilon=1$) to continuous time ($\varepsilon\downarrow0$). In addition, we consider a generic reward function and model the state dynamics according to a jump process with an arbitrary transition kernel on $\mathbb{R}^d$. We show that the celebrated optimism protocol applies when the sub-tasks (learning and planning) can be performed effectively. We tackle learning by extending the eluder dimension framework and propose an approximate planning method based on a diffusive limit ($\varepsilon\downarrow0$) approximation of the jump process.

        Overall, our algorithm enjoys a regret of order $\tilde{\mathcal{O}}(\sqrt{T})$ or $\tilde{\mathcal{O}}(\varepsilon^{1/2} T+\sqrt{T})$ with the approximate planning. As the frequency of interactions blows up, the approximation error $\varepsilon^{1/2} T$ vanishes, showing that $\tilde{\mathcal{O}}(\sqrt{T})$ is attainable in near-continuous time.

Mon, 18 Mar 2024 16:15 -
Tue, 19 Mar 2024 17:00
L2

Characteristic Boundary Value Problems and Magneto-Hydrodynamics

Professor Paolo Secchi
(University of Brescia)
Further Information

This course is running as part of the National PDE Network Meeting being held in Oxford 18-21 March 2024, and jointly with the 13th Oxbridge PDE conference.

The course is broken into 3 sessions over two days, thus, with all sessions taking place in L2:

16:15-16:55:    Short Course II-1 Monday 18 March Characteristic Boundary Problems and Magneto-HydrodynamicsSECCHI-part 1_0.pdf

11:35-12:15:    Short Course II-2 Tuesday 19 March Characteristic Boundary Problems and Magneto-Hydrodynamics SECCHI-part 2.pdf

16:15-16:55:    Short Course II-3 Tuesday 19 March Characteristic Boundary Problems and Magneto-Hydrodynamics SECCHI-part 3.pdf

 

Abstract

The course aims to provide an introduction to the theory of initial boundary value problems for Friedrichs symmetrizable systems, with particular interest for the applications to the equations of ideal Magneto-Hydrodynamics (MHD). 

We first analyse different kinds of boundary conditions and present the main results about the well-posedness. In the case of the characteristic boundary, we discuss the possible loss of regularity in the normal direction to the boundary and the use of suitable anisotropic Sobolev spaces in MHD.  

Finally, we give a short introduction to the Kreiss-Lopatinskii approach and discuss a simple boundary value problem for the wave equation that may admit estimates with a loss of derivatives from the data. 

 

Pedestrian models with congestion effects
Aceves-Sanchez, P Bailo, R Degond, P Mercier, Z Mathematical Models and Methods in Applied Sciences volume 34 issue 6 1001-1041 (15 Mar 2024)
Classical non-relativistic fractons
Prakash, A Goriely, A Sondhi, S Physical Review B: Condensed Matter and Materials Physics volume 109 (27 Feb 2024)
Deciphering the diversity and sequence of extracellular matrix and cellular spatial patterns in lung adenocarcinoma using topological data analysis
Yoon, I Jenkins, R Colliver, E Zhang, H Novo, D Moore, D Ramsden, Z Rullan, A Fu, X Yuan, Y Harrington, H Swanton, C Byrne, H Sahai, E
Fri, 14 Jun 2024

14:00 - 15:00
L3

Brain mechanics in the Data era

Prof Antoine Jerusalem
(Dept of Engineering Science University of Oxford)
Abstract

In this presentation, we will review how the field of Mechanics of Materials is generally framed and see how it can benefit from and be of benefit to the current progress in AI. We will approach this problematic in the particular context of Brain mechanics with an application to traumatic brain injury in police investigations. Finally we will briefly show how our group is currently applying the same methodology to a range of engineering challenges.

Mon, 25 Mar 2024
15:00
L4

Uhlenbeck compactness theorems and isometric immersions

Professor Siran Li
(Shanghai Jiao Tong University)
Abstract

In this short course, we survey the celebrated weak and strong compactness theorems proved by Karen Uhlenbeck in 1982. These results are fundamental to the gauge theory and have found numerous applications to geometry, topology, and theoretical physics. The proof is based on the ingenious idea of putting connections into ``Uhlenbeck--Coulomb gauge'', which enables the use of standard elliptic and/or nonlinear PDE techniques, as well as involved local-to-global patching arguments. We aim at giving detailed explanation of the proof, and we shall also discuss the relation between Uhlenbeck's compactness and the classical geometric problem of isometric immersions of submanifolds into Euclidean spaces.

Fri, 07 Jun 2024

14:00 - 15:00
L3

Modeling the electromechanics of aerial electroreception

Dr Isaac Vikram Chenchiah
(School of Mathematics University of Bristol)
Abstract
Aerial electroreception is the ability of some arthropods (e.g., bees) to detect electric fields in the environment. I present an overview of our attempts to model the electromechanics of this recently discovered phenomenon and how it might contribute to the sensory biology of arthropods. This is joint work with Daniel Robert and Ryan Palmer.


 

Fri, 01 Mar 2024
12:00
L3

Motivic coaction and single-valued map of polylogarithms from zeta generators

Hadleigh Frost
(Merton College Oxford)
Abstract
The motivic coaction and single-valued map play an important role in our understanding of perturbative string theory. We use a new Lie-algebraic approach to give new formulas for the motivic coaction and single-valued map of multiple polylogarithms in any number of variables. The new formulas are computationally useful and give answers (if desired) directly in a fibration basis. Our key idea is to understand extensions of the braid algebra, that "encode" the appearance of multiple zeta values in the formulas. Speculatively, this idea could help to understand these important structures beyond genus zero.
Fri, 31 May 2024

14:00 - 15:00
L3

Cytoneme-mediated morphogenesis

Prof Paul Bressloff
(Dept of Mathematics Imperial College London)
Abstract

Morphogen protein gradients play an essential role in the spatial regulation of patterning during embryonic development.  The most commonly accepted mechanism of protein gradient formation involves the diffusion and degradation of morphogens from a localized source. Recently, an alternative mechanism has been proposed, which is based on cell-to-cell transport via thin, actin-rich cellular extensions known as cytonemes. It has been hypothesized that cytonemes find their targets via a random search process based on alternating periods of retraction and growth, perhaps mediated by some chemoattractant. This is an actin-based analog of the search-and-capture model of microtubules of the mitotic spindle searching for cytochrome binding sites (kinetochores) prior to separation of cytochrome pairs. In this talk, we introduce a search-and-capture model of cytoneme-based morphogenesis, in which nucleating cytonemes from a source cell dynamically grow and shrink until making contact with a target cell and delivering a burst of morphogen. We model the latter as a one-dimensional search process with stochastic resetting, finite returns times and refractory periods. We use a renewal method to calculate the splitting probabilities and conditional mean first passage times (MFPTs) for the cytoneme to be captured by a given target cell. We show how multiple rounds of search-and-capture, morphogen delivery, cytoneme retraction and nucleation events lead to the formation of a morphogen gradient. We proceed by formulating the morphogen bursting model as a queuing process, analogous to the study of translational bursting in gene networks. We end by briefly discussing current work on a model of cytoneme-mediated within-host viral spread.

Subscribe to