Multi-armed bandit under uncertainty
Abstract
In a robust decision, we are pessimistic toward our decision making when the probability measure is unknown. In particular, we optimise our decision under the worst case scenario (e.g. via value at risk or expected shortfall). On the other hand, most theories in reinforcement learning (e.g. UCB or epsilon-greedy algorithm) tell us to be more optimistic in order to encourage learning. These two approaches produce an apparent contradict in decision making. This raises a natural question. How should we make decisions, given they will affect our short-term outcomes, and information available in the future?
In this talk, I will discuss this phenomenon through the classical multi-armed bandit problem which is known to be solved via Gittins' index theory under the setting of risk (i.e. when the probability measure is fixed). By extending this result to an uncertainty setting, we can show that it is possible to take into account both uncertainty and learning for a future benefit at the same time. This can be done by extending a consistent nonlinear expectation (i.e. nonlinear expectation with tower property) through multiple filtrations.
At the end of the talk, I will present numerical results which illustrate how we can control our level of exploration and exploitation in our decision based on some parameters.
Oxford Mathematics Open Day Live Stream - 3 July
On 3 July we shall we live streaming our Open Day for prospective applicants as part of our going Behind the Scenes' at Oxford Mathematics. This is our way of making the Open Day 'open' to everyone, wherever you are.
The running order:
10.00am - James Munro introduces you to Mathematics at Oxford
10.30am - Vicky Neale on Pure Mathematics at Oxford
11.00am - Dominic Vella on Applied Mathematics at Oxford
AND
Certain inflammatory and infectious diseases, including atherosclerosis and tuberculosis, are caused by the accumulation inside immune cells of harmful substances, such as lipids and bacteria. A multidisciplinary study published in Proceedings B of the Royal Society, by researchers from the Universities of Oxford and Sydney, has shown how cell cannibalism contributes to this process.
From knots to homotopy theory
Note: unusual time!
Abstract
Knots and their groups are a traditional topic of geometric topology. In this talk, I will explain how aspects of the subject can be approached as a homotopy theorist, rephrasing old results and leading to new ones. Part of this reports on joint work with Tyler Lawson.
A neural network approach to SLV Calibration
Abstract
A central task in modeling, which has to be performed each day in banks and financial institutions, is to calibrate models to market and historical data. So far the choice which models should be used was not only driven by their capacity of capturing empirically the observed market features well, but rather by computational tractability considerations. Due to recent work in the context of machine learning, this notion of tractability has changed significantly. In this work, we show how a neural network approach can be applied to the calibration of (multivariate) local stochastic volatility models. We will see how an efficient calibration is possible without the need of interpolation methods for the financial data. Joint work with Christa Cuchiero and Josef Teichmann.
Analysing scientific progress and communities using topological methods
Persistence Paths and Signature Features in Topological Data Analysis
Abstract
In this talk I will introduce the concept of the path signature and motivate its recent use in analysis of time-ordered data. I will then describe a new feature map for barcodes in persistent homology by first realizing each barcode as a path in a vector space, and then computing its signature which takes values in the tensor algebra over that vector space. The composition of these two operations — barcode to path, path to tensor series — results in a feature map that has several desirable properties for statistical learning, such as universality and characteristicness.