Past Computational Mathematics and Applications Seminar

Thu, 25 Jun 2026

13:00 - 14:00

Lecture Room 4

Temporal high-order structure-preserving parametric finite element methods for curvature flows

Prof Chunmei Su

(Tsinghua University)

Abstract

Professor Chunmei Su will talk about: 'Temporal high-order structure-preserving parametric finite element methods for curvature flows'

The quality of the mesh is crucial for simulating curvature flows, as standard approaches may fail due to mesh distortion. We first present a series of high-order parametric finite element methods based on the Barrett-Garcke-Nürnberg formulation for solving various types of flows involving curves and surfaces. Extensive numerical experiments demonstrate the anticipated high-order accuracy while maintaining favorable mesh quality throughout the evolution process. Secondly, for flows involving multiple geometric structures, such as surface diffusion (which reduces area while preserving volume), we propose a type of structure-preserving method that incorporates two scalar Lagrange multipliers along with two evolution equations related to area and volume, respectively. These schemes effectively preserve the geometric structure at a fully discrete level. Comprehensive numerical experiments illustrate that our methods achieve the desired temporal accuracy, while simultaneously preserving the geometric structure of the surface diffusion.

Thu, 18 Jun 2026

14:00 - 15:00

Lecture Room 3

Fictitious domain approach to FSI: theoretical results and implementation details

Prof Daniele Boffi

(King Abdullah University of Science and Technology (KAUST))

Abstract

Professor Boffi will talk about: 'Fictitious domain approach to FSI: theoretical results and implementation details'

He will review the main aspects of the fictitious domain approach with a distributed Lagrange multiplier for the approximation of fluid-structure interaction problems. Theoretical results include the analysis of the continuous problem in a linearized setting and the stability of the discrete scheme in space and time. Professor Boffi will give details on some implementation aspects related to the treatment and integration of the coupling terms, and propose a multigrid strategy for the solution of the discrete system.

Mon, 15 Jun 2026

16:30 - 17:30

Neural Networks and Classical Numerical Methods: A Theoretical Perspective

Professor Jinchao Xu

(King Abdullah University of Science and Technology (KAUST))

Abstract

Professor Jinchao Xu will talk about: 'Neural Networks and Classical Numerical Methods: A Theoretical Perspective'

This talk compares neural network-based methods with classical numerical methods from a theoretical perspective. Through several representative examples, we examine both the potential and the limitations of deep neural networks in scientific computing and, more broadly, in machine learning. We begin by comparing ReLU deep neural networks with polynomials and piecewise polynomial spaces, focusing on their structures and expressive power. We then revisit the curse of dimensionality and discuss whether deep neural networks truly offer advantages over traditional numerical methods for high-dimensional problems. Next, we consider the use of deep neural networks for solving partial differential equations, with particular emphasis on the challenge of achieving high accuracy. Finally, we examine multigrid methods and explore whether their underlying principles can help us better understand, design, and train deep neural network models with possible implications for broader AI applications.

This is a Joint OxPDE & Numerical Analysis Seminar

Thu, 11 Jun 2026

14:00 - 15:00

Lecture Room 3

Optimization Algorithms for Bilevel Learning with Applications to Imaging

Dr Lindon Roberts

(Melbourne University)

Abstract

Dr Lindon Roberts will talk about: 'Optimization Algorithms for Bilevel Learning with Applications to Imaging'

Many imaging problems, such as denoising or inpainting, can be expressed as variational regularization problems. These are optimization problems for which many suitable algorithms exist. We consider the problem of learning suitable regularizers for imaging problems from example (training) data, which can be formulated as a large-scale bilevel optimization problem.

In this talk, I will introduce new deterministic and stochastic algorithms for bilevel optimization, which require no or minimal hyperparameter tuning while retaining convergence guarantees.

This is joint work with Mohammad Sadegh Salehi and Matthias Ehrhardt (University of Bath), and Subhadip Mukherjee (IIT Kharagpur).

Thu, 04 Jun 2026

14:00 - 15:00

Lecture Room 3

New results on the inclusion of closure orbits and bundles of matrices and matrix pencils

Prof Fernando De Teran

(University of Madrid Carlos III)

Abstract

Professor De Teran will talk about: 'New results on the inclusion of closure orbits and bundles of matrices and matrix pencils'

Orbits of $n \times n$ matrices under similarity are sets of matrices with the same Jordan canonical form (JCF). When computing the JCF (or just the eigenvalues) of a matrix, the knowledge of all possible JCFs of small perturbations of a given JCF can help to understand the output of the algorithm, which is affected by roundoff errors.
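As a small illustration of why perturbed Jordan structure matters numerically, the sketch below perturbs a single nilpotent Jordan block in its bottom-left entry. The block size, the perturbation size and the function name `perturbed_jordan_eigs` are hypothetical choices made for this example and are not taken from the talk. The perturbed matrix $A$ satisfies $A^n = \varepsilon I$, so its eigenvalues have modulus $\varepsilon^{1/n}$: a perturbation of size $10^{-8}$ moves the eigenvalues of a 4 x 4 block by about $10^{-2}$, and the single Jordan block splits into distinct eigenvalues.

```python
import numpy as np

def perturbed_jordan_eigs(n: int, eps: float) -> np.ndarray:
    """Eigenvalues of an n x n nilpotent Jordan block with eps added in the bottom-left corner."""
    J = np.diag(np.ones(n - 1), k=1)  # single Jordan block with eigenvalue 0
    J[n - 1, 0] = eps                 # tiny perturbation (illustrative choice)
    return np.linalg.eigvals(J)

n, eps = 4, 1e-8
eigs = perturbed_jordan_eigs(n, eps)
print(np.abs(eigs))       # moduli of the computed eigenvalues
print(eps ** (1.0 / n))   # predicted modulus eps**(1/n)
```

The computed moduli should agree with the prediction to several digits, even though the unperturbed matrix has the single eigenvalue 0.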

The JCFs that can be obtained after small perturbations of a given JCF, say J, correspond to orbits that "dominate" the orbit of J. In other words, the orbit of J is in the closure of its dominant orbits. The hierarchy of orbit closures of general matrices is well-known, as well as that of the set of matrices with bounded rank.

For matrix pencils (namely, pairs of matrices with the same size) the inclusion relationship between orbit closures has also been considered since at least the 1980s. In this case, the standard equivalence relation is the so-called strict equivalence, which preserves the eigenstructure of the pencil, and the canonical form for this relation is the Kronecker canonical form (KCF). The hierarchy of orbit closures of general pencils under strict equivalence is also well-known. However, when the pencil has some particular structure (e.g., symmetric or Hermitian) then we encounter a different problem if we want the perturbations to maintain this structure. Some effort has been devoted in recent years to the analysis of orbit closures of structured pencils.

In this talk, we will review some recent results on the inclusion relationship between orbit closures of general and bounded-rank structured matrix pencils. We will also consider the inclusion relation of bundle closures. Bundles are generalizations of orbits, allowing the eigenvalues to change, while keeping the KCF.

Thu, 28 May 2026

14:00 - 15:00

Lecture Room 3

Reducing Sample Complexity in Stochastic Derivative-Free Optimization via Tail Bounds and Hypothesis Testing

Prof Luis Nunes Vicente

(Lehigh University)

Abstract

Professor Luis Nunes Vicente will talk about: 'Reducing Sample Complexity in Stochastic Derivative-Free Optimization via Tail Bounds and Hypothesis Testing'

We introduce and analyze new probabilistic strategies for enforcing sufficient decrease conditions in stochastic derivative-free optimization, with the goal of reducing sample complexity and simplifying convergence analysis. First, we develop a new tail bound condition imposed on the estimated reduction in function value, which permits flexible selection of the power used in the sufficient decrease test, q in (1,2]. This approach allows us to reduce the number of samples per iteration from the standard O(delta^{-4}) to O(delta^{-2q}), assuming that the noise moment of order q/(q-1) is bounded. Second, we formulate the sufficient decrease condition as a sequential hypothesis testing problem, in which the algorithm adaptively collects samples until the evidence suffices to accept or reject a candidate step. This test provides statistical guarantees on decision errors and can further reduce the required sample size, particularly in the Gaussian noise setting, where it can approach O(delta^{-2-r}) when the decrease is of the order of delta^r. We incorporate both techniques into stochastic direct-search and trust-region methods for potentially non-smooth, noisy objective functions, and establish their global convergence rates and properties.
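To make the first sample-complexity claim concrete, here is a short worked example. The values $q = 3/2$ and $\delta = 10^{-1}$ are illustrative choices, and the constants hidden in the $O(\cdot)$ bounds are not given in the abstract, so only the scaling is compared.

```latex
\[
q = \tfrac{3}{2},\ \delta = 10^{-1}:\qquad
\delta^{-4} = 10^{4}, \qquad
\delta^{-2q} = \delta^{-3} = 10^{3}, \qquad
\frac{q}{q-1} = 3 .
\]
\[
q = 2:\qquad
\delta^{-2q} = \delta^{-4}, \qquad
\frac{q}{q-1} = 2 .
\]
```

In this reading, $q = 2$ recovers the standard $O(\delta^{-4})$ count under a bounded second moment, while $q = 3/2$ lowers the per-iteration count by a factor of $\delta^{-1}$ at the price of requiring a bounded third moment of the noise.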

This is joint work with Anjie Ding, Francesco Rinaldi, and Damiano Zeffiro.

Thu, 21 May 2026

14:00 - 15:00

Lecture Room 3

A Computational Framework for Infinite-Dimensional Nonlinear Spectral Problems

Prof Matthew J. Colbrook

(Cambridge University)

Abstract

Professor Colbrook is going to talk about: 'A Computational Framework for Infinite-Dimensional Nonlinear Spectral Problems'

Nonlinear spectral problems -- where the spectral parameter enters operator families nonlinearly -- arise in many areas of analysis and applications, yet a systematic computational theory in infinite dimensions remains incomplete. In this talk, I present a unified framework based on a solve-then-discretise philosophy (familiar, for example, from Chebfun!), ensuring that truncation preserves convergence. The setting accommodates unbounded operators, including differential operators with spectral-parameter-dependent boundary conditions.
In the first part, I introduce a provably convergent method for computing spectra and pseudospectra under the minimal assumption of gap-metric continuity of operator graphs -- the weakest natural setting in which the resolvent norm remains continuous.
In the second part, I develop a contour-based framework for discrete spectra of holomorphic operator families, with a complete analysis of stability, convergence, and randomised sketching based on Gaussian probes. This perspective unifies and extends many existing contour integral methods. Examples throughout highlight practical effectiveness and subtle phenomena unique to infinite dimensions, including the perhaps unexpected sensitivity to probe selection when seeking to avoid spectral pollution.

Thu, 14 May 2026

14:00 - 15:00

Lecture Room 3

Numerical analysis of oscillatory solutions of compressible flows

Prof Dr Maria Lukacova

(Johannes Gutenberg University Mainz)

Abstract

Speaker Prof Dr Maria Lukacova will talk about 'Numerical analysis of oscillatory solutions of compressible flows'

Oscillatory solutions of compressible flows arise in many practical situations. An iconic example is the Kelvin-Helmholtz problem, where standard numerical methods yield oscillatory solutions. In such a situation, standard tools of numerical analysis for partial differential equations are not applicable.

We will show that structure-preserving numerical methods converge in general to generalised solutions, the so-called dissipative solutions, which describe the limits of oscillatory sequences. We will concentrate on the inviscid flows, the Euler equations of gas dynamics, and mention also the relevant results obtained for the viscous compressible flows, governed by the Navier-Stokes equations.

We discuss a concept of K-convergence that turns a weak convergence of numerical solutions into the strong convergence of their empirical means to a dissipative solution. The latter satisfies a weak formulation of the Euler equations modulo the Reynolds turbulent stress. We will also discuss suitable selection criteria to recover well-posedness of the Euler equations of gas dynamics. Theoretical results will be illustrated by a series of numerical simulations.

Thu, 07 May 2026

14:00 - 15:00

Lecture Room 3

Private estimation in stochastic block models

Prof Po-Ling Loh

(Cambridge)

Abstract

Professor Po-Ling Loh will talk about: 'Private estimation in stochastic block models'

We study the problem of private estimation for stochastic block models, where the observation comes in the form of an undirected graph, and the goal is to partition the nodes into unknown, underlying communities. We consider a notion of differential privacy known as node differential privacy, meaning that two graphs are treated as neighbors if one can be transformed into the other by changing the edges connected to exactly one node. The goal is to develop algorithms with optimal misclassification error rates, subject to a certain level of differential privacy.

We present several algorithms based on private eigenvector extraction, private low-rank matrix estimation, and private SDP optimization. A key contribution of our work is a method for converting a procedure which is differentially private and has low statistical error on degree-bounded graphs to one that is differentially private on arbitrary graph inputs, while maintaining good accuracy (with high probability) on typical inputs. This is achieved by considering a certain smooth version of a map from the space of all undirected graphs to the space of bounded-degree graphs, which can be appropriately leveraged for privacy. We discuss the relative advantages of the algorithms we introduce and also provide some lower bounds for the performance of any private community estimation algorithm.

This is joint work with Laurentiu Marchis, Ethan D'souza, and Tomas Flidr.

Thu, 30 Apr 2026

14:00 - 15:00

(This talk is hosted by Rutherford Appleton Laboratory)

Modern tasking approaches to simulate black holes (and other interesting phenomena): How can we make them fit to modern hardware?

Prof Tobias Weinzierl

(Durham University)

Abstract

Professor Tobias Weinzierl will be talking about: 'Modern tasking approaches to simulate black holes (and other interesting phenomena): How can we make them fit to modern hardware?'

Over the past decade, my team has developed a simulation code for binary black hole mergers that runs on dynamically adaptive Cartesian meshes.
Its dynamic adaptivity, coupled with multiple numerical schemes operating at different scales and non-deterministic loads from puncture sources, makes task-based parallelisation a natural choice:
Task stealing across fine-grained work units balances the load across many CPU cores, while treating tasks as atomic compute units should---in theory---allow us to deploy seamlessly to accelerators. In practice, it is far from straightforward.

Fine-grained tasks clash with accelerators, which thrive on large, homogeneous data access patterns;
task bursts on the CPU overwhelm tasking systems and produce suboptimal execution schedules;
and when tasks span address spaces, expensive memory movements kill performance.
Surprisingly, many mainstream tasking frameworks even lack the features our domain demands, i.e. the means to express key task concepts.
Our application serves as a powerful lens for examining these challenges.
While our code base extends to other wave phenomena, Lagrangian techniques, and multigrid solvers, they all reveal the same fundamental tension:
modern hardware increasingly struggles to accommodate modern HPC concepts, and it even challenges the notion that one solution fits all hardware components.
The talk proposes practical workarounds and solutions to these shortcomings, while all solutions are designed, wherever possible, to be upstreamed into mainstream software building blocks or at least decoupled from our particular PDE solver, making them broadly applicable to the community.

This talk is hosted by Rutherford Appleton Laboratory and will take place @ Harwell Campus, Didcot, OX11 0QX

Mon, 27 Apr 2026

11:00 - 12:00

Lecture Room 6

Disjunctive Sum of Squares

Professor Amir Ali Ahmadi

(Princeton ORFE)

Abstract

Professor Amir Ali Ahmadi will talk about: 'Disjunctive Sum of Squares'

We introduce the concept of disjunctive sum of squares for certifying nonnegativity of polynomials. Unlike the popular sum of squares approach, where nonnegativity is certified by a single algebraic identity, the disjunctive sum of squares approach certifies nonnegativity using multiple algebraic identities. Our main result is a disjunctive Positivstellensatz showing that the degree of each algebraic identity can be kept as low as the degree of the polynomial whose nonnegativity is in question. Based on this result, we construct a semidefinite programming–based converging hierarchy of lower bounds for the problem of minimizing a polynomial over a compact basic semialgebraic set, in which the size of the largest semidefinite constraint remains fixed throughout the hierarchy. We further prove a second disjunctive Positivstellensatz, which leads to an optimization-free hierarchy for polynomial optimization. We specialize this result to the problem of proving copositivity of matrices. Finally, we describe how the disjunctive sum of squares approach can be combined with a branch-and-bound algorithm, and we present numerical experiments on polynomial, copositive, and combinatorial optimization problems. The talk is self-contained and assumes no prior background in sum of squares optimization.
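For readers without a sum of squares background, the sketch below shows what a classical (single-identity) certificate looks like; it does not implement the disjunctive approach of the talk. The polynomial, the Gram matrix `Q` and the function names are hypothetical examples chosen for illustration. A polynomial $p$ is a sum of squares if $p(x) = z(x)^T Q z(x)$ for a positive semidefinite matrix $Q$ and a vector of monomials $z(x)$.

```python
import numpy as np

# Hypothetical example: p(x) = x^4 - x^2 + 2x + 2 = (x^2 - 1)^2 + (x + 1)^2
P_COEFFS = [1.0, 0.0, -1.0, 2.0, 2.0]  # highest degree first, as np.polyval expects

# Gram matrix with p(x) = z(x)^T Q z(x) for the monomial vector z(x) = (1, x, x^2)
Q = np.array([[ 2.0, 1.0, -1.0],
              [ 1.0, 1.0,  0.0],
              [-1.0, 0.0,  1.0]])

def gram_value(x: float) -> float:
    """Evaluate z(x)^T Q z(x)."""
    z = np.array([1.0, x, x * x])
    return float(z @ Q @ z)

def is_sos_certificate(tol: float = 1e-10) -> bool:
    """Check that Q is positive semidefinite and reproduces p at sample points."""
    psd = np.linalg.eigvalsh(Q).min() >= -tol
    # Two polynomials of degree <= 4 that agree at more than 4 points are identical.
    xs = np.linspace(-3.0, 3.0, 13)
    identity = all(abs(gram_value(x) - np.polyval(P_COEFFS, x)) < 1e-9 for x in xs)
    return bool(psd and identity)

print(is_sos_certificate())
```

In practice the matrix `Q` is found by semidefinite programming rather than written down by hand; here it is fixed in advance so that the example needs only NumPy.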

Further Information

Bio:

Amir Ali Ahmadi is a Professor of Operations Research and Financial Engineering at Princeton University, with affiliated appointments across applied mathematics, computer science, engineering, statistics, robotics, and AI. He directs Princeton’s Minor in Optimization and Quantitative Decision Science and has also held visiting research roles at Citadel and Google Brain. He earned his PhD in EECS from MIT and was a Goldstine Fellow at IBM Research before joining Princeton. His research focuses on optimization, dynamical systems, control-oriented learning, and algorithmic complexity. He has received numerous honors, including the Sloan Fellowship, PECASE, NSF CAREER Award, DARPA Faculty Award, and several major prizes in optimization and control. He is also widely recognized for his teaching and research, with multiple best-paper awards and major teaching awards at Princeton and beyond.

Thu, 19 Mar 2026

14:00 - 15:00

(This talk is hosted by Rutherford Appleton Laboratory)

Lazy Quantum Walks with Native Multiqubit Gates

Dr Steph Foulds

(University of Strathclyde)

Abstract

Dr Steph Foulds will talk about: 'Lazy Quantum Walks with Native Multiqubit Gates'

Quantum walks, the quantum analogue to the classical random walk, have been shown to deliver the Dirac equation in the continuum limit. Recent work has shown that 'lazy', open quantum walks can be mapped to computational methods for fluid simulation such as lattice Boltzmann method, quantum fluid dynamics, and smoothed-particle hydrodynamics. This work concerns evaluating the ability of near-term hardware to perform small, proof-of-concept quantum walks - but crucially with the inclusion of a rest state to encompass 'lazy' quantum walks, providing an integral step towards quantum walks for fluid simulation.

Neutral atom hardware is a promising choice of platform for implementing quantum walks due to its ability to implement native multiqubit gates and to dynamically re-arrange qubits. Using detailed, realistic modelling for near-term multiqubit Rydberg gates via two-photon adiabatic rapid passage, SPAM, and passive error, we present the gate sequences and final state fidelities for quantum walks with and without a rest state on 4- to 16-node rings. This, along with results of an error model with improved two- and three-qubit gate fidelities, leads us to conclude that a native four-qubit gate is required for the near-term implementation of interesting quantum walks on neutral atom hardware.

Please note: this talk is hosted by Rutherford Appleton Laboratory, Harwell Campus, Didcot, OX11 0QX

Further Information

Join the talk on Microsoft Teams

Meeting ID 351 045 392 852 1

Passcode ew9jZ7Kf

Thu, 12 Mar 2026

14:00 - 15:00

Lecture Room 3

The orbital structure of the Hill's problem

Dr Anna Lisa Varri

(University of Edinburgh)

Abstract

Dr Anna Lisa Varri will talk about: 'The orbital structure of the Hill's problem'

Hill’s problem is a limiting case of the circular restricted gravitational three-body problem in which the mass ratio between the two massive bodies tends to zero, leaving a small region surrounding the secondary in which it remains gravitationally dominant. Originally formulated in terms of point masses, Hill’s problem may be modified to include a secondary of finite extent, thus providing a more realistic description of the dynamics internal to a stellar cluster orbiting within a host galaxy. By considering stellar energies above the cluster escape energy, we may investigate the dynamics that underpin the process of stellar escape from star clusters -- a topical issue in contemporary astrophysics. Specifically, we construct a self-consistent formulation of Hill’s problem using a tidally perturbed cluster model for the secondary body. The behaviour of energetically unbound stellar orbits within such a self-consistent problem, as characterised using Poincaré surfaces of section, is then numerically explored via a structure-preserving integrator, revealing a previously unknown bifurcation in the orbital structure.

Thu, 05 Mar 2026

14:00 - 15:00

Lecture Room 3

Stabilised Finite Element Methods for General Convection–Diffusion Equations

Dr Jindong Wang

(Mathematical Institute, University of Oxford)

Abstract

Dr Jindong Wang will talk about: 'Stabilised Finite Element Methods for General Convection–Diffusion Equations'

This talk presents several stabilised finite element methods for general convection–diffusion equations, with particular emphasis on recent extensions to vector-valued problems arising in magnetohydrodynamics (MHD). Owing to the non-self-adjoint structure of the operator and the potentially large disparity between convective and diffusive scales, standard Galerkin discretisations may exhibit non-physical oscillations. We design a class of upwind-type schemes and exponentially fitted methods for vector-valued problems that mitigate these effects, highlighting both their shared stabilisation mechanisms and the distinctive features that arise in the vector-valued setting. These developments illustrate concrete strategies for the design and analysis of finite element discretisations for general convection–diffusion problems.

Thu, 26 Feb 2026

14:00 - 15:00

Lecture Room 3

Paving the way to a T-coercive method for the wave equation

Dr Carolina Urzua Torres

(TU Delft)

Abstract

Dr Carolina Urzua Torres will talk about 'Paving the way to a T-coercive method for the wave equation'

Space-time Galerkin methods are gradually becoming popular, since they allow adaptivity and parallelization in space and time simultaneously. A lot of progress has been made for parabolic problems, and its success has motivated an increased interest in finding space-time formulations for the wave equation that lead to unconditionally stable discretizations. In this talk I will discuss some of the challenges that arise and some recent work in this direction.

In particular, I will present what we see as a first step toward introducing a space-time transformation operator $T$ that establishes $T$-coercivity for the weak variational formulation of the wave equation in space and time on bounded Lipschitz domains. As a model problem, we study the ordinary differential equation (ODE) $u'' + \mu u = f$ for $\mu>0$, which is linked to the wave equation via a Fourier expansion in space. For its weak formulation, we introduce a transformation operator $T_\mu$ that establishes $T_\mu$-coercivity of the bilinear form yielding an unconditionally stable Galerkin-Bubnov formulation with error estimates independent of $\mu$. The novelty of the current approach is the explicit dependence of the transformation on $\mu$ which, when extended to the framework of partial differential equations, yields an operator acting in both time and space. We pay particular attention to keeping the trial space as a standard Sobolev space, simplifying the error analysis, while only the test space is modified.
The theoretical results are complemented by numerical examples.
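The link between the model ODE and the wave equation can be sketched as follows. The unit wave speed, the homogeneous Dirichlet boundary condition and the notation $(\mu_k, \varphi_k)$ for the eigenpairs are assumptions made for this illustration and are not stated in the abstract.

```latex
\[
\partial_{tt} u - \Delta u = f \ \text{ in } \Omega \times (0,T), \qquad
u = 0 \ \text{ on } \partial\Omega \times (0,T),
\]
\[
-\Delta \varphi_k = \mu_k \varphi_k, \qquad \mu_k > 0, \qquad
(\varphi_k, \varphi_j)_{L^2(\Omega)} = \delta_{kj},
\]
\[
u(x,t) = \sum_{k} u_k(t)\, \varphi_k(x), \qquad
f(x,t) = \sum_{k} f_k(t)\, \varphi_k(x)
\quad\Longrightarrow\quad
u_k''(t) + \mu_k\, u_k(t) = f_k(t) \ \text{ for each } k .
\]
```

Each Fourier coefficient therefore solves the model problem $u'' + \mu u = f$ with $\mu = \mu_k$, which is why stability and error estimates independent of $\mu$ matter: the eigenvalues $\mu_k$ are unbounded.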

Thu, 19 Feb 2026

14:00 - 15:00

Lecture Room 3

Subspace Correction Methods for Convex Optimization: Algorithms, Theory, and Applications

Jongho Park

(King Abdullah University of Science and Technology (KAUST))

Abstract

Speaker Jongho Park will talk about 'Subspace Correction Methods for Convex Optimization: Algorithms, Theory, and Applications'

This talk considers a framework of subspace correction methods for convex optimization, which provides a unified perspective for the design and analysis of a wide range of iterative methods, including advanced domain decomposition and multigrid methods. We first develop a convergence theory for parallel subspace correction methods based on the observation that these methods can be interpreted as nonlinearly preconditioned gradient descent methods. This viewpoint leads to a simpler and sharper analysis compared with existing approaches. We further show how the theory can be extended to semicoercive and nearly semicoercive problems. In addition, we explore connections between subspace correction methods and other classes of iterative algorithms, such as alternating projection methods, through the lens of convex duality, thereby enabling a unified treatment. Several applications are presented, including nonlinear partial differential equations, variational inequalities, and mathematical imaging problems. The talk concludes with a discussion of relevant and emerging research directions.

Thu, 12 Feb 2026

14:00 - 15:00

Lecture Room 3

The Dean–Kawasaki Equation: Theory, Numerics, and Applications

Prof Ana Djurdjevac

(Mathematical Institute - University of Oxford)

Abstract

Professor Ana Djurdjevac will talk about: 'The Dean–Kawasaki Equation: Theory, Numerics, and Applications'

The Dean–Kawasaki equation provides a stochastic partial differential equation description of interacting particle systems at the level of empirical densities and has attracted considerable interest in statistical physics, stochastic analysis, and applied modeling. In this work, we study analytical and numerical aspects of the Dean–Kawasaki equation, with a particular focus on well-posedness, structure preservation, and possible discretization strategies. In addition, we extend the framework to the Dean–Kawasaki equation posed on smooth hypersurfaces. We discuss applications of the Dean–Kawasaki framework to particle-based models arising in biological systems and modeling social dynamics.

Thu, 05 Feb 2026

14:00 - 15:00

(This talk is hosted by Rutherford Appleton Laboratory)

A Riemannian Approach for PDE-Constrained Shape Optimization Using Outer Metrics

Estefania Loayza Romero

(University of Strathclyde)

Abstract

Speaker Estefania Loayza Romero will talk about: 'A Riemannian Approach for PDE-Constrained Shape Optimization Using Outer Metrics'

In PDE-constrained shape optimisation, shapes are traditionally viewed as elements of a Riemannian manifold, specifically as embeddings of the unit circle into the plane, modulo reparameterizations. The standard approach employs the Steklov-Poincaré metric to compute gradients for Riemannian optimisation methods. A significant limitation of current methods is the absence of explicit expressions for the geodesic equations associated with this metric. Consequently, algorithms have relied on retractions (often equivalent to the perturbation of identity method in shape optimisation) rather than true geodesic paths. Previous research suggests that incorporating geodesic equations, or better approximations thereof, can substantially enhance algorithmic performance. This talk presents numerical evidence demonstrating that using outer metrics, defined on the space of diffeomorphisms with known geodesic expressions, improves Riemannian gradient-based optimisation by significantly reducing the number of required iterations and preserving mesh quality throughout the optimisation process.

This talk is hosted at RAL.

Thu, 29 Jan 2026

14:00 - 15:00

Lecture Room 3

Finite element form-valued forms

Prof Kaibo Hu

(Mathematical Institute)

Abstract

Professor Kaibo Hu will be talking about: 'Finite element form-valued forms'

Some of the most successful vector-valued finite elements in computational electromagnetics and fluid mechanics, such as the Nédélec and Raviart-Thomas elements, are recognized as special cases of Whitney’s discrete differential forms. Recent efforts aim to go beyond differential forms and establish canonical discretizations for more general tensors. An important class is that of form-valued forms, or double forms, which includes the metric tensor (symmetric (1,1)-forms) and the curvature tensor (symmetric (2,2)-forms). Just as the differential structure of forms is encoded in the de Rham complex, that of double forms is encoded in the Bernstein–Gelfand–Gelfand (BGG) sequences and their cohomologies. Important examples include the Calabi complex in geometry and the Kröner complex in continuum mechanics.
These constructions aim to address the problem of discretizing tensor fields with general symmetries on a triangulation, with a particular focus on establishing discrete differential-geometric structures and compatible tensor decompositions in 2D, 3D, and higher dimensions.

Thu, 22 Jan 2026

14:00 - 15:00

Lecture Room 3

Quadrature = rational approximation

Prof Nick Trefethen

(Harvard University)

Abstract

Professor Nick Trefethen will speak about: 'Quadrature = rational approximation'

Whenever you see a string of quadrature nodes, you can consider it as a branch cut defined by the poles of a rational approximation to the Cauchy transform of a weight function. The aim of this talk is to explain this strange statement and show how it opens the way to calculation of targeted quadrature formulas for all kinds of applications. Gauss quadrature is an example, but it is just the starting point, and many more examples will be shown. I hope this talk will change your understanding of quadrature formulas.
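A minimal numerical sketch of the simplest instance of this statement, Gauss-Legendre quadrature on $[-1,1]$ with weight function 1, is given below. It is an illustration only and not the speaker's code; the function names and the evaluation point $z = 2$ are hypothetical choices. Applying the $n$-point rule to $f(x) = 1/(z - x)$ gives the rational function $r(z) = \sum_k w_k/(z - x_k)$, whose poles are the quadrature nodes, and this approximates the Cauchy transform $\int_{-1}^{1} \frac{dx}{z - x} = \log\frac{z+1}{z-1}$.

```python
import numpy as np

def gauss_rational(z: float, n: int) -> float:
    """Rational function sum_k w_k / (z - x_k) built from n-point Gauss-Legendre nodes and weights."""
    nodes, weights = np.polynomial.legendre.leggauss(n)
    return float(np.sum(weights / (z - nodes)))

def cauchy_transform_exact(z: float) -> float:
    """Cauchy transform of the weight 1 on [-1, 1], valid for real z > 1."""
    return float(np.log((z + 1.0) / (z - 1.0)))

z = 2.0
for n in (2, 4, 8, 12):
    err = abs(gauss_rational(z, n) - cauchy_transform_exact(z))
    print(n, err)  # the error shrinks rapidly as n grows
```

The rapid decay of the error away from the interval is what makes the nodes behave like a discretised branch cut of the Cauchy transform.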

This is joint work with Andrew Horning.

Thu, 04 Dec 2025

14:00 - 15:00

Lecture Room 3

Sparse Grid Methods for Boundary Layer Problems

Dr Niall Madden

(University of Galway)

Abstract

In this talk, we'll consider the numerical approximation of singularly perturbed reaction-diffusion partial differential equations, by finite element methods (FEMs).

Solutions to such problems feature boundary layers, the width of which depends on the magnitude of the perturbation parameter. For many years, some numerical analysts have been preoccupied with constructing methods that can resolve any layers present, and for which one can establish an error estimate that is independent of the perturbation parameter. Such methods are called "parameter robust", or (in some norms) "uniformly convergent".

In this talk we'll begin with the simplest possible parameter robust FEM: a standard Galerkin finite element method applied on a suitably constructed mesh using a priori information. However, from a practical point of view, this approach is not very scalable. To resolve this issue we consider the application of sparse grid techniques. These methods have many variants, two of which we'll consider: the hierarchical basis approach (e.g., Zenger, 1991) and the two-scale method (e.g., many papers by Aihui Zhou and co-authors). The former can be more efficient, while the latter is considered simpler in both theory and practice.

Our goal is to try to unify these two approaches (at least in two dimensions), and then extend to three-dimensional problems, and, moreover, to other FEMs.

Thu, 27 Nov 2025

14:00 - 15:00

Lecture Room 3

The Role of Inexactness in Krylov Subspace Regularization for Inverse Problems

Dr Malena Sabate Landman

(Mathematical Institute, University of Oxford)

Abstract

Linear discrete inverse problems arise in many areas of science and engineering, from medical imaging and geophysics to atmospheric modelling. Their numerical solution often relies on iterative algorithms, particularly Krylov subspace methods, that can efficiently handle large-scale, ill-posed systems. In many practical settings, however, exact computations of matrix–vector products, preconditioners, or right-hand sides are either infeasible or unnecessary, leading to inexact iterations. This talk explores the interplay between inexactness and the regularizing behaviour of Krylov subspace methods for inverse problems. We discuss how approximate computations influence the regularization effect inherent in early iterations, as well as semiconvergence, and how controlled inexactness may be exploited to improve computational efficiency. The aim is to provide a broad perspective on recent insights and open questions at the interface of inverse problems, iterative solvers, and computational inexactness.

Thu, 20 Nov 2025

14:00 - 15:00

Lecture Room 3

Optimisation on Probability Distributions - Are We There Yet?

Chris Oates

(Newcastle University)

Abstract

Several interesting and emerging problems in statistics, machine learning and optimal transport can be cast as minimisation of (entropy-regularised) objective functions defined on an appropriate space of probability distributions. Numerical methods have historically focused on linear objective functions, a setting in which one has access to an unnormalised density for the distributional target. For nonlinear objectives, numerical methods are relatively under-developed; for example, mean-field Langevin dynamics is considered state-of-the-art. In the nonlinear setting even basic questions, such as how to tell whether or not a sequence of numerical approximations has practically converged, remain unanswered. Our main contribution is to present the first computable measure of sub-optimality for optimisation in this context.

Joint work with Clémentine Chazal, Heishiro Kanagawa, Zheyang Shen and Anna Korba.

Thu, 13 Nov 2025

14:00 - 15:00

Lecture Room 3

Fast Algorithms for Optimal Viscosities in Damped Mechanical Systems

Françoise Tisseur

(University of Manchester)

Abstract

Optimal damping consists of identifying a viscosity vector that maximizes the decay rate of a mechanical system's response. This can be rephrased as minimizing the trace of the solution of a Lyapunov equation whose coefficient matrix, representing the system dynamics, depends on the dampers' viscosities. The latter must be nonnegative for a physically meaningful solution, and the system must be asymptotically stable at the solution.

In this talk, we present conditions under which the system is never stable or may not be stable for certain values of the viscosity vector, and, in the latter case, discuss how to modify the constraints so as to guarantee stability. We show that the KKT conditions of our nonlinear optimization problem are equivalent to a viscosity-dependent nonlinear residual function that is equal to zero at an optimal viscosity vector. To minimize this residual function, we propose a Barzilai-Borwein residual minimization algorithm (BBRMA) and a spectral projection gradient algorithm (SPG). The efficiency of both algorithms relies on a fast computation of the gradient for BBRMA, and both the objective function and its gradient for SPG. By fully exploiting the low-rank structure of the problem, we show how to compute these in $O(n^2)$ operations, $n$ being the size of the mechanical system.

This is joint work with Qingna Li (Beijing Institute of Technology).

Thu, 06 Nov 2025

14:00 - 15:00

Lecture Room 3

When AI Goes Awry

Des Higham

(University of Edinburgh)

Abstract

Over the last decade, adversarial attack algorithms have revealed instabilities in artificial intelligence (AI) tools. These algorithms raise issues regarding safety, reliability and interpretability, especially in high-risk settings. Mathematics is at the heart of this landscape, with ideas from numerical analysis, optimization, and high-dimensional stochastic analysis playing key roles. From a practical perspective, there has been a war of escalation between those developing attack and defence strategies. At a more theoretical level, researchers have also studied bigger-picture questions concerning the existence and computability of successful attacks. I will present examples of attack algorithms for neural networks in image classification, for transformer models in optical character recognition and for large language models. I will also show how recent generative diffusion models can be used adversarially. From a more theoretical perspective, I will outline recent results on the overarching question of whether, under reasonable assumptions, it is inevitable that AI tools will be vulnerable to attack.

Thu, 30 Oct 2025

14:00 - 15:00

Lecture Room 3

Sparse Graphical Linear Dynamical Systems

Emilie Chouzenoux

(INRIA Saclay, France)

Abstract

Time-series datasets are central in numerous fields of science and engineering, such as biomedicine, Earth observation, and network analysis. Extensive research exists on state-space models (SSMs), which are powerful mathematical tools that allow for probabilistic and interpretable learning on time series. Estimating the model parameters in SSMs is arguably one of the most complicated tasks, and the inclusion of prior knowledge is known both to ease interpretation and to complicate the inferential tasks. In this talk, I will introduce a novel joint graphical modeling framework called DGLASSO (Dynamic Graphical Lasso) [1], that bridges the static graphical Lasso model [2] and the causal-based graphical approach for the linear-Gaussian SSM in [3]. I will also present a new inference method within the DGLASSO framework that implements an efficient block alternating majorization-minimization algorithm. The algorithm's convergence is established by building on modern tools from nonlinear analysis. Experimental validation on synthetic and real weather variability data showcases the effectiveness of the proposed model and inference algorithm.

[1] E. Chouzenoux and V. Elvira. Sparse Graphical Linear Dynamical Systems. Journal of Machine Learning Research, vol. 25, no. 223, pp. 1-53, 2024.

[2] J. Friedman, T. Hastie, and R. Tibshirani. Sparse inverse covariance estimation with the graphical LASSO. Biostatistics, vol. 9, no. 3, pp. 432–441, Jul. 2008.

[3] V. Elvira and E. Chouzenoux. Graphical Inference in Linear-Gaussian State-Space Models. IEEE Transactions on Signal Processing, vol. 70, pp. 4757-4771, Sep. 2022.

Thu, 23 Oct 2025

14:00 - 15:00

(This talk is hosted by Rutherford Appleton Laboratory)

Interior-point optimisation for quadratic programs with conic constraints

Paul Goulart

(Oxford University)

Abstract

The talk will present the open-source convex optimisation solver Clarabel, an interior-point based solver that uses a novel homogeneous embedding technique offering substantially faster solve times relative to existing open-source and commercial interior-point solvers for some problem types. This improvement is due to both a reduction in the number of required interior point iterations as well as an improvement in both the size and sparsity of the linear system that must be solved at each iteration. For large-scale problems we employ a variety of additional techniques to accelerate solve times, including chordal decomposition methods, GPU sub-solvers, and custom handling of certain specialised cones. The talk will describe details of our implementation and show performance results with respect to solvers based on the standard homogeneous self-dual embedding.

This talk is hosted by Rutherford Appleton Laboratory and will take place at Harwell Campus, Didcot, OX11 0QX.

Thu, 16 Oct 2025

14:00 - 15:00

Lecture Room 3

Piecewise rational finite element spaces of differential forms

Evan Gawlik

(Santa Clara University)

Abstract

The Whitney forms on a simplicial triangulation are piecewise affine differential forms that are dual to integration over chains. The so-called blow-up Whitney forms are piecewise rational generalizations of the Whitney forms. These differential forms, which are also called shadow forms, were first introduced by Brasselet, Goresky, and MacPherson in the 1990s. The blow-up Whitney forms exhibit singular behavior on the boundary of the simplex, and they appear to be well-suited for constructing certain novel finite element spaces, like tangentially- and normally-continuous vector fields on triangulated surfaces. This talk will discuss the blow-up Whitney forms, their properties, and their applicability to PDEs like the Bochner Laplace problem.

Thu, 09 Oct 2025

14:00 - 15:00

(This talk is hosted by Rutherford Appleton Laboratory)

HSS iteration for solving the indefinite Helmholtz equation by multigrid with standard components

Colin Cotter

(Imperial College, London)

Abstract

We provide an iterative solution approach for the indefinite Helmholtz equation discretised using finite elements, based upon a Hermitian Skew-Hermitian Splitting (HSS) iteration applied to the shifted operator, and prove that the iteration is k- and mesh-robust when O(k) HSS iterations are performed. The HSS iterations involve solving a shifted operator that is suitable for approximation by multigrid using standard smoothers and transfer operators, and hence we can use O(N) parallel processors in a high performance computing implementation, where N is the total number of degrees of freedom. We argue that the algorithm converges in O(k) wallclock time when within the range of scalability of the multigrid. We provide numerical results verifying our proofs and demonstrating this claim, establishing a method that can make use of large scale high performance computing systems.

This talk is hosted by Rutherford Appleton Laboratory and will take place at Harwell Campus, Didcot, OX11 0QX.
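As a rough illustration of the Hermitian/skew-Hermitian splitting idea mentioned in the abstract, the sketch below runs the classical HSS iteration on a small dense matrix. It is a minimal sketch under stated assumptions, not the method of the talk: it assumes the Hermitian part of the matrix is positive definite and uses dense direct solves in place of multigrid. The names `hss_solve` and `toy_matrix`, the parameter `alpha`, and the test matrix are all hypothetical choices made for this example.

```python
import numpy as np


def hss_solve(A, b, alpha, tol=1e-10, max_iter=500):
    """Classical HSS iteration for A x = b.

    Assumes (for this sketch) that the Hermitian part of A is positive
    definite and that alpha > 0.
    """
    n = A.shape[0]
    identity = np.eye(n)
    H = 0.5 * (A + A.conj().T)  # Hermitian part of A
    S = 0.5 * (A - A.conj().T)  # skew-Hermitian part of A
    x = np.zeros(n, dtype=np.result_type(A, b))
    for k in range(1, max_iter + 1):
        # Half step: solve with the shifted Hermitian part.
        x_half = np.linalg.solve(alpha * identity + H, (alpha * identity - S) @ x + b)
        # Full step: solve with the shifted skew-Hermitian part.
        x = np.linalg.solve(alpha * identity + S, (alpha * identity - H) @ x_half + b)
        if np.linalg.norm(b - A @ x) <= tol * np.linalg.norm(b):
            return x, k
    return x, max_iter


def toy_matrix(n=10, c=0.5):
    """Hypothetical test matrix: SPD tridiagonal part plus a skew-symmetric part."""
    T = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
    D = np.eye(n, k=1) - np.eye(n, k=-1)
    return T + c * D


if __name__ == "__main__":
    A = toy_matrix()
    b = np.ones(A.shape[0])
    x, iterations = hss_solve(A, b, alpha=0.5)
    print("iterations:", iterations, "residual:", np.linalg.norm(b - A @ x))
```

In this toy setting each sweep needs two shifted solves; the abstract's point is that, for the Helmholtz problem, such shifted solves can be approximated by multigrid with standard components.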

Fri, 29 Aug 2025
12:30

TBA

Colin Cotter

(Imperial College, London)

Abstract

TBA

Thu, 19 Jun 2025
14:00

Lecture Room 3

Hilbert’s 19th problem and discrete De Giorgi-Nash-Moser theory: analysis and applications

Endre Süli

(Mathematical Institute (University of Oxford))

Abstract

This talk is concerned with the construction and mathematical analysis of a system of nonlinear partial differential equations featuring in a model of an incompressible non-Newtonian fluid, the synovial fluid, contained in the cavities of human joints. To prove the convergence of the numerical method one has to develop a discrete counterpart of the De Giorgi-Nash-Moser theorem, which guarantees a uniform bound on the sequence of continuous piecewise linear finite element approximations in a Hölder norm, for divergence-form uniformly elliptic partial differential equations with measurable coefficients.

Thu, 12 Jun 2025

14:00 - 15:00

Lecture Room 3

Finite volumes for a generalized Poisson-Nernst-Planck system with cross-diffusion and size exclusion

Clément Cancès

(INRIA LILLE)

Abstract

We propose and analyse two structure preserving finite volume schemes to approximate the solutions to a cross-diffusion system with self-consistent electric interactions introduced by Burger, Schlake & Wolfram (2012). This system has been derived thanks to probabilistic arguments and admits a thermodynamically motivated Lyapunov functional that is preserved by suitable two-point flux finite volume approximations. This allows us to carry out the mathematical analysis of the two schemes, which we then compare.

This is joint work with Maxime Herda and Annamaria Massimini.

Thu, 05 Jun 2025
14:00

Lecture Room 3

Solving sparse linear systems using quantum computing algorithms

Leigh Lapworth

(Rolls-Royce)

Abstract

The currently available quantum computers fall into the NISQ (Noisy Intermediate Scale Quantum) regime. These enable variational algorithms with a relatively small number of free parameters. We are now entering the FTQC (Fault Tolerant Quantum Computer) regime where gate fidelities are high enough that error-correction schemes are effective. The UK Quantum Missions include the target for an FTQC device that can perform a million operations by 2028, and a trillion operations by 2035.

This talk will present the outcomes from assessments of two quantum linear equation solvers for FTQCs: the Harrow–Hassidim–Lloyd (HHL) and the Quantum Singular Value Transform (QSVT) algorithms. These assessments have used sample matrices from a Computational Fluid Dynamics (CFD) test case. The quantum solvers have also been embedded within an outer non-linear solver to judge their impact on convergence. The analysis uses circuit emulation to judge the FTQC requirements to deliver quantum utility.

Thu, 29 May 2025

14:00 - 15:00

Lecture Room 3

On the data-sparsity of the solution of Riccati equations with quasiseparable coefficients

Stefano Massei

(Università di Pisa)

Abstract

Solving large-scale continuous-time algebraic Riccati equations is a significant challenge in various control theory applications.

This work demonstrates that when the matrix coefficients of the equation are quasiseparable, the solution also exhibits numerical quasiseparability. This property enables us to develop two efficient Riccati solvers. The first solver is applicable to the general quasiseparable case, while the second is tailored to the particular case of banded coefficients. Numerical experiments confirm the effectiveness of the proposed algorithms on both synthetic examples and case studies from the control of partial differential equations and agent-based models.

Thu, 22 May 2025

14:00 - 15:00

Lecture Room 3

When you truncate an infinite equation, what happens to the leftovers?

Geoff Vasil

(University of Edinburgh)

Abstract

Numerically solving PDEs typically requires compressing infinite information into a finite system of algebraic equations. Pragmatically, we usually follow a recipe: “Assume solutions of form X; substitute into PDE Y; discard terms by rule Z.” In contrast, Lanczos’s pioneering “tau method” prescribes modifying the PDE to form an exact finite system. Crucially, any recipe-based method can be viewed as adding a small equation correction, enabling us to compare multiple schemes independently of the solver.

This talk also addresses a paradox: PDEs often admit infinitely many solutions, but finite systems produce only a finite set. When we include a “small” correction, the missing solutions are effectively hidden. I will discuss how tau methods frame this perspective and outline proposals for systematically studying and optimising various residuals.

Thu, 15 May 2025
14:00

Lecture Room 3

Quick on the draw: high-frequency trading in the Wild West of cryptocurrency limit order-book markets

Sam Howison

(Mathematical Institute (University of Oxford))

Abstract

Cryptocurrencies such as Bitcoin have only recently become a significant part of the financial landscape. Many billions of dollars are now traded daily on limit order-book markets such as Binance, and these are probably among the most open, liquid and transparent markets there are. They therefore make an interesting platform from which to investigate myriad questions to do with market microstructure. I shall talk about a few of these, including live-trading experiments to investigate the difference between on-paper strategy analysis (typical in the academic literature) and actual trading outcomes. I shall also mention very recent work on the new Hyperliquid exchange which runs on a blockchain basis, showing how to use this architecture to obtain datasets of an unprecedented level of granularity. This is joint work with Jakob Albers, Mihai Cucuringu and Alex Shestopaloff.

Thu, 08 May 2025
14:00

(This talk is hosted by Rutherford Appleton Laboratory)

Multilevel Monte Carlo Methods with Smoothing

Aretha Teckentrup

(University of Edinburgh)

Abstract

Parameters in mathematical models are often impossible to determine fully or accurately, and are hence subject to uncertainty. By modelling the input parameters as stochastic processes, it is possible to quantify the uncertainty in the model outputs.

In this talk, we employ the multilevel Monte Carlo (MLMC) method to compute expected values of quantities of interest related to partial differential equations with random coefficients. We make use of the circulant embedding method for sampling from the coefficient, and to further improve the computational complexity of the MLMC estimator, we devise and implement a smoothing technique integrated into the circulant embedding method. This allows us to choose the coarsest mesh on the first level of MLMC independently of the correlation length of the covariance function of the random field, leading to considerable savings in computational cost.

Please note: this talk is hosted by Rutherford Appleton Laboratory, Harwell Campus, Didcot, OX11 0QX
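To make the multilevel idea in the abstract concrete, here is a minimal sketch of the MLMC telescoping estimator on a toy problem. It is not the PDE setting of the talk and it includes neither circulant embedding nor smoothing: the assumed model is geometric Brownian motion discretised by Euler-Maruyama, and the function names, parameter values and sample counts are hypothetical choices for illustration only.

```python
import numpy as np


def level_difference(level, n_samples, rng, mu=0.05, sigma=0.2, T=1.0, s0=1.0):
    """Samples of P_l - P_{l-1} (or of P_0 when level == 0).

    P_l is the Euler-Maruyama approximation of S_T with 2**level time steps.
    """
    n_fine = 2 ** level
    h = T / n_fine
    dW = rng.standard_normal((n_samples, n_fine)) * np.sqrt(h)
    fine = np.full(n_samples, s0)
    for i in range(n_fine):
        fine = fine + mu * fine * h + sigma * fine * dW[:, i]
    if level == 0:
        return fine
    # The coarse path reuses the same Brownian path: add pairs of fine increments.
    dW_coarse = dW[:, 0::2] + dW[:, 1::2]
    coarse = np.full(n_samples, s0)
    for i in range(n_fine // 2):
        coarse = coarse + mu * coarse * (2 * h) + sigma * coarse * dW_coarse[:, i]
    return fine - coarse


def mlmc_estimate(samples_per_level, seed=0):
    """Telescoping sum E[P_0] + sum_l E[P_l - P_{l-1}], one sample mean per level."""
    rng = np.random.default_rng(seed)
    return sum(
        level_difference(level, n, rng).mean()
        for level, n in enumerate(samples_per_level)
    )


if __name__ == "__main__":
    # Many cheap samples on the coarsest level, few expensive ones on the finest.
    estimate = mlmc_estimate([40000, 10000, 5000, 2500, 1200, 600])
    print("MLMC estimate:", estimate, "exact mean:", np.exp(0.05))
```

The design point is that the correction terms have small variance because fine and coarse paths share the same randomness, so most samples can be taken on the cheapest level.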

Thu, 01 May 2025

14:00 - 15:00

Lecture Room 3

Adventures in structured matrix computations

Gunnar Martinsson

(UT Austin)

Abstract

Many matrices that arise in scientific computing and in data science have internal structure that can be exploited to accelerate computations. The focus in this talk will be on matrices that are either of low rank, or can be tessellated into a collection of subblocks that are either of low rank or are of small size. We will describe how matrices of this nature arise in the context of fast algorithms for solving PDEs and integral equations, and also in handling "kernel matrices" from computational statistics. A particular focus will be on randomized algorithms for obtaining data sparse representations of such matrices.

At the end of the talk, we will explore an unorthodox technique for discretizing elliptic PDEs that was designed specifically to play well with fast algorithms for dense structured matrices.

Thu, 20 Mar 2025
14:00

(This talk is hosted by Rutherford Appleton Laboratory)

Firedrake: a differentiable programming framework for finite element simulation

David Ham

(Imperial College London)

Abstract

Differentiable programming is the underpinning technology for the AI revolution. It allows neural networks to be programmed in very high level user code while still achieving very high performance for both the evaluation of the network and, crucially, its derivatives. The Firedrake project applies exactly the same concepts to the simulation of physical phenomena modelled with partial differential equations (PDEs). By exploiting the high level mathematical abstraction offered by the finite element method, users are able to write mathematical operators for the problem they wish to solve in Python. The high performance parallel implementations of these operators are then automatically generated, and composed with the PETSc solver framework to solve the resulting PDE. However, because the symbolic differential operators are available as code, it is possible to reason symbolically about them before the numerical evaluation. In particular, the operators can be differentiated with respect to their inputs, and the resulting derivative operators composed in forward or reverse order. This creates a differentiable programming paradigm congruent with (and compatible with) machine learning frameworks such as PyTorch and JAX.

In this presentation, David Ham will present Firedrake in the context of differentiable programming, and show how this enables productivity, capability and performance to be combined in a unique way. He will also touch on the mechanism that enables Firedrake to be coupled with PyTorch and JAX.

Please note this talk will take place at Rutherford Appleton Laboratory, Harwell Campus, Didcot.

Thu, 13 Mar 2025

14:00 - 15:00

Lecture Room 3

On the long time behaviour of numerical schemes applied to Hamiltonian PDEs

Erwan Faou

(INRIA)

Abstract

In this talk I will review some recent results concerning the qualitative behaviour of symplectic integrators applied to Hamiltonian PDEs, such as the nonlinear wave equation or Schrödinger equations.

Additionally, I will discuss the problem of numerical resonances, the existence of modified energy and the existence and stability of numerical solitons over long times.

These are works with B. Grébert, D. Bambusi, G. Maierhofer and K. Schratz.

Thu, 06 Mar 2025

14:00 - 15:00

Lecture Room 3

Near-optimal hierarchical matrix approximation

Diana Halikias

(Cornell University)

Abstract

Can one recover a matrix from only matrix-vector products? If so, how many are needed? We will consider the matrix recovery problem for the class of hierarchical rank-structured matrices. This problem arises in scientific machine learning, where one wishes to recover the solution operator of a PDE from only input-output pairs of forcing terms and solutions. Peeling algorithms are the canonical method for recovering a hierarchical matrix from matrix-vector products; however, their recursive nature poses a potential stability issue which may deteriorate the overall quality of the approximation. Our work resolves the open question of the stability of peeling. We introduce a robust version of peeling and prove that it achieves low error with respect to the best possible hierarchical approximation to any matrix, allowing us to analyze the performance of the algorithm on general matrices, as opposed to exactly hierarchical ones. This analysis relies on theory for low-rank approximation, as well as the surprising result that the Generalized Nyström method is more accurate than the randomized SVD algorithm in this setting.
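For readers unfamiliar with the Generalized Nyström method named in the abstract, the following is a minimal sketch of its basic low-rank form, A ≈ (AX)(YᵀAX)⁺(YᵀA), built only from products with A and its transpose. It is the plain low-rank building block, not the peeling algorithm of the talk. The function name, the Gaussian sketch matrices, the oversampling parameter and the assumption that transpose products are available are all choices made for this example.

```python
import numpy as np


def generalized_nystrom(matvec, rmatvec, n_rows, n_cols, rank, oversample=5, seed=0):
    """Rank-`rank` approximation of A from products with A and A^T.

    matvec(X) must return A @ X and rmatvec(Y) must return A.T @ Y.
    """
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n_cols, rank))               # right sketch
    Y = rng.standard_normal((n_rows, rank + oversample))  # oversampled left sketch
    AX = matvec(X)          # shape (n_rows, rank)
    YtA = rmatvec(Y).T      # Y^T A, shape (rank + oversample, n_cols)
    core = Y.T @ AX         # Y^T A X, shape (rank + oversample, rank)
    # Apply the pseudoinverse of the core matrix through a least-squares solve.
    return AX @ np.linalg.lstsq(core, YtA, rcond=None)[0]


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    A = rng.standard_normal((60, 5)) @ rng.standard_normal((5, 50))  # exactly rank 5
    A_hat = generalized_nystrom(lambda X: A @ X, lambda Y: A.T @ Y, 60, 50, rank=5)
    print("relative error:", np.linalg.norm(A - A_hat) / np.linalg.norm(A))
```

For a matrix of exact rank equal to `rank`, this construction reproduces the matrix up to rounding; for general matrices the error depends on the decay of the singular values.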

Thu, 27 Feb 2025

14:00 - 15:00

Lecture Room 3

Learning-enhanced structure preserving particle methods for Landau equation

Li Wang

(University of Minnesota)

Abstract

The Landau equation stands as one of the fundamental equations in kinetic theory and plays a key role in plasma physics. However, computing it presents significant challenges due to the complexity of the Landau operator, the dimensionality, and the need to preserve the physical properties of the solution. In this presentation, I will introduce deep learning assisted particle methods aimed at addressing some of these challenges. These methods combine the benefits of traditional structure-preserving techniques with the approximation power of neural networks, aiming to handle high dimensional problems with minimal training.

Thu, 20 Feb 2025

14:00 - 15:00

(This talk is hosted by Rutherford Appleton Laboratory)

Integrate your residuals while solving dynamic optimization problems

Eric Kerrigan

(Imperial College London)

Abstract

Many optimal control, estimation and design problems can be formulated as so-called dynamic optimization problems, which are optimization problems with differential equations and other constraints. State-of-the-art methods based on collocation, which enforce the differential equations at only a finite set of points, can struggle to solve certain dynamic optimization problems, such as those with high-index differential algebraic equations, consistent overdetermined constraints or problems with singular arcs. We show how numerical methods based on integrating the differential equation residuals can be used to solve dynamic optimization problems where collocation methods fail. Furthermore, we show that integrated residual methods can be computationally more efficient than direct collocation.

This seminar takes place at RAL (Rutherford Appleton Lab).

Thu, 13 Feb 2025

14:00 - 15:00

Lecture Room 3

Global Optimization with Hamilton-Jacobi PDEs

Dante Kalise

(Imperial College London)

Abstract

We introduce a novel approach to global optimization via continuous-time dynamic programming and Hamilton-Jacobi-Bellman (HJB) PDEs. For non-convex, non-smooth objective functions, we reformulate global optimization as an infinite horizon, optimal asymptotic stabilization control problem. The solution to the associated HJB PDE provides a value function which corresponds to a (quasi)convexification of the original objective. Using the gradient of the value function, we obtain a feedback law driving any initial guess towards the global optimizer without requiring derivatives of the original objective. We then demonstrate that this HJB control law can be integrated into other global optimization frameworks to improve their performance and robustness.

Thu, 06 Feb 2025

14:00 - 15:00

Lecture Room 3

Deflation Techniques for Finding Multiple Local Minima of a Nonlinear Least Squares Problem

Marcus Webb

(University of Manchester)

Abstract

Deflation is a technique to remove a solution to a problem so that other solutions to this problem can subsequently be found. The most prominent instance is the deflation we see in eigenvalue solvers, but recent interest has been in deflation of rootfinding problems from nonlinear PDEs with many isolated solutions (spearheaded by Farrell and collaborators).

In this talk I’ll show you recent results on deflation techniques for optimisation algorithms with many local minima, focusing on the Gauss-Newton algorithm for nonlinear least squares problems. I will demonstrate advantages of these techniques instead of the more obvious approach of applying deflated Newton’s method to the first order optimality conditions and present some proofs that these algorithms will avoid the deflated solutions. Along the way we will see an interesting generalisation of Woodbury’s formula to least squares problems, something that should be better known in Numerical Linear Algebra (joint work with Güttel, Nakatsukasa and Bloor Riley).

Main preprint: https://arxiv.org/abs/2409.14438.

WoodburyLS preprint: https://arxiv.org/abs/2406.15120
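The deflation idea described in the abstract can be sketched for scalar rootfinding as follows. This is a minimal sketch of classical deflated Newton for roots, not the Gauss-Newton variant for least squares discussed in the talk. The deflation factor `|x - r|**(-power) + shift`, its default parameters, and all function names are assumptions chosen for this example.

```python
import math


def newton(f, df, x0, tol=1e-12, max_iter=100):
    """Plain Newton iteration for a scalar equation f(x) = 0."""
    x = x0
    for _ in range(max_iter):
        fx = f(x)
        if abs(fx) < tol:
            return x
        x -= fx / df(x)
    raise RuntimeError("Newton iteration did not converge")


def deflate(f, df, known_roots, power=2, shift=1.0):
    """Return g = m * f and its derivative, where m blows up at each known root."""

    def factor(x):
        return math.prod(abs(x - r) ** (-power) + shift for r in known_roots)

    def dfactor(x):
        # Derivative of the product via the sum of logarithmic derivatives.
        log_derivative = sum(
            -power * abs(x - r) ** (-power - 1) * math.copysign(1.0, x - r)
            / (abs(x - r) ** (-power) + shift)
            for r in known_roots
        )
        return factor(x) * log_derivative

    def g(x):
        return factor(x) * f(x)

    def dg(x):
        return dfactor(x) * f(x) + factor(x) * df(x)

    return g, dg


def find_roots(f, df, x0, n_roots):
    """Find several roots from the SAME initial guess by deflating each one found."""
    roots = []
    for _ in range(n_roots):
        g, dg = deflate(f, df, roots)
        roots.append(newton(g, dg, x0))
    return roots


if __name__ == "__main__":
    print(find_roots(lambda x: x * x - 1.0, lambda x: 2.0 * x, x0=2.0, n_roots=2))
```

Starting from the same guess both times, the undeflated iteration finds the root near 1 and the deflated iteration is pushed away from it towards the root near -1.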

Thu, 30 Jan 2025

14:00 - 15:00

Lecture Room 3

Operator learning without the adjoint

Nicolas Boullé

(Imperial College London)

Abstract

There is a mystery at the heart of operator learning: how can one recover a non-self-adjoint operator from data without probing the adjoint? Current practical approaches suggest that one can accurately recover an operator while only using data generated by the forward action of the operator without access to the adjoint. However, naively, it seems essential to sample the action of the adjoint for learning time-dependent PDEs.

In this talk, we will first explore connections with low-rank matrix recovery problems in numerical linear algebra. Then, we will show that one can approximate a family of non-self-adjoint infinite-dimensional compact operators via projection onto a Fourier basis without querying the adjoint.

Thu, 23 Jan 2025

14:00 - 15:00

Lecture Room 3

Multi-Index Monte Carlo Method for Semilinear Stochastic Partial Differential Equations

Abdul Lateef Haji-Ali

(Heriot-Watt University)

Abstract

We present an exponential-integrator-based multi-index Monte Carlo (MIMC) method for the weak approximation of mild solutions to semilinear stochastic partial differential equations (SPDEs). Theoretical results on multi-index coupled solutions of the SPDE are provided, demonstrating their stability and the satisfaction of multiplicative error estimates. Leveraging this theory, we develop a tractable MIMC algorithm. Numerical experiments illustrate that MIMC outperforms alternative approaches, such as multilevel Monte Carlo, particularly in low-regularity settings.

Thu, 12 Dec 2024
14:00

(This talk is hosted by Rutherford Appleton Laboratory)

A Subspace-conjugate Gradient Method for Linear Matrix Equations

Davide Palitta

(Università di Bologna)

Abstract

The solution of multiterm linear matrix equations is still a very challenging task in numerical linear algebra.

While many robust methods for equations with (at most) two terms exist in the literature, having more than two terms makes the numerical treatment of these equations much trickier. Very few options are available in the literature. In particular, to the best of our knowledge, no decomposition-based method for multiterm equations has ever been proposed; only iterative procedures exist.

A non-exhaustive list of contributions in this direction includes a greedy procedure designed by Sirković and Kressner, projection methods tailored to the equation at hand, Riemannian optimization schemes, and matrix-oriented Krylov methods with low-rank truncations. The last class of solvers is probably one of the most commonly used ones. These schemes amount to adapting standard Krylov schemes for linear systems to matrix equations by leveraging the equivalence between matrix equations and their Kronecker form.

As long as no truncations are performed, this equivalence means that the algorithm itself is not exploiting the structure of the problem as it is not able to see that we are actually solving a matrix equation and not a linear system. The low-rank truncations we may perform in the matrix-equation framework can be viewed as a simple computational tool needed to make the solution process affordable in terms of storage allocation and not as an algorithmic advance.

By taking inspiration from the matrix-oriented cg method, our goal is to design a novel iterative scheme for the solution of multiterm matrix equations. We name this procedure the subspace-conjugate gradient method (Ss-cg) due to the peculiar orthogonality conditions imposed to compute some of the quantities involved in the scheme. As we will show in this talk, the main difference between Ss-cg and cg is the ability of the former to capitalize on the matrix equation structure of the underlying problem. In particular, we will make full use of the (low-rank) matrix format of the iterates to define appropriate "step-sizes". While these quantities are the scalars $\alpha_k$ and $\beta_k$ in cg, they become small-dimensional matrices in our new approach.

This novel point of view leads to remarkable computational gains, making Ss-cg a very competitive option for the solution of multiterm linear matrix equations.

This is joint work with Martina Iannacito and Valeria Simoncini, both from the Department of Mathematics, University of Bologna.
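For orientation, the scalar step sizes that the abstract contrasts with Ss-cg appear in the textbook conjugate gradient method for a symmetric positive definite linear system, sketched below. This is standard cg only, not the Ss-cg method of the talk, and the function name, tolerances and test matrix are hypothetical choices for this example.

```python
import numpy as np


def conjugate_gradient(A, b, tol=1e-10, max_iter=None):
    """Textbook cg for A x = b with A symmetric positive definite."""
    n = len(b)
    max_iter = 10 * n if max_iter is None else max_iter
    x = np.zeros(n)
    r = b - A @ x          # residual
    p = r.copy()           # search direction
    rs_old = r @ r
    for _ in range(max_iter):
        Ap = A @ p
        alpha = rs_old / (p @ Ap)   # scalar step length alpha_k
        x = x + alpha * p
        r = r - alpha * Ap
        rs_new = r @ r
        if np.sqrt(rs_new) <= tol * np.linalg.norm(b):
            break
        beta = rs_new / rs_old      # scalar direction update beta_k
        p = r + beta * p
        rs_old = rs_new
    return x


if __name__ == "__main__":
    n = 10
    A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)  # SPD tridiagonal matrix
    b = np.ones(n)
    x = conjugate_gradient(A, b)
    print("residual norm:", np.linalg.norm(b - A @ x))
```

In the abstract's description, Ss-cg replaces the two scalars computed in each iteration by small matrices derived from the low-rank format of the iterates.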

Thu, 05 Dec 2024

14:00 - 15:00

Lecture Room 3

Solving (algebraic problems from) PDEs; a personal perspective

Andy Wathen

(Oxford University)

Abstract

We are now able to solve many partial differential equation problems that were well beyond reach when I started in academia. Some of this success is due to computer hardware but much is due to algorithmic advances.

I will give a personal perspective of the development of computational methodology in this area over my career thus far.

Thu, 28 Nov 2024

14:00 - 15:00

Lecture Room 3

Unleashing the Power of Deeper Layers in LLMs

Shiwei Liu

(Oxford University)

Abstract

Large Language Models (LLMs) have demonstrated impressive achievements. However, recent research has shown that their deeper layers often contribute minimally, with effectiveness diminishing as layer depth increases. This pattern presents significant opportunities for model compression.

In the first part of this seminar, we will explore how this phenomenon can be harnessed to improve the efficiency of LLM compression and parameter-efficient fine-tuning. Despite these opportunities, the underutilization of deeper layers leads to inefficiencies, wasting resources that could be better used to enhance model performance.

The second part of the talk will address the root cause of this ineffectiveness in deeper layers and propose a solution. We identify the issue as stemming from the prevalent use of Pre-Layer Normalization (Pre-LN) and introduce Mix-Layer Normalization (Mix-LN) with combined Pre-LN and Post-LN as a new approach to mitigate this training deficiency.