Hierarchical Quantification of Synergy in Channels
Perrone, P Ay, N Frontiers in Robotics and AI volume 2 35 (08 Jan 2016)
Bimonoidal Structure of Probability Monads
Fritz, T Perrone, P Electronic Notes in Theoretical Computer Science volume 341 121-149 (Dec 2018)
Stochastic order on metric spaces and the ordered Kantorovich monad
Fritz, T Perrone, P Advances in Mathematics volume 366 107081 (Jun 2020)
Monads, Partial Evaluations, and Rewriting
Fritz, T Perrone, P Electronic Notes in Theoretical Computer Science volume 352 129-148 (Oct 2020)
Partition functions and fibering operators on the Coulomb branch of 5d SCFTs
Closset, C Magureanu, H Journal of High Energy Physics volume 2023 issue 1 35- (10 Jan 2023)
Thu, 11 May 2023

14:00 - 15:00
Lecture Room 3

A coordinate descent algorithm on the Stiefel manifold for deep neural network training

Estelle Massart
(UC Louvain)
Abstract

We propose to use stochastic Riemannian coordinate descent on the Stiefel manifold for deep neural network training. The algorithm rotates successively two columns of the matrix, an operation that can be efficiently implemented as a multiplication by a Givens matrix. In the case when the coordinate is selected uniformly at random at each iteration, we prove the convergence of the proposed algorithm under standard assumptions on the loss function, stepsize and minibatch noise. Experiments on benchmark deep neural network training problems are presented to demonstrate the effectiveness of the proposed algorithm.

Thu, 15 Jun 2023

14:00 - 15:00
Lecture Room 3

26 Years at Oxford

Nick Trefethen
(Oxford University)
Abstract

I will reflect on my time as Professor of Numerical Analysis.

Subscribe to