Date
Thu, 11 Jun 2026
Time
12:00 - 12:30
Location
Lecture Room 4, Mathematical Institute
Speaker
Katherine Pearce
Organisation
University of Texas at Austin
Add to calendar

Katherine Pearce is going to talk about: 'Randomized Algorithms for Tensor CUR Approximations in Attention Mechanisms'

Attention mechanisms are a central component of transformer models that capture contextual relationships between tokens in large language models. Although many of the underlying computations (e.g., query, key, and value embeddings in multi-head attention) are inherently multi-way, classical transformer models are built on matrix-based formulations. In this talk, we discuss several ways that tensorial structure can be imposed on and exploited in attention mechanisms of transformer models. We describe how tensor-based attention can capture higher-order contextual relationships among tokens. We then explore how randomized algorithms to compute tensor CUR decompositions may be used to accelerate computations in tensor-based attention and reduce storage requirements.

 

 

Last updated on 9 Jun 2026, 8:32am. Please contact us with feedback and comments about this page.