AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Lu, H
Zhou, Y
Liu, S
Wang, Z
Mahoney, M
Yang, Y
Advances in Neural Information Processing Systems
volume 37
(01 Jan 2024)
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
Zhang, Z
Chen, R
Liu, S
Yao, Z
Ruwase, O
Chen, B
Wu, X
Wang, Z
Advances in Neural Information Processing Systems
volume 37
(01 Jan 2024)
A hyperbolic free-by-cyclic group determined by its finite quotients
Andrew, N
Hillen, P
Lyman, R
Pfaff, C
Glasgow Mathematical Journal
(04 Apr 2025)
A hyperbolic free-by-cyclic group determined by its finite quotients
Andrew, N
Hillen, P
Lyman, R
Pfaff, C
Glasgow Mathematical Journal
(04 Apr 2025)
Schrödinger Bridge Flow for Unpaired Data Translation
De Bortoli, V
Korshunova, I
Mnih, A
Doucet, A
Advances in Neural Information Processing Systems
volume 37
(01 Jan 2024)
Score-Optimal Diffusion Schedules
Williams, C
Campbell, A
Doucet, A
Syed, S
Advances in Neural Information Processing Systems
volume 37
(01 Jan 2024)
New horizons for inhomogeneous quenches and Floquet CFT
Jiang, H
Mezei, M
Journal of High Energy Physics
volume 2025
issue 4
(02 Apr 2025)
3d gravity as a random ensemble
Jafferis, D
Rozenberg, L
Wong, G
Journal of High Energy Physics
volume 2025
issue 2
(28 Feb 2025)