Statistical optimal transport via factored couplings

Author: 

Forrow, A
Hütter, J
Nitzan, M
Rigollet, P
Schiebinger, G
Weed, J

Publication Date: 

11 April 2019

Journal: 

Proceedings of Machine Learning Research

Last Updated: 

2020-07-31T21:00:14.067+01:00

Volume: 

89

page: 

2454-2465

abstract: 

We propose a new method to estimate Wasserstein distances and optimal transport plans between two probability distributions from samples in high dimension. Unlike plugin rules that simply replace the true distributions by their empirical counterparts, our method promotes couplings with low trans- port rank, a new structural assumption that is similar to the nonnegative rank of a matrix. Regularizing based on this assumption leads to drastic improvements on highdimensional data for various tasks, including domain adaptation in single-cell RNA sequencing data. These findings are supported by a theoretical analysis that indicates that the transport rank is key in overcoming the curse of dimensionality inherent to datadriven optimal transport.

Symplectic id: 

958941

Submitted to ORA: 

Submitted

Publication Type: 

Conference Paper