15:00
Computing persistent homology becomes infeasible when a dataset is very large. Inspired by bootstrapping, Chazal et al. (2014) proposed a multiple subsampling approach to approximate the persistence landscape of a massive dataset. In this talk, I will present an extension of the multiple subsampling method to a broader class of vectorizations of persistence diagrams and to persistence diagrams themselves. First, I will review the statistical foundation of the multiple subsampling approach as applied to persistence landscapes in Chazal et al. (2014). Next, I will discuss how this analysis extends to a class of vectorized persistence diagrams called Hölder continuous vectorizations. Finally, I will address the challenges of applying the method to raw persistence diagrams for two measures of centrality: the mean persistence measure and the Fréchet mean of persistence diagrams. I will demonstrate these methods through simulation results and applications to estimating the shape of data.