On the Use of the Lasso for Instrumental Variables Estimation with Some Invalid Instruments
Windmeijer, F Farbmacher, H Davies, N Smith, G Journal of the American Statistical Association volume 114 issue 527 1339-1350 (03 Jul 2019)
The effects of prescribing varenicline on two‐year health outcomes: an observational cohort study using electronic medical records
Davies, N Taylor, G Taylor, A Jones, T Martin, R Munafò, M Windmeijer, F Thomas, K Addiction volume 113 issue 6 1105-1116 (20 Jun 2018)
Zoonotic host diversity increases in human-dominated ecosystems
Gibb, R Redding, D Chin, K Donnelly, C Blackburn, T Newbold, T Jones, K Nature volume 584 issue 7821 398-402 (05 Aug 2020)
Fri, 11 Sep 2020

15:00 - 16:00
Virtual

TDA analysis of flow cytometry data in acute lymphoblastic leukaemia patients

Salvador Chulián García
(Universidad de Cádiz)
Abstract

The high dimensionality of biological data calls for new methods to unravel its complexity. The rich biomedical data that hospitals routinely generate for cancer detection can benefit from these techniques. This is the case for diseases such as Acute Lymphoblastic Leukaemia (ALL), one of the most common childhood cancers. Its diagnosis is based on high-dimensional flow cytometry tumour data that include immunophenotypic expression. Not only is the intensity of these markers meaningful for clinicians, but so is the shape of the resulting point clouds, which is fundamental to finding leukaemic clones. The mathematics of shape recognition in high dimensions can therefore become a critical tool for this kind of data, which is why we turn to tools from Topological Data Analysis such as Persistent Homology.

Given that almost 20% of ALL patients relapse, we provide a methodology to shed some light on the shape of flow cytometry data for both relapsed and non-relapsed patients. We do so by combining the strength of topological data analysis with the versatility of machine learning techniques. The results show topological differences between the two patient sets, such as in the number of connected components and 1-dimensional loops. By means of so-called persistence images, and for specially selected immunophenotypic markers, we obtain a classification of the two cohorts, highlighting the need for new methods to provide better prognosis.
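
As a rough illustration of what "number of connected components" means in persistent homology, the following minimal sketch computes 0-dimensional persistence for a small point cloud using a union-find structure. It is not the speaker's pipeline: the function names `h0_persistence` and `count_components`, the Euclidean metric, and the toy points are all assumptions made for illustration, and real flow cytometry data would normally be handled with a dedicated TDA library.

```python
import math
from itertools import combinations


def h0_persistence(points):
    """Return the scales at which connected components merge (die).

    Every point is born as its own component at scale 0. Edges are added
    in order of increasing length, as in a Vietoris-Rips filtration, and
    each edge that joins two different components records one death.
    """
    n = len(points)
    parent = list(range(n))

    def find(i):
        # Follow parent links to the root, compressing the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    edges = sorted(
        (math.dist(points[i], points[j]), i, j)
        for i, j in combinations(range(n), 2)
    )
    deaths = []
    for length, i, j in edges:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            parent[root_i] = root_j
            deaths.append(length)
    return deaths


def count_components(deaths, n_points, scale):
    """Number of connected components when edges up to `scale` are included."""
    return n_points - sum(1 for d in deaths if d <= scale)


# Hypothetical toy data: two well-separated pairs of points on a line.
cloud = [(0.0, 0.0), (1.0, 0.0), (10.0, 0.0), (11.0, 0.0)]
deaths = h0_persistence(cloud)
components_at_2 = count_components(deaths, len(cloud), 2.0)
```

In this toy example the two pairs merge internally at scale 1 and with each other at scale 9, so two components persist over a long range of scales. Long-lived features of this kind are what a persistence diagram or persistence image summarises.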

Thu, 03 Sep 2020

16:00 - 17:00

Topological representation learning

Michael Moor
(ETH Zurich)
Abstract

Topological features as computed via persistent homology offer a non-parametric approach to robustly capture multi-scale connectivity information of complex datasets. This has started to gain attention in various machine learning applications. Conventionally, in topological data analysis, this method has been employed as an immutable feature descriptor in order to characterize topological properties of datasets. In this talk, however, I will explore how topological features can be directly integrated into deep learning architectures. This allows us to impose differentiable topological constraints for preserving the global structure of the data space when learning low-dimensional representations.

Thu, 17 Sep 2020

16:00 - 18:00
Virtual
Fri, 04 Sep 2020

15:00 - 16:00
Virtual

Geometric Fusion via Joint Delay Embeddings

Elchanan Solomon
(Duke University)
Abstract

This talk is motivated by the following question: "how can one reconstruct the geometry of a state space given a collection of observed time series?" A well-studied technique for metric fusion is Similarity Network Fusion (SNF), which works by mixing random walks. However, SNF behaves poorly in the presence of correlated noise, and always reconstructs an intrinsic metric. We propose a new methodology based on delay embeddings, together with a simple orthogonalization scheme that uses the tangency data contained in delay vectors. This method shows promising results for some synthetic and real-world data. The authors suspect that there is a theorem or two hiding in the background -- wild speculation by audience members is encouraged. 
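
For readers unfamiliar with delay embeddings, here is a minimal sketch of the basic construction, assuming uniformly sampled scalar time series. The names `delay_embed` and `joint_delay_embed`, and the choice to fuse several series by simple concatenation of their delay vectors, are illustrative assumptions; the orthogonalization scheme mentioned in the abstract is not reproduced here.

```python
def delay_embed(series, dim, lag):
    """Map a scalar time series to delay vectors.

    The vector at time t is (x[t], x[t + lag], ..., x[t + (dim - 1) * lag]).
    """
    n_vectors = len(series) - (dim - 1) * lag
    if n_vectors <= 0:
        raise ValueError("series too short for this dim and lag")
    return [
        tuple(series[t + k * lag] for k in range(dim))
        for t in range(n_vectors)
    ]


def joint_delay_embed(series_list, dim, lag):
    """Concatenate the delay vectors of several simultaneously observed series.

    This is an assumed, naive way of fusing observations: each time index
    gets one long vector built from all series.
    """
    embedded = [delay_embed(s, dim, lag) for s in series_list]
    n_vectors = min(len(e) for e in embedded)
    return [
        sum((e[t] for e in embedded), ())  # tuple concatenation
        for t in range(n_vectors)
    ]


# Hypothetical toy observations of the same underlying system.
single = delay_embed([0, 1, 2, 3, 4], dim=3, lag=1)
joint = joint_delay_embed([[0, 1, 2, 3], [10, 11, 12, 13]], dim=2, lag=1)
```

Each delay vector acts as a point in a reconstructed state space, so distances between delay vectors give one candidate geometry to compare across the observed series.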
