Network construction methodology based on distance correlation without exogenous information

3 December 2019
Javier Pardo Díaz

We aim to generate gene coexpression networks from gene expression data. In our networks, nodes represent genes and edges depict high positive correlation in their expression across different samples. Methods based on Pearson correlation are the most commonly used to generate gene coexpression networks. We propose the use of distance correlation as an effective alternative to Pearson correlation when constructing gene expression networks. Our methodology pipeline includes a thresholding step which allows us to discriminate which pairs of genes are coexpressed. We select the value of the threshold parameter by studying the stability of the generated network, rather than relying on exogenous biological information known a priori.