Improved gene co-expression network quality through expression dataset down-sampling and network aggregation

Liesecke, Franziska; Verg�s,; De Craene, Johan-Owen; Besseau, Sébastien; Courdavault, Vincent; Clastre, Marc; Vergès, Valentin; Papon, Nicolas; Giglioli-Guivarc’h, Nathalie; Glévarec, Gaëlle; Gl�varec, G.; Pichon, Olivier; Dugé de Bernonville, Thomas

Published in

Nature Research, Scientific Reports, 1(9), 2019

DOI: 10.1038/s41598-019-50885-8

Tools

Export citation

Search in Google Scholar

Improved gene co-expression network quality through expression dataset down-sampling and network aggregation

Journal article published in 2019 by Franziska Liesecke, Verg�s, Johan-Owen De Craene

, Sébastien Besseau, Vincent Courdavault

, Marc Clastre, Valentin Vergès, Nicolas Papon

, Nathalie Giglioli-Guivarc’h, Gaëlle Glévarec, G. Gl�varec, Olivier Pichon, Thomas Dugé de Bernonville

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving forbidden

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

AbstractLarge-scale gene co-expression networks are an effective methodology to analyze sets of co-expressed genes and discover new gene functions or associations. Distances between genes are estimated according to their expression profiles and are visualized in networks that may be further partitioned to reveal communities of co-expressed genes. Creating expression profiles is now eased by the large amounts of publicly available expression data (microarrays and RNA-seq). Although many distance calculation methods have been intensively compared and reviewed in the past, it is unclear how to proceed when many samples reflecting a wide range of different conditions are available. Should as many samples as possible be integrated into network construction or be partitioned into smaller sets of more related samples? Previous studies have indicated a saturation in network performances to capture known associations once a certain number of samples is included in distance calculations. Here, we examined the influence of sample size on co-expression network construction using microarray and RNA-seq expression data from three plant species. We tested different down-sampling methods and compared network performances in recovering known gene associations to networks obtained from full datasets. We further examined how aggregating networks may help increase this performance by testing six aggregation methods.

Published in

Links

Tools

Improved gene co-expression network quality through expression dataset down-sampling and network aggregation

Abstract