Dissemin is shutting down on January 1st, 2025

Published in

Oxford University Press (OUP), Bioinformatics, 22(27), p. 3135-3141

DOI: 10.1093/bioinformatics/btr528

Links

Tools

Export citation

Search in Google Scholar

Distance-Based Differential Analysis of Gene Curves

Journal article published in 2011 by Christopher Minas, Simon J. Waddell ORCID, Giovanni Montana
This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

MOTIVATION: Time course gene expression experiments are performed to study time-varying changes in mRNA levels of thousands of genes. Statistical methods from functional data analysis (FDA) have recently gained popularity for modelling and exploring such time courses. Each temporal profile is treated as the realization of a smooth function of time, or curve, and the inferred curve becomes the basic unit of statistical analysis. The task of identifying genes with differential temporal profiles then consists of detecting statistically significant differences between curves, where such differences are commonly quantified by computing the area between the curves or the l₂ distance. RESULTS: We propose a general test statistic for detecting differences between gene curves, which only depends on a suitably chosen distance measure between them. The test makes use of a distance-based variance decomposition and generalizes traditional MANOVA tests commonly used for vectorial observations. We also introduce the visual l₂ distance, which is shown to capture shape-related differences in gene curves and is robust against time shifts, which would otherwise inflate the traditional l₂ distance. Other shape-related distances, such as the curvature, may carry biological significance. We have assessed the comparative performance of the test on realistically simulated datasets and applied it to human immune cell responses to bacterial infection over time.