Published in

Springer Verlag, Lecture Notes in Computer Science, p. 246-251

DOI: 10.1007/11676935_30

Links

Tools

Export citation

Search in Google Scholar

NEC for Gene Expression Analysis

This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Aim of this work is to apply a novel comprehensive machine learning tool for data mining to preprocessing and interpretation of gene express ion data. Furthermore, some visualization facilities are provided. The data mining fr ame- work consists of two main parts: preprocessing and clustering-agglomerating phases. To the first phase belong a noise filtering procedure and a non -linear PCA Neural Network for feature extraction. The second phase is used to accomplish an unsupervised clustering based on a hierarchy of two approaches: a Probabilistic Principal Surfaces to obtain the rough regions of interesting points and a F isher- Negentropy information based approach to agglomerate the regions previously found in order to discover substructures present in the data. Experiments on gene microarray data are made. Several experiments are shown varying the threshold, needed by the agglomerative clustering, to understand the structure of th e ana- lyzed data set.