Dubitzky/Data Mining Techniques in Grid Computing Environments, p. 141-163
DOI: 10.1002/9780470699904.ch9
Full text: Download
The bioinformatics data landscape confronts scientists with significant problems when performing data analyses. The nature of these analyses is, in part, driven by the data landscape. This raises issues in managing the scientific process of in silico experimentation in bioinformatics. The myGrid project has addressed these issues through workflows. Although raising some issues of their own, workflows have allowed scientists to effectively traverse the bioinformatics landscape. The high-throughput nature of workflows, however, has forced us to move from a task of data gathering to data gathering and management. Utilizing workflows in this manner has enabled a systematic, unbiased, and explicit approach that is less susceptible to premature triage. This has profoundly changed the nature of bioinformatics analysis. Taverna is illustrated through an example from the study of trypanosomiasis resistance in the mouse model. In this study novel biological results were obtained from traversing the bioinformatics landscape with workflow.