Taylor and Francis Group, Internet Mathematics, 4(7), p. 255-273
DOI: 10.1080/15427951.2011.604559
Full text: Download
Partitioning biological data objects into groups such that the objects within the groups share common traits is a longstanding challenge in computational biology. ¶ Recently, we developed and established transitivity clustering, a partitioning approach based on weighted transitive graph projection that utilizes a single similarity threshold as density parameter. In previous publications, we concentrated on the graphical user interface and on concrete biomedical application protocols. Here, we contribute the following theoretical considerations: (1) We provide proofs that the average similarity between objects from the same cluster is above the user-given threshold and that the average similarity between objects from different clusters is below the threshold. (2) We extend transitivity clustering to an overlapping clustering tool by integrating two new approaches. (3) We demonstrate the power of transitivity clustering for protein-complex detection. We evaluate our approaches against others by utilizing gold-standard data that was previously used by Brohée et al. for reviewing existing bioinformatics clustering tools. ¶ The extended version of this article is available online at http://transclust.mpi-inf. mpg.de.