Oxford University Press (OUP), Bioinformatics, 10(29), p. 1268-1274
DOI: 10.1093/bioinformatics/btt149
Motivation: Analysis of millions of pyro-sequences is currently playing a crucial role in the advance of environmental microbiology. Taxonomy-independent, i.e. unsupervised, clustering of these sequences is essential for the definition of Operational Taxonomic Units. For this application, reproducibility and robustness should be the most sought-after qualities, but they have thus far largely been overlooked.