Oxford University Press (OUP), Bioinformatics, 3(29), p. 391-392
DOI: 10.1093/bioinformatics/bts684
Full text: Download
Summary: READSCAN is a highly scalable parallel program to identify non-host sequences (of potential pathogen origin) and estimate their genome relative abundance in high-throughput sequence datasets. READSCAN accurately classified human and viral sequences on a 20.1 million reads simulated dataset in