scAIDE: clustering of large-scale single-cell RNA-seq data reveals putative and rare cell types

Xie, Kaikun; Huang, Yu; Zeng, Feng; Liu, Zehua; Chen, Ting

Published in

Oxford University Press, NAR Genomics and Bioinformatics, 4(2), 2020

DOI: 10.1093/nargab/lqaa082

Tools

Export citation

Search in Google Scholar

scAIDE: clustering of large-scale single-cell RNA-seq data reveals putative and rare cell types

Journal article published in 2020 by Kaikun Xie

, Yu Huang, Feng Zeng, Zehua Liu, Ting Chen

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

Abstract Recent advancements in both single-cell RNA-sequencing technology and computational resources facilitate the study of cell types on global populations. Up to millions of cells can now be sequenced in one experiment; thus, accurate and efficient computational methods are needed to provide clustering and post-analysis of assigning putative and rare cell types. Here, we present a novel unsupervised deep learning clustering framework that is robust and highly scalable. To overcome the high level of noise, scAIDE first incorporates an autoencoder-imputation network with a distance-preserved embedding network (AIDE) to learn a good representation of data, and then applies a random projection hashing based k-means algorithm to accommodate the detection of rare cell types. We analyzed a 1.3 million neural cell dataset within 30 min, obtaining 64 clusters which were mapped to 19 putative cell types. In particular, we further identified three different neural stem cell developmental trajectories in these clusters. We also classified two subpopulations of malignant cells in a small glioblastoma dataset using scAIDE. We anticipate that scAIDE would provide a more in-depth understanding of cell development and diseases.

Published in

Links

Tools

scAIDE: clustering of large-scale single-cell RNA-seq data reveals putative and rare cell types

Abstract