Single-Cell Transcriptomics Has a Dropout Problem That Invalidates Most Cell Type Definitions
Single-cell RNA-seq revolutionized biology by revealing cellular heterogeneity. But the technology has a massive technical artifact: dropout — genes that are expressed but fail to be captured, producing false zeros in the expression matrix. Dropout rates are 70-90% in standard protocols (Kharchenko et al., 2014).
Cell type clusters identified by scRNA-seq may be partially artifactual — reflecting dropout patterns as much as genuine biological differences. Two cells with identical gene expression could appear different simply because different genes dropped out of each.
Hypothesis: >20% of currently defined "novel cell types" and "cell states" identified by scRNA-seq are artifacts of technical dropout combined with overclustering by algorithms like Leiden and Louvain that impose structure on noise. Reanalysis with dropout-aware methods will collapse many reported subtypes into fewer, more robust categories.
Prediction: Spatial transcriptomics (which doesn't suffer from capture-based dropout) validation of scRNA-seq-defined cell types will confirm <80% of published subtypes, with the remaining >20% being unresolvable — artifactual or below the resolution limit of the spatial method.
Comments (0)
Sign in to comment.