What question did this study set out to answer?

This research aims to quantify cellular abnormality by utilizing reference-based annotation uncertainty in single-cell RNA sequencing.

June 2, 2026Open Access

AnnQ: reference-based quantification of cellular abnormality at single-cell resolution

Key Points

This research aims to quantify cellular abnormality by utilizing reference-based annotation uncertainty in single-cell RNA sequencing.
Developed AnnQ, a Python framework for analyzing annotation uncertainty features.
Extracted features including confidence, confidence gap, admixture ratio, and entropy from cell type assignments.
Computed out-of-reference (OOR) scores to measure deviation from reference populations.
AnnQ detected aberrant cellular states not identified by conventional clustering analysis.
The application of OOR scores revealed insights in genetic perturbation experiments, enhancing the understanding of cellular deviations.

Abstract

Reference-based annotation tools have become standard for cell type assignment in single-cell RNA sequencing, leveraging large-scale atlases to transfer labels to new datasets. However, a substantial fraction of cells often receive uncertain or ambiguous annotations-low confidence scores, competing label probabilities, or high entropy across cell types. These cells are typically treated as technical artifacts and filtered out, yet in perturbation experiments they may represent the biologically interesting deviations that investigators seek to identify. We present AnnQ (Annotation Quantification of cellular identity uncertainty), a Python framework that repurposes annotation uncertainty as a quantitative measure of cellular abnormality. AnnQ extracts uncertainty-aware features from probabilistic cell type assignments-including confidence, confidence gap, admixture ratio, and entropy-and computes an out-of-reference (OOR) score measuring each cell's deviation from a reference population in multivariate uncertainty space. Applying AnnQ to genetic perturbation and drug resistance datasets, we show that OOR scores detect aberrant cellular states that are not resolved by conventional clustering or differential abundance analyses. AnnQ provides a complementary approach for characterizing transitional and abnormal cell states at single-cell resolution. AnnQ is implemented in Python, and its source code and documentation are available on https://github.com/joonan-lab/AnnQ.git.

AnnQ: reference-based quantification of cellular abnormality at single-cell resolution

Key Points

Abstract

Cite This Study