Abstract Identifying how somatic variants reshape gene expression at single-cell resolution is essential for understanding cancer initiation and evolution, yet joint measurement of DNA and RNA in the same cell remains technically demanding. Single-cell RNA sequencing (scRNA-seq) captures rich transcriptional heterogeneity but typically discards variant-level information or treats it as bulk signal. We developed scVarID, a transcript-aware framework that maps externally called somatic variants from exome or panel sequencing onto single-cell transcriptomes, generating per-cell, per-variant matrices of reference and alternate allele counts across expressed molecules. To benchmark scVarID, we leveraged HG002, a reference individual with a gold-standard genomic variant set and single-cell long-read RNA sequencing data. Mapping curated variants onto scRNA-seq reads demonstrated high recovery of DNA variants in RNA and revealed distinct allele preferences in HLA class I and class II genes at single-cell resolution. Taking advantage of long-read coverage, scVarID further inferred haplotypes for expressed HLA alleles, phasing allele-specific expression patterns to individual cells and states. We then applied scVarID to colorectal cancer cohorts with matched tumor/normal exomes and scRNA-seq, confirming transcriptional activity for many somatic variants and observing similar HLA-focused allelic imbalance in immune and epithelial compartments, including rare normal epithelial subpopulations with skewed HLA-A ratios consistent with early disruption of antigen presentation. These results establish scVarID as a scalable approach for integrating externally called somatic variant profiles with single-cell expression programs in both reference and patient samples. By resolving genotype-phenotype relationships at cellular resolution, scVarID enables the discovery of subtle perturbations in immune-surveillance pathways and other cancer-relevant processes that are invisible to bulk sequencing, and provides a general framework for mapping evolutionary trajectories and high-risk cell populations in precision oncology. Citation Format: Dongkwan Shin, Juyeon Cho, Jonghyun Lee, Seok-Won Jang. scVarID: Linking somatic variants to single-cell transcriptomes to reveal early cancer-associated cell states abstract. In: Proceedings of the American Association for Cancer Research Annual Meeting 2026; Part 2 (Late-Breaking, Clinical Trial, and Invited Abstracts) ; 2026 Apr 17-22; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2026;86 (8Suppl): Abstract nr LB170.
Shin et al. (Fri,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: