Abstract Lung cancer is a major cause of cancer-related mortality worldwide. Approximately 10%-20% of lung cancers occur in patients with no smoking history and are mostly lung adenocarcinoma (LUAD). Studies in mouse models showed that LUAD cases arising from alveolar epithelial type I (AT1) cells transitioning to alveolar type II (AT2) are less aggressive than those that originate directly from type 2 (AT2), but the origin of human LUAD remains unclear. Leveraging the bulk methylation array data from the EAGLE and Sherlock-Lung studies, two large multi-omics studies of lung cancer in people with and without smoking history, respectively, we infer the cells-of-origin for early-stage tumor samples and examine their associations with smoking history and grade. We impute cells-of-origin from bulk methylation array data of tumor and normal samples collected from 388 stage I LUAD cases without smoking history and 238 with smoking history. For the methylation reference profiles of normal lung cell types, we combined a published normal human cell methylation data set (Loyfer et al. 2023) from whole-genome bisulfite sequencing (WGBS) as well as purified AT1 and AT2 enzymatically converted methylation profiles sequenced in-house. We then perform principled feature engineering to select methylation sites that best distinguish cell types and represent each cell type in a balanced manner. To estimate the proportion of each lung cell type in the samples, we employ a combinatorial optimization strategy by formulating and solving the deconvolution as a quadratic program. The inferred compositions of tumor origins are then evaluated in relation to smoking history and tumor grade based on the International Association for the Study of Lung Cancer (IASLC) grading system. The tumor samples of LUAD cases without smoking history (n=238) have 11% higher AT1 presence compared to those with smoking history (n=388, p0.05). Among cases without smoking history and with coherent grades, the tumor samples with well or moderate grade (n=96) have on average 13% higher AT1 presence than those with poor grade (n=102, p0.05). Notably, tumor samples of stage I LUAD cases with no smoking history have the highest overall AT1 presence among all groups (on average 26%). All normal samples, regardless of smoking history and grade, have no significant difference in AT1 presence (on average 10%). Our study establishes a principled deconvolution framework to infer tumor origins in LUAD using DNA methylation data. Mirroring findings in mouse models, our results indicate that AT1 cells may generate LUAD in humans, and that this appears to occur more frequently in patients without smoking history and with better tumor grades. Our work adds a new dimension to the understanding of early-stage LUAD and sheds light on patient stratification, prognostic evaluation, and therapeutic targeting of tumor origin-specific vulnerabilities. Citation Format: Xuan Cindy Li, Diego Almanza, Phuc Huu Hoang, Thi-Van-Trinh Tranh, Nicholas H. Juul, Maximilian Diehn, Tushar J. Desai, Maria Teresa Landi. DNA methylation differentiates early-stage lung adenocarcinoma cells-of-origin with distinct smoking history and grade abstract. In: Proceedings of the American Association for Cancer Research Annual Meeting 2026; Part 1 (Regular Abstracts); 2026 Apr 17-22; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2026;86(7 Suppl):Abstract nr 1957.
Building similarity graph...
Analyzing shared references across papers
Loading...
Xuan Cindy Li
Diego Almanza
Phuc Huu Hoang
Cancer Research
Stanford University
National Cancer Institute
Building similarity graph...
Analyzing shared references across papers
Loading...
Li et al. (Fri,) studied this question.
www.synapsesocial.com/papers/69d1fde4a79560c99a0a4385 — DOI: https://doi.org/10.1158/1538-7445.am2026-1957
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: