What question did this study set out to answer?

Investigate how processed pseudogenes affect genomic analysis and variant reporting in a clinical context.

February 22, 2026

Processed pseudogenes and the potential impact on genomic analysis

Key Points

Investigate how processed pseudogenes affect genomic analysis and variant reporting in a clinical context.
Case series analysis of data complications due to processed pseudogenes
Assessment of genetic testing results in clinical diagnostics
Evaluation of genomic profiling techniques
Identified processed pseudogenes leading to false-positive results in genetic testing
Demonstrated impact on interpretation of copy number variants
Highlighted need for pseudogene-informed data analysis

Abstract

Processed pseudogenes arise in the human genome due to retrotransposition events, during which mRNA transcripts are reverse-transcribed and integrated back into the genome. This additional genetic material generally consists of coding sequences, without introns and promoter regions. Pseudogenes are generally considered to be non-functional and rarely cause genetic disease unless inserted into a genomic location which disrupts gene function. Due to mapping and annotation issues with homologous sequence to parent genes, the presence of processed pseudogenes as well as pseudogenes more broadly may lead to false attribution of a genetic variant in a pseudogene to the parent gene, and therefore false-positive results. In our laboratory, we have observed complications during data analysis in both somatic and germline genetic testing due to the presence of processed pseudogenes. In the clinical diagnostic setting, SMAD4 , SETD2 and B2M processed pseudogenes complicate the interpretation of both tumour comprehensive genomic profiling (CGP) and germline multiplex ligation-dependent probe amplification (MLPA) results, creating the risk of incorrectly reporting a clinically significant copy number variant in these genes. Here we present a case series demonstrating the effect of processed pseudogenes, highlighting the importance of pseudogene-informed interpretation when analysing genomic data.

Bookmark

Processed pseudogenes and the potential impact on genomic analysis

Key Points

Abstract

Cite This Study