Key points are not available for this paper at this time.
Long terminal repeat retrotransposons (LTR-RTs) are prevalent in plant genomes. The identification of LTR-RTs is critical for achieving high-quality gene annotation. Based on the well-conserved structure, multiple programs were developed for the de novo identification of LTR-RTs; however, these programs are associated with low specificity and high false discovery rates. Here, we report LTRᵣetriever, a multithreading-empowered Perl program that identifies LTR-RTs and generates high-quality LTR libraries from genomic sequences. LTRᵣetriever demonstrated significant improvements by achieving high levels of sensitivity (91%), specificity (97%), accuracy (96%), and precision (90%) in rice (Oryza sativa). LTRᵣetriever is also compatible with long sequencing reads. With 40k self-corrected PacBio reads equivalent to 4. 5× genome coverage in Arabidopsis (Arabidopsis thaliana), the constructed LTR library showed excellent sensitivity and specificity. In addition to canonical LTR-RTs with 5'-TG…CA-3' termini, LTRᵣetriever also identifies noncanonical LTR-RTs (non-TGCA), which have been largely ignored in genome-wide studies. We identified seven types of noncanonical LTRs from 42 out of 50 plant genomes. The majority of noncanonical LTRs are Copia elements, with which the LTR is four times shorter than that of other Copia elements, which may be a result of their target specificity. Strikingly, non-TGCA Copia elements are often located in genic regions and preferentially insert nearby or within genes, indicating their impact on the evolution of genes and their potential as mutagenesis tools.
Building similarity graph...
Analyzing shared references across papers
Loading...
Shujun Ou
Ning Jiang
PLANT PHYSIOLOGY
Michigan State University
Building similarity graph...
Analyzing shared references across papers
Loading...
Ou et al. (Tue,) studied this question.
www.synapsesocial.com/papers/69d76666d55abd294a48f43f — DOI: https://doi.org/10.1104/pp.17.01310