Key points are not available for this paper at this time.
DNA variation analysis has become indispensable in many aspects of modern biomedicine, most prominently in the comparison of normal and tumor samples. Thousands of samples are collected in local sequencing efforts and public databases requiring highly scalable, portable, and automated workflows for streamlined processing. Here, we present nf-core/sarek 3, a well-established, comprehensive variant calling and annotation pipeline for germline and somatic samples. It is suitable for any genome with a known reference. We present a full rewrite of the original pipeline showing a significant reduction of storage requirements by using the CRAM format and runtime by increasing intra-sample parallelization. Both are leading to a 70% cost reduction in commercial clouds enabling users to do large-scale and cross-platform data analysis while keeping costs and CO
Building similarity graph...
Analyzing shared references across papers
Loading...
Friederike Hanssen
M. Garcia
Lasse Folkersen
NAR Genomics and Bioinformatics
Yale University
Karolinska Institutet
Technical University of Munich
Building similarity graph...
Analyzing shared references across papers
Loading...
Hanssen et al. (Thu,) studied this question.
www.synapsesocial.com/papers/68e7055ab6db64358767f933 — DOI: https://doi.org/10.1093/nargab/lqae031