This dataset presents the high-quality whole-genome sequence of Deinococcus wulumuqiensis FBCC-B5220, a bacterium exhibiting extracellular protease activity isolated from freshwater sediment in Korea. The genome was assembled using a hybrid approach combining Illumina and PacBio platforms, resulting in five contigs totaling 3,458,218 bp with a G+C content of 66.0%. Functional annotation identified 3,239 protein-coding sequences, including a specialized repertoire of 45 protease-related genes, such as zinc metalloproteases, ATP-dependent proteases, and serine proteases. Phylogenomic and 16S rRNA gene analyses confirmed its taxonomic identity as D. wulumuqiensis , with ANI (97.2%) and dDDH (78.2%) values. The genome sequence and annotation files are available in the NCBI database under the accession number JBSRRC000000000. This dataset provides a high-quality genomic reference for comparative studies within the genus Deinococcus and serves as a primary resource for identifying protease-related genes with potential industrial relevance.
Choi et al. (Sun,) studied this question.