Excessive application of chemical fertilisers has led to soil phosphorus immobilisation and aquatic eutrophication, making the development of highly efficient acid/neutral phosphatases crucial for sustainable phosphorus utilisation. In this study, we systematically investigated strain J2, which was isolated from phosphate-contaminated soil in Laoshan, Nanjing, China. 16S rRNA gene sequence analysis identified this strain as Acinetobacter nosocomialis J2, with 99.78% sequence similarity. Whole-genome sequencing generated a 3.83 Mb genome with a GC content of 38.59%, revealing multiple phospho-metabolism-related enzyme genes, including phospholipase C and α/β-hydrolases. A large language model–based protein representation learning strategy was employed to mine acid/neutral phosphatase genes from the genome, in which the model learned contextual and functional features from known phosphatase sequences and was used to identify semantically similar genes within the J2 genome. This approach predicted nine phosphatase candidate sequences, including AnACPase, a putative acid/neutral phosphatase. Biochemical characterisation showed that AnACPase exhibits optimal activity at pH 6.0 and 50 °C, with a Km value of 0.2454 mmol/L for the p-NPP substrate, indicating high substrate affinity. Mn2+ and Ni2+ significantly enhanced enzyme activity, whereas Cu2+ and Zn2+ strongly inhibited it. Soil remediation experiments further validated the application potential of AnACPase, which solubilised 171.56 mg/kg of phosphate within seven days. Overall, this study highlights the advantages of deep learning-assisted genome mining for functional enzyme discovery and provides a novel technological pathway for the bioremediation of phosphorus-polluted soils.
Chen et al. (Wed,) studied this question.