What question did this study set out to answer?

Can a deep neural network (DNN) model predict protein properties, such as Coulomb and solvation energies, from topological and electrostatic features?

March 25, 2026 | Open Access

A DNN Biophysics Model with Topological and Electrostatic Features

Key Points

Developed a DNN-based model that predicts protein properties, such as Coulomb and solvation energies, from topological and electrostatic features.
Used a deep neural network architecture to map these features to the target properties.
Generated topological features with element-specific persistent homology (ESPH) applied to a selection of heavy or carbon atoms.
Generated electrostatic features with a novel Cartesian treecode, which adds the underlying electrostatic interactions to improve predictions.
Trained models on over 17,000 proteins for Coulomb energy and over 4,000 for solvation energy.
Achieved an MSE of approximately 0.024 and an R² of 0.976 for Coulomb energy prediction.
Achieved an MSE of approximately 0.064 and an R² of 0.926 for solvation energy prediction.
Demonstrated high accuracy and efficiency of the model in predicting protein-related properties.
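
The error measures quoted above (MSE and R², together with the MAPE reported in the abstract) can be computed as in the following minimal sketch. It uses the standard textbook definitions; the function name `regression_metrics` and the sample values are illustrative assumptions, and the study's own evaluation code and target normalization are not described here.

```python
import numpy as np


def regression_metrics(y_true, y_pred):
    """Return MSE, MAPE (as a fraction), and R^2 for two 1-D sequences.

    Hypothetical helper for illustration; not taken from the study.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    residual = y_true - y_pred

    # Mean squared error.
    mse = np.mean(residual ** 2)

    # Mean absolute percentage error; assumes no true value is zero.
    mape = np.mean(np.abs(residual / y_true))

    # Coefficient of determination: 1 - SS_res / SS_tot.
    ss_res = np.sum(residual ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    r2 = 1.0 - ss_res / ss_tot

    return {"mse": mse, "mape": mape, "r2": r2}


if __name__ == "__main__":
    # Made-up numbers, only to show the call.
    print(regression_metrics([1.0, 2.0, 3.0, 4.0], [1.1, 1.9, 3.2, 3.8]))
```

MSE depends on the scale of the target, so values such as 0.024 are only comparable across models when the targets are scaled the same way; R² and MAPE are scale-free.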

Abstract

In this project, we present a deep neural network (DNN)-based biophysics model that uses multiscale and uniform topological and electrostatic features to predict protein properties, such as Coulomb energies or solvation energies. The topological features are generated using element-specific persistent homology (ESPH) on a selection of heavy or carbon atoms. The electrostatic features are generated using a novel Cartesian treecode, which adds underlying electrostatic interactions to further improve the model prediction. These features are uniform in number for proteins of varying sizes; therefore, the widely available protein structure databases can be used to train the network. These features are also multiscale, allowing users to balance resolution and computational cost. The optimal model trained on more than 17,000 proteins for predicting Coulomb energy achieves an MSE of approximately 0.024, MAPE of 0.073, and R² of 0.976. Meanwhile, the optimal model trained on more than 4,000 proteins for predicting solvation energy achieves an MSE of approximately 0.064, MAPE of 0.081, and R² of 0.926, showing the efficiency and fidelity of these features in representing the protein structure and force field. The feature generation algorithms also have the potential to serve as general tools for assisting machine learning-based prediction of protein properties and functions.
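
To make the idea of features that are "uniform in number for proteins of varying sizes" concrete, the sketch below turns a persistence barcode of any length into a fixed-length vector by counting how many bars are alive in each filtration bin. This is one common way to vectorize barcodes and is an assumption for illustration, not the authors' featurization; the function name `barcode_to_fixed_vector`, the cutoff `r_max`, and the bin count `n_bins` are all hypothetical.

```python
import numpy as np


def barcode_to_fixed_vector(intervals, r_max=10.0, n_bins=20):
    """Map (birth, death) bars to an n_bins-long vector of per-bin bar counts.

    Hypothetical helper: the output length depends only on n_bins,
    never on how many bars (or atoms) the input structure produced.
    """
    edges = np.linspace(0.0, r_max, n_bins + 1)
    lo, hi = edges[:-1], edges[1:]
    counts = np.zeros(n_bins)
    for birth, death in intervals:
        # Clip bars that never die (death = inf) at the cutoff radius.
        death = min(death, r_max)
        # A bar is alive in bin [lo, hi) if it is born before the bin
        # ends and dies after the bin starts.
        counts += (birth < hi) & (death > lo)
    return counts


if __name__ == "__main__":
    small = [(0.0, 1.0), (0.5, 2.5)]
    large = small + [(1.0, float("inf")), (2.0, 3.5), (0.2, 0.4)]
    # Both barcodes yield vectors of the same length.
    print(barcode_to_fixed_vector(small, r_max=4.0, n_bins=4))
    print(barcode_to_fixed_vector(large, r_max=4.0, n_bins=4))
```

In this sketch `n_bins` plays the role of a resolution knob: more bins give a finer description at the cost of a larger input layer, which is the kind of trade-off between resolution and computational cost that the abstract attributes to multiscale features.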
