March 3, 2026

Robust symbolic regression for dynamical system identification

Key Points

The Symbolic Distribution Flow Learner (SDFL) is a white-box method that identifies the dynamics of sparsely observed systems while keeping the recovered model interpretable.
Using a Wasserstein-based loss, SDFL recovers models of complex systems whose probability distribution flows are governed by ordinary differential equations.
The paper provides theoretical guarantees on the number of snapshots required to reach a given level of fidelity in model discovery.
Numerical tests on Kuramoto networks and single-cell RNA sequencing trajectory data show that SDFL is competitive with state-of-the-art methods.

Abstract

Real-world complex systems often lack high-fidelity physical descriptions and are typically only partially observable. Learning the dynamics of such systems is a challenging and ubiquitous problem, encountered in diverse critical applications that require interpretability and qualitative guarantees. Our paper addresses this problem in the case of sparsely observed probability distribution flows governed by ODEs. Specifically, we devise a white-box approach, dubbed Symbolic Distribution Flow Learner (SDFL), that combines symbolic search with a Wasserstein-based loss function, resulting in a robust model-recovery scheme that naturally copes with partial observability. Additionally, we furnish the proposed framework with theoretical guarantees on the number of snapshots required to achieve a given level of fidelity in model discovery. We illustrate the performance of the proposed scheme on the prototypical problem of Kuramoto networks and on a standard benchmark of single-cell RNA sequencing trajectory data. The numerical experiments demonstrate that SDFL performs competitively with the state of the art.
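
To make the idea of a Wasserstein-based loss over distribution snapshots concrete, here is a minimal sketch. It is not the SDFL implementation: it assumes a one-dimensional state, equal-size samples at every snapshot, a fixed-step RK4 integrator, and a tiny hand-written candidate set in place of a real symbolic search. The names wasserstein1_1d, push_forward, snapshot_loss, and the candidate dictionary are all hypothetical.

```python
import numpy as np

def wasserstein1_1d(a, b):
    """Empirical 1-Wasserstein distance between two equal-size 1D samples.

    In 1D the optimal coupling matches sorted samples, so the distance
    is the mean absolute difference of the order statistics.
    """
    a, b = np.sort(a), np.sort(b)
    return float(np.mean(np.abs(a - b)))

def push_forward(f, x0, t_end, n_steps=200):
    """Transport samples x0 along dx/dt = f(x) up to t_end with fixed-step RK4."""
    x = np.array(x0, dtype=float)
    h = t_end / n_steps
    for _ in range(n_steps):
        k1 = f(x)
        k2 = f(x + 0.5 * h * k1)
        k3 = f(x + 0.5 * h * k2)
        k4 = f(x + h * k3)
        x = x + (h / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
    return x

def snapshot_loss(f, x0, snapshots):
    """Sum of Wasserstein distances between simulated and observed snapshots."""
    return sum(wasserstein1_1d(push_forward(f, x0, t), s) for t, s in snapshots)

# Synthetic data (assumption): the true dynamics are dx/dt = -x. Each snapshot
# is an independent draw, so individual trajectories are never observed.
rng = np.random.default_rng(0)
n = 500
x0 = rng.normal(2.0, 0.5, n)
times = [0.5, 1.0, 1.5]
snapshots = [(t, rng.normal(2.0, 0.5, n) * np.exp(-t)) for t in times]

# Hypothetical candidate expressions standing in for a symbolic search.
candidates = {
    "-x": lambda x: -x,
    "-2x": lambda x: -2.0 * x,
    "-x**3": lambda x: -x**3,
}
losses = {name: snapshot_loss(f, x0, snapshots) for name, f in candidates.items()}
best = min(losses, key=losses.get)
print(best, losses)
```

Because the loss compares distributions rather than paired trajectories, it only needs unordered samples at each observation time, which is what makes this kind of objective a natural fit for sparsely observed snapshots.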
