February 23, 2024Open Access

Linear Dynamics-embedded Neural Network for Long-Sequence Modeling

Key Points

Key points are not available for this paper at this time.

Abstract

The trade-off between performance and computational efficiency in long-sequence modeling becomes a bottleneck for existing models. Inspired by the continuous state space models (SSMs) with multi-input and multi-output in control theory, we propose a new neural network called Linear Dynamics-embedded Neural Network (LDNN). SSMs' continuous, discrete, and convolutional properties enable LDNN to have few parameters, flexible inference, and efficient training in long-sequence tasks. Two efficient strategies, diagonalization and 'Disentanglement then Fast Fourier Transform (FFT) ', are developed to reduce the time complexity of convolution from O (LNH\L, N\) to O (LN \H, L\). We further improve LDNN through bidirectional noncausal and multi-head settings to accommodate a broader range of applications. Extensive experiments on the Long Range Arena (LRA) demonstrate the effectiveness and state-of-the-art performance of LDNN.

Read Full Paperexternally

AI에게 질문

Bookmark

View Full Paper

Cite This Study

Liang et al. (Fri,) studied this question.

synapsesocial.com/papers/68e77f50b6db6435876f2da6 https://doi.org/https://doi.org/10.48550/arxiv.2402.15290

AI에게 질문

Bookmark

View Full Paper