What question did this study set out to answer?

The aim is to introduce KuraFormer as an efficient adapter utilizing Kuramoto oscillatory dynamics in pretrained Transformers.

March 15, 2026Open Access

KuraFormer: Oscillatory Dynamics as Parameter-Efficient Adapters for Pretrained Transformers

Key Points

The aim is to introduce KuraFormer as an efficient adapter utilizing Kuramoto oscillatory dynamics in pretrained Transformers.
KuraFormer employs oscillatory dynamics for iterative refinement of hidden representations.
Evaluation was conducted on the GSM8K dataset using Mistral-7B and LLaMA-3-8B.
The study analyzed integration steps for accuracy improvement without retraining.
KuraFormer reaches accuracy within 2.9–3.5 percentage points of LoRA with 18% fewer parameters.
A convergence window phenomenon was identified where accuracy fluctuates based on integration steps.
Warm-start initialization eliminated the convergence window, ensuring consistent accuracy across steps.

Abstract

We introduce KuraFormer, a parameter-efficient adapter that injects Kuramoto oscillatory dynamics into pretrained Transformers, enabling iterative refinement of hidden representations at inference time. Unlike LoRA, which adapts weights, KuraFormer adapts computation depth—the same trained adapter can be run for varying numbers of integration steps without retraining. We evaluate on GSM8K with Mistral-7B and LLaMA-3-8B, reporting two findings: (1) a convergence window phenomenon where accuracy improves then degrades with more steps, and (2) that warm-start initialization with integration schedules eliminates this window entirely, producing flat accuracy curves across 4–64 steps. KuraFormer reaches within 2.9–3.5pp of LoRA using 18% fewer parameters while offering variable-depth computation that weight-based adapters cannot provide.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Jesus Tabares Montilla (Fri,) studied this question.

synapsesocial.com/papers/69b606ea83145bc643d1d678 https://doi.org/https://doi.org/10.5281/zenodo.19007695

Bookmark

View Full Paper