What question did this study set out to answer?

The aim is to develop a reinforcement learning-based system for dynamic algorithm configuration in MaxSAT local search.

March 12, 2026 | Open Access

Learned Control Layers for MaxSAT Local Search: Dynamic Algorithm Configuration of Clause Weighting Parameters

Key Points

Developed a reinforcement learning-based system for dynamic algorithm configuration (DAC) in MaxSAT local search
Used a PPO controller to adjust four clause-weighting parameters in real time
Observed NuWLS solver state every 1,000 variable flips
Evaluated performance on generated partial MaxSAT benchmarks (3 seeds × 18 test instances)
Conducted statistical analysis using Wilcoxon tests
Achieved a 19.0% cost reduction compared to random control (p = 2.4 × 10⁻⁵)
Achieved a 10.4% cost reduction compared to the best hand-tuned static configuration (p = 0.007)
Demonstrated zero-shot transfer to 10× larger instances with significant results (p = 0.004)
Identified five structural insights about DAC for local search
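
The statistical comparison named in the key points can be illustrated with a small sketch. It assumes the costs are paired per test instance and uses a two-sided Wilcoxon signed-rank test with the normal approximation; the study may have used a different variant (exact, one-sided, or with corrections). The function names and the cost values are hypothetical and are not taken from the study.

```python
import math

def relative_cost_change(learned, baseline):
    """Mean per-instance relative change in cost; negative means the learned policy is cheaper."""
    return sum((l - b) / b for l, b in zip(learned, baseline)) / len(learned)

def wilcoxon_signed_rank(x, y):
    """Two-sided Wilcoxon signed-rank test via the normal approximation.

    Zero differences are dropped and tied absolute differences receive
    average ranks. No tie or continuity correction is applied to the
    variance, so the p-value is approximate for small samples.
    Returns (w_plus, p_value).
    """
    diffs = [a - b for a, b in zip(x, y) if a != b]
    n = len(diffs)
    if n == 0:
        raise ValueError("all paired differences are zero")
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j over a run of equal absolute differences.
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        avg_rank = (i + j) / 2 + 1  # average of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    mean = n * (n + 1) / 4
    sd = math.sqrt(n * (n + 1) * (2 * n + 1) / 24)
    z = (w_plus - mean) / sd
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return w_plus, p_value

# Made-up costs for illustration only; these are not results from the study.
learned = [90.0, 80.0, 70.0, 95.0, 60.0, 85.0, 75.0, 98.0]
static = [100.0] * 8
w_plus, p_value = wilcoxon_signed_rank(learned, static)
change = relative_cost_change(learned, static)
```

A signed-rank test suits this setting because both controllers are run on the same instances, so the per-instance differences are the natural unit of comparison and no normality assumption on the costs is required.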

Abstract

We introduce the first reinforcement learning (RL)-based dynamic algorithm configuration (DAC) system for MaxSAT local search. A proximal policy optimization (PPO) controller observes the NuWLS solver state every 1,000 variable flips and adjusts four clause-weighting parameters in real time. On generated partial MaxSAT benchmarks (3 seeds × 18 test instances), the learned policy reduces cost by 19.0% relative to random control (Wilcoxon p = 2.4 × 10⁻⁵) and by 10.4% relative to the best hand-tuned static configuration (p = 0.007). The policy discovers an explore-then-exploit noise schedule without explicit curriculum design. Zero-shot transfer to 10× larger instances remains significant (p = 0.004). We identify five structural insights about DAC for local search, including exploration parameter dominance, scale-dependent feature importance, and solver-specific policy non-transferability. All code, benchmarks, and experimental results are included.
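
The control loop described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `ToySolver` is a stand-in that does not implement NuWLS, the four parameter names in `WeightingParams` are hypothetical (the abstract does not name them), and `placeholder_policy` is a hand-written decaying-noise schedule standing in for the trained PPO policy.

```python
import random
from dataclasses import dataclass

CONTROL_INTERVAL = 1_000  # flips between controller decisions, as stated in the abstract

@dataclass
class WeightingParams:
    # Hypothetical names: the abstract says four clause-weighting
    # parameters are adjusted but does not name them.
    noise: float = 0.1
    weight_increment: float = 1.0
    smoothing_prob: float = 0.01
    weight_limit: float = 100.0

class ToySolver:
    """Stand-in for a local-search solver; it does not implement NuWLS."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.cost = 1000.0
        self.best_cost = self.cost
        self.flips = 0
        self.params = WeightingParams()

    def run(self, n_flips):
        for _ in range(n_flips):
            # Toy dynamics: a random walk with downward drift whose step
            # size grows with the noise parameter.
            step = self.rng.gauss(-0.01, 0.5 + self.params.noise)
            self.cost = max(0.0, self.cost + step)
            self.best_cost = min(self.best_cost, self.cost)
        self.flips += n_flips

    def observe(self):
        return {"flips": self.flips, "cost": self.cost, "best_cost": self.best_cost}

def placeholder_policy(obs, budget):
    """Hand-written explore-then-exploit schedule; a trained policy would replace this."""
    progress = obs["flips"] / budget
    return WeightingParams(noise=0.3 * (1.0 - progress))

def run_episode(budget=20_000, seed=0):
    solver = ToySolver(seed)
    noise_trace = []
    while solver.flips < budget:
        # Observe, choose parameters, then let the solver run one interval.
        solver.params = placeholder_policy(solver.observe(), budget)
        noise_trace.append(solver.params.noise)
        solver.run(CONTROL_INTERVAL)
    return solver.best_cost, noise_trace

best_cost, noise_trace = run_episode()
```

The point of the sketch is the structure: the solver runs uninterrupted for a fixed number of flips, and the controller only acts at interval boundaries, which keeps the per-flip overhead of the learned layer small.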

Cite This Study

Alex Li (Mon,) studied this question.

synapsesocial.com/papers/69b2582a96eeacc4fcec7817 https://doi.org/10.5281/zenodo.18924836
