What question did this study set out to answer?

The aim is to enhance code generation accuracy by optimising decoding strategies and constraints in large language models.

April 26, 2026 | Open Access

TreeCoder: Systematic Exploration and Optimisation of Decoding and Constraints for LLM Code Generation

Key Points

The authors introduce TreeCoder, a framework for exploring decoding strategies, constraints, and hyperparameters in LLM code generation.
Decoding configurations are explored and tuned systematically on open-source models such as CodeLlama, Mistral, and DeepSeek.
Syntactic and semantic correctness is enforced during decoding rather than through natural language prompts alone.
TreeCoder consistently improves code generation accuracy across Python, SQL, and Rust.
It often significantly outperforms the unconstrained baseline models.
Decoding strategies and constraint functions (such as style, syntax, and execution) are treated as first-class, optimisable components.

Abstract

Large language models (LLMs) have shown remarkable ability to generate code, yet their outputs often violate syntactic or semantic constraints when guided only through natural language prompts. We introduce TreeCoder, the most general and flexible framework to date for exploring decoding strategies, constraints, and hyperparameters in LLMs, and use it in code generation to enforce correctness and structure during decoding rather than relying on prompt engineering. TreeCoder represents decoding as a tree search over candidate programs, where both decoding strategies and constraint functions (such as style, syntax, and execution) are treated as first-class, optimisable components. This design enables systematic exploration and automatic tuning of decoding configurations using standard optimisation techniques. Experiments on Python, SQL and Rust show that TreeCoder consistently improves accuracy across open-source models such as CodeLlama, Mistral and DeepSeek, often significantly outperforming their unconstrained baselines.
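
To make the idea of decoding as a constrained tree search concrete, here is a minimal sketch. It is not TreeCoder's API: `toy_model`, `balanced_so_far`, `parses_as_expression`, and `constrained_beam_search` are hypothetical names invented for illustration, the "model" is a fixed token distribution rather than an LLM, and beam search is just one possible decoding strategy. The point is only that constraints are passed in as ordinary functions and the search parameters (`beam_width`, `max_steps`) are plain arguments that could be tuned.

```python
import ast
import math
from typing import Callable, List, Optional, Tuple

EOS = "<eos>"
Constraint = Callable[[str], bool]


def toy_model(prefix: str) -> List[Tuple[str, float]]:
    """Hypothetical stand-in for an LLM: fixed next-token log-probabilities."""
    probs = {")": 0.4, "1": 0.3, "+": 0.15, "(": 0.1, EOS: 0.05}
    return [(tok, math.log(p)) for tok, p in probs.items()]


def balanced_so_far(text: str) -> bool:
    """Prefix constraint: ')' may never close more '(' than were opened."""
    depth = 0
    for ch in text:
        depth += (ch == "(") - (ch == ")")
        if depth < 0:
            return False
    return True


def parses_as_expression(text: str) -> bool:
    """Completion constraint: a finished candidate must be a valid Python expression."""
    try:
        ast.parse(text, mode="eval")
        return True
    except SyntaxError:
        return False


def constrained_beam_search(
    model: Callable[[str], List[Tuple[str, float]]],
    prefix_constraints: List[Constraint],
    final_constraints: List[Constraint],
    beam_width: int = 3,
    max_steps: int = 5,
) -> Optional[str]:
    """Expand a tree of partial programs, pruning any node that violates a constraint."""
    beam: List[Tuple[float, str]] = [(0.0, "")]
    finished: List[Tuple[float, str]] = []
    for _ in range(max_steps):
        candidates: List[Tuple[float, str]] = []
        for score, text in beam:
            for tok, logp in model(text):
                if tok == EOS:
                    # A leaf is accepted only if every completion constraint holds.
                    if all(check(text) for check in final_constraints):
                        finished.append((score + logp, text))
                    continue
                child = text + tok
                # An inner node is kept only if every prefix constraint holds.
                if all(check(child) for check in prefix_constraints):
                    candidates.append((score + logp, child))
        # Keep the highest-scoring surviving children for the next level.
        beam = sorted(candidates, key=lambda item: item[0], reverse=True)[:beam_width]
    return max(finished, key=lambda item: item[0])[1] if finished else None


best = constrained_beam_search(toy_model, [balanced_so_far], [parses_as_expression])
print(best)
```

In this toy setting the most probable first token is `)`, which can never begin a valid expression, so decoding without constraints would start down a dead end. With the two constraint functions plugged in, the search only returns candidates that parse. Swapping in a different constraint, or sweeping `beam_width`, requires no change to the search loop, which is the sense in which such components can be treated as tunable.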
