What question did this study set out to answer?

The aim is to evaluate how natural language complexity affects the performance of Transformers in solving logical problems.

March 31, 2026Open Access

Scaffolding or Obstacle: Quantifying the Dual Role of Natural Language in Transformer-based Logic

Key Points

The aim is to evaluate how natural language complexity affects the performance of Transformers in solving logical problems.
Constructed a dataset of Boolean logic Abstract Syntax Trees (ASTs) with varying natural language complexity.
Trained multi-layer Transformers to solve the constructed Boolean logic tasks.
Performed mechanistic analysis to assess the role of natural language structures in the model's reasoning process.
Simple natural language structures enhance the reasoning capacity of Transformers.
Predictive accuracy of the model declines with increased complexity in natural language.
Complex and redundant language leads to greater reliance on deeper Feed-Forward Network layers for logical processing.

Abstract

Large Language Models (LLMs) have demonstrated strong capabilities in solving realworld logical problems. It is commonly assumed that, compared to formal logical languages,natural language increases the computational burden on Transformers due to its inherentambiguities and structural redundancy. However, our controlled experiments suggest a morenuanced picture. We find that simple natural language structures can act as useful ”thinkingtokens,” supporting the reasoning capacity of Transformers. The model’s predictive accuracydeclines primarily when the natural language becomes highly complex and unstructured.To systematically investigate this phenomenon, we construct a dataset of Boolean logicAbstract Syntax Trees (ASTs) with varying levels of natural language complexity and trainmulti-layer Transformers to solve them. Our preliminary mechanistic analysis suggests thatsimple linguistic structures may help the Attention mechanism capture the relationshipsbetween Boolean operators. In contrast, complex and noisy redundant words do not appear to provide the same scaffolding benefit, and are associated with a greater reliance ondeeper Feed-Forward Network (FFN) layers to process the logic. Overall, our study providescontrolled evidence for the role of natural language structure in Transformer-based logicalreasoning.

Scaffolding or Obstacle: Quantifying the Dual Role of Natural Language in Transformer-based Logic

Key Points

Abstract

Cite This Study

Also Consider

Also Consider