What question did this study set out to answer?

The research aims to enhance the adaptation of large language models for classifying faults based on tabular sensor data.

April 22, 2026Open Access

TabEng-QLoRA: Criticality-Aware Tabular-to-Text Adaptation of Large Language Models via Saliency-Guided Quantized Low-Rank Fine-Tuning

Key Points

The research aims to enhance the adaptation of large language models for classifying faults based on tabular sensor data.
Developed a serialization module that organizes sensor data into structured prompts based on critical features.
Implemented a saliency-guided rank allocation mechanism for efficient model adaptation with layer-wise profiling.
Created a domain router for automatic selection of model adapters to optimize performance.
Achieved a mean macro F1 score of 0.908, surpassing standard QLoRA by 10.6%.
Demonstrated 98.1% accuracy with 0.6 ms latency in adapter selection.
Closed 82% of the gap to full fine-tuning memory usage with an efficient processing model.

Abstract

Applying large language models (LLMs) to industrial fault classification is hindered by the mismatch between tabular sensor data and text-based inputs and by the high memory cost of fine-tuning billion-parameter models on edge hardware. This paper presents TabEng-QLoRA, a framework with three contributions: (1) a criticality-aware serialization module that converts tabular sensor records into structured prompts, placing fault-critical features in semantically prominent positions; (2) a saliency-guided rank allocation mechanism that profiles layer-wise activation norms on a 500-sample calibration set and assigns adapter ranks in three tiers (r ∈ 8, 16, 32) ; and (3) a feed-forward domain router for automatic adapter selection (98. 1% accuracy, 0. 6 ms latency). Experiments on three public benchmarks (the AI4I Predictive Maintenance Dataset) using three foundation models (LLaMA-3-8B, Mistral-7B, and Qwen2-7B) show that TabEng-QLoRA achieves a mean macro F1 of 0. 908, a 10. 6% gain over standard QLoRA, within 4. 6–5. 2 GB peak GPU memory. The framework closes 82% of the gap to full fine-tuning, while offering advantages in cross-equipment transfer learning (zero-shot macro F1: 0. 743 vs. 0. 341 for XGBoost retrained on 20% of target-domain data, as XGBoost cannot perform zero-shot transfer). Ablation results confirm statistically significant contributions from all three components (p < 0. 001).

Bookmark

View Full Paper

Bookmark

View Full Paper

TabEng-QLoRA: Criticality-Aware Tabular-to-Text Adaptation of Large Language Models via Saliency-Guided Quantized Low-Rank Fine-Tuning

Key Points

Abstract

Cite This Study