What question did this study set out to answer?

The study asks whether open-source visual-language models can identify building fire hazards at a professional level through optimized, modular prompts alone, without model retraining.

March 14, 2026 · Open Access

Cognitively Guided Hybrid Optimization Method for Visual-Language Models in Building Fire Risk Identification

Key Points

Developed a cognitively guided hybrid-optimization method for visual-language models.
Decomposed professional reasoning into five optimizable modules, grounded in the ACT-R architecture.
Utilized a two-stage Bayesian-genetic procedure for discrete prompt search.
Achieved 90.75% macro-F1 score with 94.96% recall on 612 hazard images.
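Outperformed LoRA fine-tuning (86.35% macro-F1 with 100 training images) and matched proprietary models, using zero training data.
Showed that modular prompt engineering with hybrid optimization can elicit professional-level performance in safety-critical tasks without model retraining.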
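
For readers unfamiliar with the headline metric, the following is a minimal sketch of how macro-F1 and macro-averaged recall are typically computed. It assumes a single-label setting in which each image has exactly one true category. The study's exact evaluation protocol (for example, whether the reported recall is macro-averaged) is not stated here, and the function names are illustrative rather than taken from the paper.

```python
def per_class_counts(y_true, y_pred, labels):
    """Count true positives, false positives, and false negatives per label."""
    counts = {}
    for label in labels:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == label and p != label)
        counts[label] = (tp, fp, fn)
    return counts


def macro_scores(y_true, y_pred, labels):
    """Average per-class recall and F1 with equal weight for every class."""
    recalls, f1s = [], []
    for tp, fp, fn in per_class_counts(y_true, y_pred, labels).values():
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        recalls.append(recall)
        f1s.append(f1)
    n = len(labels)
    return {"macro_recall": sum(recalls) / n, "macro_f1": sum(f1s) / n}
```

Because every class contributes equally to the average, a rare hazard category affects macro-F1 as much as a common one, which is one reason the metric suits imbalanced safety data.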

Abstract

Building fires are pervasive, high-consequence events, yet current inspection workflows remain inefficient. We propose a cognitively guided hybrid-optimization method that operationalizes modular prompt engineering for open-source visual–language models (VLMs) to automate building fire-hazard identification. Grounded in the ACT-R architecture, the approach decomposes professional reasoning into five optimizable modules and searches the discrete prompt space via a two-stage Bayesian–genetic procedure. Evaluated on 612 images spanning four hazard categories (structural damage, evacuation route, fire equipment missing, and debris accumulation), the system achieves 90.75% macro-F1 with 94.96% recall. Using zero training data, it outperforms LoRA fine-tuning (86.35% macro-F1 with 100 training images) while matching proprietary models and retaining the flexibility of open-source VLMs. The results show that methodical prompt modularization and hybrid optimization can elicit professional-level performance in safety-critical tasks without model retraining, providing a scalable and practical computational pipeline for AI-assisted urban building safety supervision.
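
To make the idea of a two-stage search over a discrete, modular prompt space concrete, here is a simplified sketch. It is not the authors' procedure. Stage one uses Thompson sampling over Beta posteriors as a lightweight stand-in for a Bayesian search, and stage two is a basic genetic algorithm seeded with the best stage-one candidates. The number of variants per module, the `toy_score` function, and all other names are hypothetical. In practice the score would come from running the VLM with the assembled prompt on validation images and computing macro-F1.

```python
import random

N_MODULES = 5    # five prompt modules, as in the abstract
N_VARIANTS = 4   # hypothetical number of candidate texts per module
HIDDEN_BEST = (2, 0, 3, 1, 2)  # toy optimum, used only by toy_score


def toy_score(candidate):
    """Toy stand-in for a validation metric in [0, 1] (assumption)."""
    matches = sum(1 for c, b in zip(candidate, HIDDEN_BEST) if c == b)
    return matches / N_MODULES


def bayesian_stage(rng, n_trials=30):
    """Stage 1: Thompson sampling over one Beta posterior per module variant."""
    alpha = [[1.0] * N_VARIANTS for _ in range(N_MODULES)]
    beta = [[1.0] * N_VARIANTS for _ in range(N_MODULES)]
    history = {}
    for _ in range(n_trials):
        # For each module, pick the variant whose sampled quality is highest.
        candidate = tuple(
            max(range(N_VARIANTS),
                key=lambda v: rng.betavariate(alpha[m][v], beta[m][v]))
            for m in range(N_MODULES)
        )
        score = toy_score(candidate)
        history[candidate] = score
        # Treat the score as a fractional success for each chosen variant.
        for m, v in enumerate(candidate):
            alpha[m][v] += score
            beta[m][v] += 1.0 - score
    return history


def genetic_stage(rng, seeds, n_generations=20, pop_size=12, mutation_rate=0.1):
    """Stage 2: elitist genetic algorithm seeded with stage-one candidates."""
    population = list(seeds)
    while len(population) < pop_size:
        population.append(tuple(rng.randrange(N_VARIANTS) for _ in range(N_MODULES)))
    for _ in range(n_generations):
        ranked = sorted(population, key=toy_score, reverse=True)
        elite = ranked[: pop_size // 2]  # the best half survives unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            # Uniform crossover, then per-module mutation.
            child = [rng.choice(pair) for pair in zip(a, b)]
            child = [rng.randrange(N_VARIANTS) if rng.random() < mutation_rate else g
                     for g in child]
            children.append(tuple(child))
        population = elite + children
    return max(population, key=toy_score)


def two_stage_search(seed=0):
    """Run both stages and return the best candidate with its score."""
    rng = random.Random(seed)
    history = bayesian_stage(rng)
    seeds = sorted(history, key=history.get, reverse=True)[:6]
    best = genetic_stage(rng, seeds)
    return best, toy_score(best)
```

Each candidate is a tuple holding one variant index per module, so the assembled prompt would be the concatenation of the selected module texts. Because the elite half is carried over unchanged, the final score in this sketch is never worse than the best candidate found in stage one.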
