What question did this study set out to answer?

The aim is to improve long-horizon air quality forecasting by addressing issues related to missing or irregular data.

April 7, 2026 · Open Access

Task-Aligned Transformer Imputation for Long-Horizon Air Quality Forecasting

Key Points

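The proposed TILSTM method combines a Transformer-based imputation module with an LSTM forecaster.
A causal horizon boundary restricts imputation to the historical look-back window, and an observed-preserving merge leaves measured values unchanged.
Training is end-to-end with a combined forecasting loss and a self-supervised imputation loss computed on artificially masked observed entries.
The evaluation covers hourly PM10 forecasting from 21 monitoring stations in Slovenia across three forecasting horizons and three missingness regimes.
In pooled error summaries, TILSTM achieves the lowest mean absolute error (MAE) and root mean square error (RMSE) at the 168-hour horizon under the real and near-origin missingness regimes.
At medium horizons, the relative ranking of methods depends more strongly on the missingness regime.
TILSTM shows its clearest and most consistent gains at the 24-hour horizon, and no single method is uniformly best across all long-horizon settings.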
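The two error metrics named in the key points can be illustrated with a minimal sketch. The function names and the PM10 values below are made up for illustration and are not taken from the study.

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error: average size of the forecast miss."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean square error: weights large misses more heavily than MAE."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

# Hypothetical hourly PM10 values (µg/m³), for illustration only.
observed = [20.0, 25.0, 30.0, 35.0]
forecast = [22.0, 24.0, 27.0, 39.0]

print("MAE: ", mae(observed, forecast))
print("RMSE:", rmse(observed, forecast))
```

Because RMSE squares each error before averaging, it is never smaller than MAE on the same data, which is why the two are usually reported together.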

Abstract

Accurate long-horizon air-quality forecasting becomes difficult when historical observations are missing or irregularly sampled because reconstruction errors can propagate into downstream predictions. In this work, we propose the TILSTM method, a task-aligned hybrid architecture that integrates a Transformer-based imputation module with an LSTM forecaster designed to jointly enforce a causal horizon boundary that restricts imputation strictly to the historical look-back window, an observed-preserving merge that leaves measured values unchanged, and a time-aware decay gate applied selectively to imputed positions. The model is trained end-to-end using a combined forecasting loss and a self-supervised imputation loss computed on artificially masked observed entries. We evaluate TILSTM on hourly PM10 forecasting from 21 monitoring stations in Slovenia across three forecasting horizons and three missingness regimes. Among the compared methods, TILSTM shows the clearest and most consistent gains at the 24 h horizon, while at medium horizons, the relative ranking becomes more dependent on the missingness regime. In pooled error summaries, TILSTM achieves the lowest MAE and RMSE at the 168 h horizon under the real and near-origin missingness regimes, while the overall results indicate that no single method is uniformly best across all long-horizon settings.
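Three of the ideas in the abstract (the causal horizon boundary, the observed-preserving merge, and the combined loss) can be sketched in a few lines. This is a simplified illustration, not the authors' implementation: every function name is hypothetical, the imputer output is hard-coded, squared error is an assumed choice for the imputation loss, and the weight `lam` is an assumed parameter that the abstract does not specify.

```python
def split_at_boundary(series, lookback):
    """Causal horizon boundary: only the look-back window is passed to the imputer."""
    return series[:lookback], series[lookback:]

def merge_observed(values, observed_mask, imputed):
    """Observed-preserving merge: keep measured values, fill only the gaps."""
    return [v if m else z for v, m, z in zip(values, observed_mask, imputed)]

def masked_imputation_loss(values, artificial_mask, imputed):
    """Squared error on observed entries that were hidden on purpose (assumed form)."""
    errs = [(v - z) ** 2 for v, a, z in zip(values, artificial_mask, imputed) if a]
    return sum(errs) / len(errs) if errs else 0.0

def combined_loss(forecast_loss, imputation_loss, lam=0.5):
    """Weighted sum of both terms; lam is a hypothetical weight."""
    return forecast_loss + lam * imputation_loss

# Hypothetical series: four look-back hours followed by two target hours.
series = [10.0, None, 14.0, 16.0, 18.0, 20.0]
history, targets = split_at_boundary(series, lookback=4)

observed_mask = [v is not None for v in history]      # True where a value was measured
imputer_output = [10.5, 12.0, 13.0, 16.5]             # stand-in for a Transformer imputer
merged = merge_observed(history, observed_mask, imputer_output)

# Hide one genuinely observed entry (index 2) to create a self-supervised target.
artificial_mask = [False, False, True, False]
imp_loss = masked_imputation_loss(history, artificial_mask, imputer_output)

print(merged)                          # measured values unchanged, gap filled
print(combined_loss(2.0, imp_loss))    # 2.0 is a made-up forecasting loss
```

The merge never overwrites a measured value, so the imputer can only affect positions that were actually missing, and the target hours after the boundary are never shown to it.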


Cite This Study

Vrbančič et al. studied this question.

synapsesocial.com/papers/69d49f1cb33cc4c35a227aad https://doi.org/10.3390/math14071196
