What question did this study set out to answer?

The research aims to develop an energy-efficient scheduling framework for elevator systems under varying demand conditions.

March 25, 2026Open Access

Energy-Efficient Resilience Scheduling for Elevator Group Control via Queueing-Based Planning and Safe Reinforcement Learning

Key Points

The research aims to develop an energy-efficient scheduling framework for elevator systems under varying demand conditions.
Developed a two-layer resilience scheduling framework incorporating queueing-based planning and safe reinforcement learning.
Utilized Sample Average Approximation (SAA) and Conditional Value-at-Risk (CVaR) to filter candidates for mode and emergency switching cards.
Formulated online dispatch as a constrained Markov decision process with action masking.
Confirmed reduction in tail risk and accelerated recovery during peak times and surges.
Maintained peak power within prescribed limits throughout all tests.
Demonstrated distributional robustness and generalization across different scenarios.

Abstract

High-rise elevator group control systems operate under pronounced nonstationarity during commuting peaks, post-event surges, and capacity degradation, where the waiting time distribution becomes right-tail heavy and stresses service-level agreements (SLAs) defined by coverage and high-quantile targets. At the same time, the time-of-use tariffs and carbon constraints sharpen the tension between peak-power control, energy savings, and service capacity. This paper proposes a two-layer resilience scheduling framework that integrates queueing-based planning with safe reinforcement learning (RL) fine-tuning. In the planning layer, parsimonious queueing approximations and scenario-based evaluation construct a finite set of implementable mode cards and emergency switching cards; Sample Average Approximation (SAA) combined with Conditional Value-at-Risk (CVaR) constraints filter candidates to enforce tail-risk-aware service limits while keeping power demand within a prescribed envelope. In the execution layer, online dispatch is formulated as a constrained Markov decision process; within the planning layer limits, action masking and Lagrangian safe RL learn small adaptive adjustments to suppress tail-waiting risk and improve recovery dynamics without increasing peak-power commitments. The experiments under morning peaks and post-event surges confirm tail risk reduction and accelerated recovery. For partial outages, the framework prioritizes SLA coverage and recovery speed, accepting a bounded increase in tail risk as a manageable trade-off. Throughout all tests, peak power remains within the prescribed limits. Improvements persist across random seeds and demand fluctuations, indicating distributional robustness and cross-scenario generalization. Ablation studies further reveal complementary roles: removing the planning layer CVaR screening worsens tail performance, while removing the execution layer action masking increases constraint violations and destabilizes recovery.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Zhang et al. (Sat,) studied this question.

synapsesocial.com/papers/69c37b62b34aaaeb1a67dbfd https://doi.org/https://doi.org/10.3390/machines14030352

Bookmark

View Full Paper