What question did this study set out to answer?

The aim is to enhance cloud task scheduling by overcoming local optima limitations through a novel hybrid algorithm.

April 22, 2026Open Access

D3QN-Guided Sand Cat Swarm Optimization with Hybrid Exploration for Multi-Objective Cloud Task Scheduling

Key Points

The aim is to enhance cloud task scheduling by overcoming local optima limitations through a novel hybrid algorithm.
Developed a hybrid algorithm combining multi-objective sand cat swarm optimization and D3QN.
Conducted 50 independent experiments in a simulated cloud environment.
Analyzed convergence processes and Pareto front results to assess performance.
MoSCO achieved an average resource utilization of 92.20%.
Reduced average maximum Makespan to 528 and Tardiness to 4187.
Demonstrated superior performance with stable solutions and effective handling of conflicting objectives.

Abstract

Task scheduling in cloud computing environments is a complex NP-hard problem that requires maximizing resource utilization while satisfying quality-of-service (QoS) constraints. Traditional meta-heuristic algorithms often become stuck in local optima, while single deep reinforcement learning (DRL) models exhibit instability when exploring large-scale solution spaces. To address this, this paper proposes a hybrid scheduling algorithm based on multi-objective sand cat colony optimization (MoSCO). This algorithm utilizes a D3QN network to extract task features and guide population initialization, followed by a multi-objective Sand Cat Swarm Optimization (SCSO) algorithm for refined local search. Results from 50 independent replicate experiments conducted in a simulated cloud environment, coupled with an analysis of the dynamic convergence process, demonstrate that MoSCO exhibits significant superiority and robustness. Scatter plot convergence analysis further confirms that MoSCO’s knowledge injection mechanism effectively overcomes the blind exploration phase of traditional algorithms and successfully breaks through the local optimum bottleneck in the late iteration stages of single reinforcement learning, achieving higher-quality, denser, and more stable convergence. Furthermore, 3D and 2D Pareto front analyses show that MoSCO generates highly competitive, well-distributed non-dominated solutions, offering flexible trade-off options for conflicting objectives. Compared to PureD3QN, H-SCSO, and NSGA-II, MoSCO exhibits the smallest performance fluctuations in box plots. Specifically, MoSCO elevates the average resource utilization of clusters to 92.20%, while reducing the average maximum Makespan and Tardiness to 528 and 4187, respectively. Experimental data confirm that MoSCO effectively balances global exploration with local exploitation, delivering stable, high-quality solutions for dynamic cloud task scheduling.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Shao et al. (Mon,) studied this question.

synapsesocial.com/papers/69e8656e6e0dea528dde9e71 https://doi.org/https://doi.org/10.3390/a19040321

Bookmark

View Full Paper