What question did this study set out to answer?

This study aims to evaluate the performance of various constraint programming solvers on makespan minimization in job shop scheduling problems.

June 19, 2026 · Open Access

A Comprehensive Benchmark of Constraint Programming Solvers for the Makespan-Minimisation Job Shop Scheduling Problem

Key Points

Evaluated four CP solvers: IBM ILOG CP Optimizer, Google OR-Tools (CP-SAT), Hexaly, and OptalCP.
Conducted evaluations on 332 instances from nine benchmark families within a 600-second time budget.
Analyzed optimality gaps and instance competitiveness using a Friedman test and Nemenyi post hoc comparisons.
OptalCP certified optimality on 191 out of 332 instances (57.5%) with an average optimality gap of 3.55%.
Hexaly excelled on industrial-scale problems and produced the bulk of the 22 new best-known upper bounds and one new best-known lower bound reported in the study.
Solver competitiveness depended sharply on instance size and the n/m ratio, with square instances being the hardest.
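
The gap and ranking analysis named in the key points can be illustrated with a minimal sketch. It assumes the optimality gap is defined as (UB - LB) / UB, which this summary does not state, and it computes the Friedman statistic from average ranks without a tie correction. The function names are hypothetical and the numbers in the demo are made up for illustration; they are not results from the study. The Nemenyi post hoc step is not shown.

```python
def optimality_gap(upper_bound: float, lower_bound: float) -> float:
    """Relative gap in percent, ASSUMING gap = (UB - LB) / UB."""
    if upper_bound <= 0:
        raise ValueError("upper bound must be positive")
    return 100.0 * (upper_bound - lower_bound) / upper_bound


def average_ranks(gaps_by_solver: dict[str, list[float]]) -> dict[str, float]:
    """Rank solvers on each instance (1 = smallest gap) and average the ranks.

    Tied solvers share the mean of the rank positions they occupy.
    """
    solvers = list(gaps_by_solver)
    n_instances = len(next(iter(gaps_by_solver.values())))
    totals = {s: 0.0 for s in solvers}
    for i in range(n_instances):
        row = sorted(solvers, key=lambda s: gaps_by_solver[s][i])
        pos = 0
        while pos < len(row):
            end = pos
            # Extend the group while the next solver has the same gap.
            while (end + 1 < len(row)
                   and gaps_by_solver[row[end + 1]][i] == gaps_by_solver[row[pos]][i]):
                end += 1
            shared_rank = (pos + end) / 2 + 1
            for s in row[pos:end + 1]:
                totals[s] += shared_rank
            pos = end + 1
    return {s: totals[s] / n_instances for s in solvers}


def friedman_statistic(avg_ranks: dict[str, float], n_instances: int) -> float:
    """Friedman chi-square statistic (no tie correction) from average ranks."""
    k = len(avg_ranks)
    sum_sq = sum(r * r for r in avg_ranks.values())
    return 12 * n_instances / (k * (k + 1)) * (sum_sq - k * (k + 1) ** 2 / 4)


if __name__ == "__main__":
    # Hypothetical gaps (percent) for three placeholder solvers on three instances.
    gaps = {
        "solver_a": [0.0, 1.0, 2.0],
        "solver_b": [1.0, 2.0, 3.0],
        "solver_c": [2.0, 3.0, 4.0],
    }
    ranks = average_ranks(gaps)
    print(ranks, friedman_statistic(ranks, n_instances=3))
```

Under the null hypothesis the statistic is compared against a chi-square distribution with k - 1 degrees of freedom, where k is the number of solvers. A significant result is what justifies the pairwise post hoc comparisons.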

Abstract

The job shop scheduling problem (JSSP) is a paradigmatic and strongly NP-hard combinatorial optimisation problem that underpins production planning in modern manufacturing systems, and constraint programming (CP) has become one of the leading methodologies for tackling it. However, comparative studies of CP solvers for the JSSP have so far been restricted to a single benchmark family, a single instance-size range, or a single hardware setting, which limits the practical guidance they offer to both researchers and practitioners. This paper presents a controlled empirical evaluation of four state-of-the-art CP solvers—IBM ILOG CP Optimizer, Google OR-Tools (CP-SAT), Hexaly, and OptalCP—on the makespan-minimisation JSSP. The four engines are run with default parameters and a uniform 600 s wall-clock time budget on 332 instances drawn from nine canonical benchmark families (Fisher–Thompson, Lawrence, Adams–Balas–Zawack, Applegate–Cook, Yamada–Nakano, Storer–Wu–Vaccari, Taillard, Demirkol–Mehta–Uzsoy, and Da Col–Teppan), spanning sizes from 6×6 to 1000×1000 operations. OptalCP emerges as the most robust engine overall, certifying optimality on 191 of the 332 instances (57.5%) with the smallest average optimality gap (3.55%), followed by CP Optimizer (166 optima), OR-Tools (144), and Hexaly (116), while Hexaly dominates on industrial-scale problems and produces the bulk of the 22 new best-known upper bounds and one new best-known lower bound reported here. A Friedman test followed by Nemenyi post hoc comparisons confirms that OptalCP attains significantly smaller optimality gaps than the three other engines (p<0.001). Solver competitiveness depends sharply on instance size and the n/m ratio, with square instances confirmed as the hardest case. 
In practical terms, these findings support an instance-aware approach to CP solver selection: OptalCP is the default choice for small to large instances of moderate aspect ratio, whereas Hexaly is preferable for industrial-scale problems with tens of thousands of operations or extreme n/m ratios, where it is the only engine that reliably returns high-quality feasible schedules within the time budget.
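
For readers less familiar with the objective being benchmarked, the following minimal sketch evaluates the makespan of a job shop schedule. It is not taken from the paper or from any of the four solvers. The instance encoding (each job as an ordered list of `(machine, duration)` pairs), the dispatch-order representation, and the function name `makespan` are all assumptions made for illustration.

```python
def makespan(jobs: list[list[tuple[int, int]]], dispatch_order: list[int]) -> int:
    """Makespan of the schedule obtained by dispatching operations in order.

    jobs[j] is the ordered list of (machine, duration) operations of job j.
    dispatch_order lists job indices; each occurrence of j schedules the next
    unscheduled operation of job j as early as its job and machine allow.
    """
    next_op = [0] * len(jobs)          # index of the next operation per job
    job_ready = [0] * len(jobs)        # time at which each job is free again
    machine_ready: dict[int, int] = {} # time at which each machine is free again

    for j in dispatch_order:
        if next_op[j] >= len(jobs[j]):
            raise ValueError(f"job {j} is dispatched more often than it has operations")
        machine, duration = jobs[j][next_op[j]]
        # An operation starts once both its job predecessor and its machine are done.
        start = max(job_ready[j], machine_ready.get(machine, 0))
        end = start + duration
        job_ready[j] = end
        machine_ready[machine] = end
        next_op[j] += 1

    if any(next_op[j] != len(jobs[j]) for j in range(len(jobs))):
        raise ValueError("dispatch order does not schedule every operation")
    return max(job_ready, default=0)


if __name__ == "__main__":
    # Hypothetical 2-job, 2-machine instance (not from any benchmark family).
    toy = [
        [(0, 3), (1, 2)],  # job 0: machine 0 for 3, then machine 1 for 2
        [(1, 2), (0, 4)],  # job 1: machine 1 for 2, then machine 0 for 4
    ]
    print(makespan(toy, [0, 1, 0, 1]))  # interleaved dispatch
    print(makespan(toy, [0, 0, 1, 1]))  # job 0 fully before job 1
```

A solver's upper bound is the smallest makespan it finds over all feasible schedules, while its lower bound is a value it proves no schedule can beat. Optimality is certified when the two meet.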
