Abstract This paper introduces a novel expectation-maximization (EM) algorithm for estimating general phase-type (PH) distributions from left-truncated and right-censored (LTRC) data, a common challenge in survival analysis. The proposed algorithm is highly efficient with computational complexity that scales with the number of nonzero elements in the generator matrix. This feature makes the estimation of high-dimensional, sparse PH models computationally tractable and enables the practical use of the computationally intensive extended information criterion for model selection. Numerical experiments demonstrate its significant speed advantage over a modern benchmark and the applicability of PH models to complex lifetime data.
Okamura et al. (Tue,) studied this question.