This editorial discusses the transition from the original EuroSCORE to EuroSCORE II, emphasizing the importance of updating clinical prediction models while noting potential limitations regarding correlated variables and interaction terms.
Physicians need to make predictions on the prognosis of a treatment that helps them in the choice of therapy. Medicine used to be much more subjective than in the current evidence-based era. Shared decision-making, where physicians and patients both participate in deciding on choices for therapy, is also more common 1. Clinical prediction models like EuroSCORE may provide the evidence-based input for shared decision-making by providing an estimate of the operative risk of patients undergoing cardiac surgery. An ideal clinical model would be something simple dividing the patients into ‘good’ and ‘bad’ without further specification of the survival chances. The original EuroSCORE was a compromise between the ‘statistical ideal’ and the ‘clinical ideal’. It was developed from a large multinational European population and was a model predicting mortality based on 17 variables, either from a logistic regression equation or from an additive model. Numerous institutions throughout the world have tested and validated EuroSCORE. Loss of calibration with the additive and logistic EuroSCORE has been observed by many investigators and an update of the EuroSCORE was warranted. One possible reason for the poor calibration of the original EuroSCORE score is that the score was developed from patients undergoing surgery almost 20 years ago. As surgical and perioperative care evolves and the impact of clinical variables change, prediction models therefore require revision. These factors may also vary between institutions and it is well known that the quality of care and comorbidities of patients differs between countries. The original EuroSCORE already identified major differences in the risk profile of national samples 2. This is therefore one of the major concerns with EuroSCORE II: 154 hospitals from 43 countries participated, of which many were outside Europe 3. One may, therefore, question whether the term EuroSCORE is still valid or another name should be used that reflects the fact that so many countries outside Europe participated. With this in mind, it becomes even more important that, as indicated by the authors, units and surgeons calculate their own risk-adjusted mortality ratio. The model is probably more reliable in the prediction of death over a wide range of risk groups rather than the prediction of the vital status of an individual patient. Another reason for the poor calibration in the original EuroSCORE might be that a large number of risk factors in the model are highly correlated. It is important to recognize correlation between predicting variables, as the additional risk contribution of certain variables can in some part be explained by the effect of other variables. Some predicting variables may also be more important for some types of operations then for others. The large number of risk factors with potential interaction may overestimate risk in certain categories of patients (e.g. intermediate risk or extreme risk). It is therefore a pity that the authors have not explored possible interaction terms in the new
Kappetein et al. (Wed,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: