Key points are not available for this paper at this time.
Risk stratification is an essential tool in the fight against many diseases, including chronic kidney disease. Recent work has focused on applying techniques from machine learning and leveraging the information contained in a patient’s electronic health record (EHR). Irregular intervals between data entries and the large number of variables tracked in EHR datasets can make them challenging to work with. Many of the difficulties associated with these datasets can be overcome by using large language models, such as bidirectional encoder representations from transformers (BERT). Previous attempts to apply BERT to EHR for risk stratification have shown promise. In this work we propose HERBERT, a novel application of BERT to EHR data. We identify two key areas where BERT models must be modified to adapt them to EHR data, namely: the embedding layer and the pretraining task. We show how changes to these can lead to improved performance, relative to the previous state of the art. We evaluate our model by predicting the transition of chronic kidney disease patients to end stage renal disease. The strong performance of our model justifies our architectural changes and suggests that large language models could play an important role in future renal risk stratification.
Building similarity graph...
Analyzing shared references across papers
Loading...
Alex Moore
Peacocks Medical Group (United Kingdom)
Bastien Orset
École Polytechnique Fédérale de Lausanne
Arrash Yassaee
Imperial College London
ACM Transactions on Computing for Healthcare
University of Oxford
Imperial College London
Oxford BioMedica (United Kingdom)
Building similarity graph...
Analyzing shared references across papers
Loading...
Moore et al. (Fri,) studied this question.
synapsesocial.com/papers/68e5fb7ab6db64358758f980 — DOI: https://doi.org/10.1145/3665899