Key points are not available for this paper at this time.
Citation screening is an essential and time-consuming step of the systematic literature review process in medicine. Multiple previous studies have proposed various automation techniques to assist manual annotators in this tedious task. The most widely used measure for the evaluation of automated citation screening techniques is Work Saved over Sampling (WSS). In this work, we analyse this measure and examine its drawbacks. We subsequently propose to normalise WSS which enables citation screening performance comparisons across different systematic reviews. We analytically show that normalised WSS is equivalent to the True Negative Rate (TNR). Finally, we provide benchmark scores for fifteen systematic review datasets with email protected% recall measure and compare the measure with Precision and AUC.
Building similarity graph...
Analyzing shared references across papers
Loading...
Wojciech Kusa
Aldo Lipani
Petr Knoth
SHILAP Revista de lepidopterología
Intelligent Systems with Applications
University College London
TU Wien
The Open University
Building similarity graph...
Analyzing shared references across papers
Loading...
Kusa et al. (Fri,) studied this question.
www.synapsesocial.com/papers/69d7667cd55abd294a48f455 — DOI: https://doi.org/10.1016/j.iswa.2023.200193