Abstract The use of machine learning (ML) models in forensic anthropology (FA) has increased in the last half decade; however, there is a lack of a standardized protocol on how to curate, use, and assess ML models. We introduce PUMAA (A Protocol for Utilizing Machine Learning in Forensic Anthropological Analyses), which includes a flowchart and a checklist that forensic practitioners can follow when creating, using, or assessing ML models for forensic research. The five factors that must be assessed in determining an ML model's true performance are explored. The most common types of supervised ML models are also explained in lay terms and accompanied by visuals to increase accessibility of these complex concepts. Although not exhaustive, examples of information that should be reported have been discussed for seven types of ML models. In addition, their various strengths and limitations are evaluated in order to equip researchers with the necessary tools to make decisions regarding when and how to use ML models. This protocol provides an initial standard for the use of ML in FA.
Faisal et al. (Wed,) studied this question.