Los puntos clave no están disponibles para este artículo en este momento.
Perception of toxicity evolves over time and often differs between geographies and cultural backgrounds. Similarly, black-box commercially available APIs for detecting toxicity, such as the Perspective API, are not static, but frequently retrained to address any unattended weaknesses and biases. We evaluate the implications of these changes on the reproducibility of findings that compare the relative merits of models and methods that aim to curb toxicity. Our findings suggest that research that relied on inherited automatic toxicity scores to compare models and techniques may have resulted in inaccurate findings. Rescoring all models from HELM, a widely respected living benchmark, for toxicity with the recent version of the API led to a different ranking of widely used foundation models. We suggest caution in applying apples-to-apples comparisons between studies and call for a more structured approach to evaluating toxicity over time.
Building similarity graph...
Analyzing shared references across papers
Loading...
Luiza Pozzobon
Beyza Ermiş
Bahçeşehir University
Patrick A. Lewis
Royal Veterinary College
Building similarity graph...
Analyzing shared references across papers
Loading...
Pozzobon et al. (Sun,) studied this question.
synapsesocial.com/papers/69ffb0e810d6befb257751b6 — DOI: https://doi.org/10.18653/v1/2023.emnlp-main.472
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: