January 1, 2020 · Open Access

Dice Loss for Data-imbalanced NLP Tasks

Abstract

Many NLP tasks, such as tagging and machine reading comprehension (MRC), suffer from severe data imbalance: negative examples significantly outnumber positive ones, and the huge number of easy-negative examples overwhelms training. The most commonly used criterion, cross entropy, is in effect accuracy-oriented, which creates a discrepancy between training and test: at training time, each training instance contributes equally to the objective function, while at test time the F1 score is more concerned with positive examples.
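The gap between an accuracy-oriented objective and F1 can be illustrated with a small sketch. The toy labels and the two predictors below are hypothetical and are not taken from the paper, and the snippet does not implement the paper's dice loss. It only shows that a predictor which ignores every positive example can score higher on accuracy than one that finds most of them, while scoring far lower on F1.

```python
def accuracy(y_true, y_pred):
    """Fraction of examples whose predicted label matches the gold label."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def f1_score(y_true, y_pred):
    """F1 of the positive class (label 1); returns 0.0 when there are no true positives."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical imbalanced data: 95 negatives followed by 5 positives.
y_true = [0] * 95 + [1] * 5

# Predictor A labels everything negative.
all_negative = [0] * 100

# Predictor B raises 6 false alarms, finds 4 of the 5 positives, misses 1.
some_positive = [1] * 6 + [0] * 89 + [1] * 4 + [0] * 1

print(accuracy(y_true, all_negative), f1_score(y_true, all_negative))    # 0.95 0.0
print(accuracy(y_true, some_positive), f1_score(y_true, some_positive))  # 0.93 and roughly 0.533
```

Predictor A wins on accuracy (0.95 versus 0.93) yet has an F1 of zero, which is the kind of mismatch between training signal and test metric that the abstract describes.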

Cite This Study

Li et al. studied this question.

synapsesocial.com/papers/69d9a35ba1d151c65f684b3a https://doi.org/10.18653/v1/2020.acl-main.45
