Predictive analytics in education holds promise for supporting student learning but poses risks of bias potentially disadvantaging certain groups based on demographic attributes such as gender. Prior work on fairness in educational prediction has largely focused on binary outcomes in higher education, leaving continuous predictions, such as standardized test scores, comparatively understudied, especially in early schooling where consequences can be long-lasting. Despite a few fairness metrics available for continuous outcomes, they often rely on mean differences, obscuring distributional disparities that may disadvantage specific students. Even fewer studies have attempted to address bias in continuous predictions. This study addresses these gaps by introducing a fairness evaluation framework grounded in statistical distance measures with two adapted metrics, AreaPDF and AreaCDF, that capture group disparities across the full distribution of continuous predicted outcomes. We evaluate these metrics in the context of predicting national standardized test scores in early childhood education and further develop five reweighting-based data-balancing methods to mitigate bias in continuous prediction tasks. Empirical analyses show that the proposed metrics reveal biases overlooked by traditional measures and that the reweighting strategies substantially improve fairness, underscoring the importance of aligning debiasing methods with demographic and score-distribution characteristics. • Measurement of bias in continuous educational predictions should go beyond simple group averages to account for difference across the entire distribution between groups, as the proposed AreaCDF and AreaPDF metrics do, offering a finer-grained view of disparities in predictions that empowers stakeholders with greater flexibility to act on these insights. • In continuous educational predictions, bias more likely arises from the complex interaction between demographic group representation and score distributions, rather than demographic factors alone. Importantly, achieving balanced demographic group representation does not guarantee fair predictive performance across groups. • There is no one-size-fits-all data balancing methods for improving predictive fairness. Fairness interventions should be adapted to the specific data characteristics and predictive models at hand, taking into account whether demographic imbalances are global, local, or intertwined with skewed score distributions.
Building similarity graph...
Analyzing shared references across papers
Loading...
L. Li
Monash University
Namrata Srivatava
Jia Rong
Monash University
Computers and Education Artificial Intelligence
Vanderbilt University
Monash University
Jinan University
Building similarity graph...
Analyzing shared references across papers
Loading...
Li et al. (Sun,) studied this question.
synapsesocial.com/papers/69c8c1f4de0f0f753b39c276 — DOI: https://doi.org/10.1016/j.caeai.2026.100585